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NEISSERIA MENINGITIDIS ANTIGENS AND COMPOSITIONS 

This application is a continuation-in-part of the following U.S. Provisional Patent 
applications, from each of which priority is claimed, and each of which is incorporated by 
reference in its entirety: 60/083,758 (filed May 1, 1998); 60/094,869 (filed July 31, 1998); 
60/098,994 (filed September 2, 1998); 60/099,062 (filed September 2, 1998); 60/103,749 
(filed October 9, 1998); 60/103,794 (filed October 9, 1998); 60/103,796 (filed October 9, 
1998); and 60/121,528 (filed February 25, 1999). 

This invention relates to antigens from the bacterial species: Neisseria meningitidis 
and Neisseria gonorrhoeae. 

BACKGROUND 

Neisseria meningitidis is a non-motile, gram negative diplococcus human pathogen. It 
colonizes the pharynx, causing meningitis and, occasionally, septicaemia in the absence of 
meningitis. It is closely related to N gonorrhoea, although one feature that clearly 
differentiates meningococcus from gonococcus is the presence of a polysaccharide capsule 
that is present in all pathogenic meningococci. 

N. meningitidis causes both endemic and epidemic disease. In the United States the 
attack rate is 0.6-1 per 100,000 persons per year, and it can be much greater during outbreaks, 
(see Lieberman et al (1996) Safety and Immunogenicity of a Serogroups AJC Neisseria 
meningitidis Oligosaccharide-Protein Conjugate Vaccine in Young Children. JAMA 
275(19):1499-1503; Schuchat et al (1997) Bacterial Meningitis in the United States in 1995. 
N EnglJ Med 337(14):970-976). In developing countries, endemic disease rates are much 
higher and during epidemics incidence rates can reach 500 cases per 100,000 persons per 
year. Mortality is extremely high, at 10-20% in the United States, and much higher in 
developing countries. Following the introduction of the conjugate vaccine against 
Haemophilus influenzae, N meningitidis is the major cause of bacterial meningitis at all ages 
in the United States (Schuchat et al (1997) supra). 

Based on the organism's capsular polysaccharide, 12 serogroups of N meningitidis 
have been identified. Group A is the pathogen most often implicated in epidemic disease in 
sub-Saharan Africa. Serogroups B and C are responsible for the vast majority of cases in the 
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United States and in most developed countries. Serogroups W135 and Y are responsible for 
the rest of the cases in the United States and developed countries. The meningococcal vaccine 
currently in use is a tetravalent polysaccharide vaccine composed of serogroups A, C, Y and 
W135. Although efficacious in adolescents and adults, it induces a poor immune response and 
short duration of protection, and cannot be used in infants [eg. Morbidity and Mortality 
weekly report, Vol.46, No. RR-5 (1997)]. This is because polysaccharides are T-cell 
independent antigens that induce a weak immune response that cannot be boosted by repeated 
immunization. Following the success of the vaccination against H.influenzae, conjugate 
vaccines against serogroups A and C have been developed and are at the final stage of clinical 
testing (Zollinger WD "New and Improved Vaccines Against Meningococcal Disease". In: 
New Generation Vaccines, supra, pp. 469-488; Lieberman et al (1996) supra; Costantino et al 
(1992) Development and phase I clinical testing of a conjugate vaccine against 
meningococcus A and C. Vaccine 10:691-698). 

Meningococcus B (menB) remains a problem, however. This serotype currently is 
responsible for approximately 50% of total meningitis in the United States, Europe, and South 
America. The polysaccharide approach cannot be used because the menB capsular 
polysaccharide is a polymer of ct(2-8)-linked A^-acetyl neuraminic acid that is also present in 
mammalian tissue. This results in tolerance to the antigen; indeed, if an immune response 
were elicited, it would be anti-self, and therefore undesirable. In order to avoid induction of 
autoimmunity and to induce a protective immune response, the capsular polysaccharide has, 
for instance, been chemically modified substituting the A^-acetyl groups with 7V-propionyl 
groups, leaving the specific antigenicity unaltered (Romero & Outschoorn (1994) Current 
status of Meningococcal group B vaccine candidates: capsular or non-capsular? Clin 
Microbiol Rev 7(4):559-575). 

Alternative approaches to menB vaccines have used complex mixtures of outer 
membrane proteins (OMPs), containing either the OMPs alone, or OMPs enriched in porins, 
or deleted of the class 4 OMPs that are believed to induce antibodies that block bactericidal 
activity. This approach produces vaccines that are not well characterized. They are able to 
protect against the homologous strain, but are not effective at large where there are many 
antigenic variants of the outer membrane proteins. To overcome the antigenic variability, 
multivalent vaccines containing up to nine different porins have been constructed {eg. 
Poolman JT (1992) Development of a meningococcal vaccine. Infect. Agents Dis. 4:13-28). 
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Additional proteins to be used in outer membrane vaccines have been the opa and opc 
proteins, but none of these approaches have been able to overcome the antigenic variability 
(eg. Ala'Aldeen & Borriello (1996) The meningococcal transferrin-binding proteins 1 and 2 
are both surface exposed and generate bactericidal antibodies capable of killing homologous 
and heterologous strains. Vaccine 14(l):49-53). 

A certain amount of sequence data is available for meningococcal and gonoccocal 
genes and proteins (e.g. EP-A-0467714, W096/29412), but this is by no means complete. 
Other men B proteins may include those listed in WO 97/28273, WO 96/29412, WO 
95/03413, US 5,439,808, and US 5,879,686. 

The provision of further sequences could provide an opportunity to identify secreted 
or surface-exposed proteins that are presumed targets for the immune system and which are 
not antigenically variable. For instance, some of the identified proteins could be components 
of efficacious vaccines against meningococcus B, some could be components of vaccines 
against all meningococcal serotypes, and others could be components of vaccines against all 
pathogenic Neisseriae including Neisseria meningitidis or Neisseria gonorrhoeae. Those 
sequences specific to N. meningitidis or N. gonorrhoeae that are more highly conserved are 
further preferred sequences. 

It is thus an object of the invention is to provide Neisserial DNA sequences which 
encode proteins that are antigenic or immunogenic. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 illustrates the products of protein expression and purification of the predicted 
ORF 919 as cloned and expressed in E. coli. 

Fig. 2 illustrates the products of protein expression and purification of the predicted 
ORF 279 as cloned and expressed in E. coli. 

Fig. 3 illustrates the products of protein expression and purification of the predicted 
ORF 576-1 as cloned and expressed in E. coli. 

Fig. 4 illustrates the products of protein expression and purification of the predicted 
ORF 519-1 as cloned and expressed in E. coli. 

Fig. 5 illustrates the products of protein expression and purification of the predicted 
ORF 121-1 as cloned and expressed in E. coli. 
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Fig. 6 illustrates the products of protein expression and purification of the predicted 
ORF 128-1 as cloned and expressed in E. coli. 

Fig. 7 illustrates the products of protein expression and purification of the predicted 
ORF 206 as cloned and expressed in E. coli. 

Fig. 8 illustrates the products of protein expression and purification of the predicted 
ORF 287 as cloned and expressed in£. coli. 

Fig. 9 illustrates the products of protein expression and purification of the predicted 
ORF 406 as cloned and expressed in E. coli. 

Fig. 10 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the 
products of protein expression the predicted ORF 919 as cloned and expressed in E. coli. 

Fig. 1 1 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the 
products of protein expression the predicted ORF 279 as cloned and expressed in E. coli. 

Fig. 12 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the 
products of protein expression the predicted ORF 576-1 as cloned and expressed in E. coli. 

Fig. 13 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the 
products of protein expression the predicted ORF 519-1 as cloned and expressed in E. coli. 

Fig. 14 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the 
products of protein expression the predicted ORF 121-1 as cloned and expressed in E. coli. 

Fig. 15 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the 
products of protein expression the predicted ORF 128-1 as cloned and expressed in E. coli. 

Fig. 16 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the 
products of protein expression the predicted ORF 206 as cloned and expressed in E. coli. 

Fig. 17 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the 
products of protein expression the predicted ORF 287 as cloned and expressed in E. coli. 

Fig. 18 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the 
products of protein expression the predicted ORF 406 as cloned and expressed in E. coli. 

Fig. 19 shows an alignment comparison of amino acid sequences for ORF 225 for 
several strains of Neisseria. Dark shading indicates regions of homology, and gray shading 
indicates the conservation of amino acids with similar characteristics. The Figure 
demonstrates a high degree of conservation among the various strains, further confirming its 
utility as an antigen for both vaccines and diagnostics. 
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Fig. 20 shows an alignment comparison of amino acid sequences for ORF 235 for 
several strains of Neisseria. Dark shading indicates regions of homology, and gray shading 
indicates the conservation of amino acids with similar characteristics. The Figure 
demonstrates a high degree of conservation among the various strains, further confirming its 
utility as an antigen for both vaccines and diagnostics. 

Fig. 21 shows an alignment comparison of amino acid sequences for ORF 287 for 
several strains of Neisseria. Dark shading indicates regions of homology, and gray shading 
indicates the conservation of amino acids with similar characteristics. The Figure 
demonstrates a high degree of conservation among the various strains, further confirming its 
utility as an antigen for both vaccines and diagnostics. 

Fig. 22 shows an alignment comparison of amino acid sequences for ORF 519 for 
several strains of Neisseria. Dark shading indicates regions of homology, and gray shading 
indicates the conservation of amino acids with similar characteristics. The Figure 
demonstrates a high degree of conservation among the various strains, further confirming its 
utility as an antigen for both vaccines and diagnostics. 

Fig. 23 shows an alignment comparison of amino acid sequences for ORF 919 for 
several strains of Neisseria. Dark shading indicates regions of homology, and gray shading 
indicates the conservation of amino acids with similar characteristics. The Figure 
demonstrates a high degree of conservation among the various strains, further confirming its 
utility as an antigen for both vaccines and diagnostics. 

THE INVENTION 

The invention provides proteins comprising the N. meningitidis amino acid sequences 
and N. gonorrhoeae amino acid sequences disclosed in the examples. 

It also provides proteins comprising sequences homologous (i.e., those having 
sequence identity) to the TV. meningitidis amino acid sequences disclosed in the examples. 
Depending on the particular sequence, the degree of homology (sequence identity) is 
preferably greater than 50% (eg. 60%, 70%, 80%, 90%, 95%, 99% or more). These proteins 
include mutants and allelic variants of the sequences disclosed in the examples. Typically, 
50% identity or more between two proteins is considered to be an indication of functional 
equivalence. Identity between proteins is preferably determined by the Smith- Waterman 
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homology search algorithm as implemented in MPSRCH program (Oxford Molecular) using 
an affine gap search with parameters: gap penalty 1 2, gap extension penalty 1 . 

The invention further provides proteins comprising fragments of the N. meningitidis 
amino acid sequences and N. gonorrhoeae amino acid sequences disclosed in the examples. 
The fragments should comprise at least n consecutive amino acids from the sequences and, 
depending on the particular sequence, n is 7 or more {eg. 8, 10, 12, 14, 16, 18, 20 or more). 
Preferably the fragments comprise an epitope from the sequence. 

The proteins of the invention can, of course, be prepared by various means {eg. 
recombinant expression, purification from cell culture, chemical synthesis etc.) and in various 
forms {eg. native, fusions etc.). They are preferably prepared in substantially pure or isolated 
form {ie. substantially free from other N. meningitidis or N. gonorrhoeae host cell proteins) 

According to a further aspect, the invention provides antibodies which bind to these 
proteins. These may be polyclonal or monoclonal and may be produced by any suitable 
means. 

According to a further aspect, the invention provides nucleic acid comprising the 
N. meningitidis nucleotide sequences and N. gonorrhoeae nucleotide sequences disclosed in 
the examples. 

According to a further aspect, the invention comprises nucleic acids having sequence 
identity of greater than 50% (e.g., 60%, 70%, 80%, 90%, 95%, 99% or more) to the nucleic 
acid sequences herein. Sequence identity is determined as above-discussed. 

According to a further aspect, the invention comprises nucleic acid that hybridizes to 
the sequences provided herein. Conditions for hybridization are set forth herein. 

Nucleic acid comprising fragments of these sequences are also provided. These should 
comprise at least n consecutive nucleotides from the N. meningitidis sequences or N. 
gonorrhoeae sequences and depending on the particular sequence, n is 10 or more {eg 12, 14, 
15, 18, 20, 25, 30, 35, 40 or more). 

According to a further aspect, the invention provides nucleic acid encoding the 
proteins and protein fragments of the invention. 

It should also be appreciated that the invention provides nucleic acid comprising 
sequences complementary to those described above {eg. for antisense or probing purposes). 

Nucleic acid according to the invention can, of course, be prepared in many ways {eg. 
by chemical synthesis, in part or in whole, from genomic or cDNA libraries, from the 
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organism itself etc.) and can take various forms (eg. single stranded, double stranded, vectors, 
probes etc.). 

In addition, the term "nucleic acid" includes DNA and RNA, and also their analogues, 
such as those containing modified backbones, and also protein nucleic acids (PNA) etc. 

According to a further aspect, the invention provides vectors comprising nucleotide 
sequences of the invention (eg. expression vectors) and host cells transformed with such 
vectors. 

According to a further aspect, the invention provides compositions comprising protein, 
antibody, and/or nucleic acid according to the invention. These compositions may be suitable 
as vaccines, for instance, or as diagnostic reagents or as immunogenic compositions. 

The invention also provides nucleic acid, protein, or antibody according to the 
invention for use as medicaments (eg. as vaccines) or as diagnostic reagents. It also provides 
the use of nucleic acid, protein, or antibody according to the invention in the manufacture of 
(I) a medicament for treating or preventing infection due to Neisserial bacteria (ii) a 
diagnostic reagent for detecting the presence of Neisserial bacteria or of antibodies raised 
against Neisserial bacteria or (iii) for raising antibodies. Said Neisserial bacteria may be any 
species or strain (such as K gonorrhoeae) but are preferably N. meningitidis, especially strain 
B or strain C. 

The invention also provides a method of treating a patient, comprising administering 
to the patient a therapeutically effective amount of nucleic acid, protein, and/or antibody 
according to the invention. 

According to further aspects, the invention provides various processes. 

A process for producing proteins of the invention is provided, comprising the step of 
culturing a host cell according to the invention under conditions which induce protein 
expression. 

A process for detecting polynucleotides of the invention is provided, comprising the 
steps of: (a) contacting a nucleic probe according to the invention with a biological sample 
under hybridizing conditions to form duplexes; and (b) detecting said duplexes. 

A process for detecting proteins of the invention is provided, comprising the steps of: 
(a) contacting an antibody according to the invention with a biological sample under 
conditions suitable for the formation of an antibody-antigen complexes; and (b) detecting 
said complexes. 
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Having now generally described the inventiion, the same will be more readily 
understood through reference to the following examples which are provided by way of 
illustration, and are not intended to be limiting of the present invention, unless specified. 

Methodology - Summary of standard procedures and techniques. 
General 

This invention provides Neisseria meningitidis menB nucleotide sequences, amino 
acid sequences encoded therein. With these disclosed sequences, nucleic acid probe assays 
and expression cassettes and vectors can be produced. The expression vectors can be 
transformed into host cells to produce proteins. The purified or isolated polypeptides (which 
may also be chemically synthesized) can be used to produce antibodies to detect menB 
proteins. Also, the host cells or extracts can be utilized for biological assays to isolate 
agonists or antagonists. In addition, with these sequences one can search to identify open 
reading frames and identify amino acid sequences. The proteins may also be used in 
immunogenic compositions, antigenic compositions and as vaccine components. 

The practice of the present invention will employ, unless otherwise indicated, 
conventional techniques of molecular biology, microbiology, recombinant DNA, and 
immunology, which are within the skill of the art. Such techniques are explained fully in the 
literature e.g., Sambrook Molecular Cloning; A Laboratory Manual, Second Edition (1989); 
DNA Cloning, Volumes I and ii (D.N Glover ed. 1985); Oligonucleotide Synthesis (M.J. Gait 
ed, 1984); Nucleic Acid Hybridization (B.D. Hames & S.J. Higgins eds. 1984); Transcription 
and Translation (B.D. Hames & S.J. Higgins eds. 1984); Animal Cell Culture (R.I. Freshney 
ed. 1986); Immobilized Cells and Enzymes (IRL Press, 1986); B. Perbal, A Practical Guide to 
Molecular Cloning (1984); the Methods in Enzymology series (Academic Press, Inc.), 
especially volumes 154 & 155; Gene Transfer Vectors for Mammalian Cells (J.H. Miller and 
M.P. Calos eds. 1987, Cold Spring Harbor Laboratory); Mayer and Walker, eds. (1987), 
Immunochemical Methods in Cell and Molecular Biology (Academic Press, London); Scopes, 
(1987) Protein Purification: Principles and Practice, Second Edition (Springer- Verlag, 
N.Y.), and Handbook of Experimental Immunology, Volumes I-TV (D.M. Weir and C.C. 
Blackwell eds 1986). 

Standard abbreviations for nucleotides and amino acids are used in this specification. 
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All publications, patents, and patent applications cited herein are incorporated in full 
by reference. 

Expression systems 

The Neisseria menB nucleotide sequences can be expressed in a variety of different 
expression systems; for example those used with mammalian cells, plant cells, baculoviruses, 
bacteria, and yeast. 

i. Mammalian Systems 

Mammalian expression systems are known in the art. A mammalian promoter is any 
DNA sequence capable of binding mammalian RNA polymerase and initiating the 
downstream (3 1 ) transcription of a coding sequence (e.g., structural gene) into mRNA. A 
promoter will have a transcription initiating region, which is usually placed proximal to the 5' 
end of the coding sequence, and a TATA box, usually located 25-30 base pairs (bp) upstream 
of the transcription initiation site. The TATA box is thought to direct RNA polymerase II to 
begin RNA synthesis at the correct site. A mammalian promoter will also contain an 
upstream promoter element, usually located within 100 to 200 bp upstream of the TATA box. 
An upstream promoter element determines the rate at which transcription is initiated and can 
act in either orientation (Sambrook et al. (1989) "Expression of Cloned Genes in Mammalian 
Cells." In Molecular Cloning: A Laboratory Manual 2nd ed.). 

Mammalian viral genes are often highly expressed and have a broad host range; 
therefore sequences encoding mammalian viral genes provide particularly useful promoter 
sequences. Examples include the SV40 early promoter, mouse mammary tumor virus LTR 
promoter, adenovirus major late promoter (Ad MLP), and herpes simplex virus promoter. In 
addition, sequences derived from non-viral genes, such as the murine metallothionein gene, 
also provide useful promoter sequences. Expression may be either constitutive or regulated 
(inducible). Depending on the promoter selected, many promotes may be inducible using 
known substrates, such as the use of the mouse mammary tumor virus (MMTV) promoter 
with the glucocorticoid responsive element (GRE) that is induced by glucocorticoid in 
hormone-responsive transformed cells (see for example, U.S. Patent 5,783,681). 

The presence of an enhancer element (enhancer), combined with the promoter 
elements described above, will usually increase expression levels. An enhancer is a 
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regulatory DNA sequence that can stimulate transcription up to 1000-fold when linked to 
homologous or heterologous promoters, with synthesis beginning at the normal RNA start 
site. Enhancers are also active when they are placed upstream or downstream from the 
transcription initiation site, in either normal or flipped orientation, or at a distance of more 
than 1000 nucleotides from the promoter (Maniatis et al. (1987) Science 236:1237; Alberts et 
al. (1989) Molecular Biology of the Cell, 2nd ed.). Enhancer elements derived from viruses 
may be particularly useful, because they usually have a broader host range. Examples include 
the SV40 early gene enhancer (Dijkema et al (1985) EMBO J. 4:761) and the 
enhancer/promoters derived from the long terminal repeat (LTR) of the Rous Sarcoma Virus 
(Gorman et al. (1982b) Proc. Natl. Acad. ScL 79:6111) and from human cytomegalovirus 
(Boshart et al. (1985) Cell 41:521). Additionally, some enhancers are regulatable and become 
active only in the presence of an inducer, such as a hormone or metal ion (Sassone-Corsi and 
Borelli (1986) Trends Genet. 2:215; Maniatis et al. (1987) Science 236:1237). 

A DNA molecule may be expressed intracellularly in mammalian cells. A promoter 
sequence may be directly linked with the DNA molecule, in which case the first amino acid at 
the N-terminus of the recombinant protein will always be a methionine, which is encoded by 
the ATG start codon. If desired, the N-terminus may be cleaved from the protein by in vitro 
incubation with cyanogen bromide. 

Alternatively, foreign proteins can also be secreted from the cell into the growth media 
by creating chimeric DNA molecules that encode a fusion protein comprised of a leader 
sequence fragment that provides for secretion of the foreign protein in mammalian cells. 
Preferably, there are processing sites encoded between the leader fragment and the foreign 
gene that can be cleaved either in vivo or in vitro. The leader sequence fragment usually 
encodes a signal peptide comprised of hydrophobic amino acids which direct the secretion of 
the protein from the cell. The adenovirus tripartite leader is an example of a leader sequence 
that provides for secretion of a foreign protein in mammalian cells. 

Usually, transcription termination and polyadenylation sequences recognized by 
mammalian cells are regulatory regions located 3 1 to the translation stop codon and thus, 
together with the promoter elements, flank the coding sequence. The 3* terminus of the 
mature mRNA is formed by site-specific post-transcriptional cleavage and polyadenylation 
(Birnstiel et al. (1985) Cell 47:349; Proudfoot and Whitelaw (1988) "Termination and 3' end 
processing of eukaryotic RNA. In Transcription and splicing (ed. B.D. Hames and D.M. 
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Glover); Proudfoot (1989) Trends Biochem. ScL 14:105). These sequences direct the 
transcription of an mRNA which can be translated into the polypeptide encoded by the DNA. 
Examples of transcription terminator/polyadenylation signals include those derived from 
SV40 (Sambrook et al (1989) "Expression of cloned genes in cultured mammalian cells." In 
Molecular Cloning: A Laboratory Manual). 

Usually, the above described components, comprising a promoter, polyadenylation 
signal, and transcription termination sequence are put together into expression constructs. 
Enhancers, introns with functional splice donor and acceptor sites, and leader sequences may 
also be included in an expression construct, if desired. Expression constructs are often 
maintained in a replicon, such as an extrachromosomal element (e.g., plasmids) capable of 
stable maintenance in a host, such as mammalian cells or bacteria. Mammalian replication 
systems include those derived from animal viruses, which require trans-acting factors to 
replicate. For example, plasmids containing the replication systems of papovaviruses, such as 
SV40 (Gluzman (1981) Cell 23:175) or polyomavirus, replicate to extremely high copy 
number in the presence of the appropriate viral T antigen. Additional examples of 
mammalian replicons include those derived from bovine papillomavirus and Epstein-Barr 
virus. Additionally, the replicon may have two replication systems, thus allowing it to be 
maintained, for example, in mammalian cells for expression and in a prokaryotic host for 
cloning and amplification. Examples of such mammalian-bacteria shuttle vectors include 
pMT2 (Kaufinan et al. (1989) Mol Cell Biol. P:946) and pHEBO (Shimizu et al. (1986) Moi 
Cell Biol 6:1074). 

The transformation procedure used depends upon the host to be transformed. Methods 
for introduction of heterologous polynucleotides into mammalian cells are known in the art 
and include dextran-mediated transfection, calcium phosphate precipitation, polybrene 
mediated transfection, protoplast fusion, electroporation, encapsulation of the 
polynucleotide(s) in liposomes, and direct microinjection of the DNA into nuclei. 

Mammalian cell lines available as hosts for expression are known in the art and 
include many immortalized cell lines available from the American Type Culture Collection 
(ATCC), including but not limited to, Chinese hamster ovary (CHO) cells, HeLa cells, baby 
hamster kidney (BHK) cells, monkey kidney cells (COS), human hepatocellular carcinoma 
cells (e.g., Hep G2), and a number of other cell lines. 
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ii. Plant Cellular Expression Systems 

There are many plant cell culture and whole plant genetic expression systems known 
in the art. Exemplary plant cellular genetic expression systems include those described in 
patents, such as: U.S. 5,693,506; US 5,659,122; and US 5,608,143. Additional examples of 
genetic expression in plant cell culture has been described by Zenk, Phytochemistry 30:3861- 
3863 (1991). Descriptions of plant protein signal peptides may be found in addition to the 
references described above in Vaulcombe et al., Mol Gen. Genet. 209:33-40 (1987); Chandler 
et al., Plant Molecular Biology 3:407-418 (1984); Rogers, J. Biol Chem. 260:3731-3738 
(1985); Rothstein et al., Gene 55:353-356 (1987); Whittier et al., Nucleic Acids Research 
15:2515-2535 (1987); Wirsel et aL, Molecular Microbiology 3:3-14 (1989); Yu et al., Gene 
122:247-253 (1992). A description of the regulation of plant gene expression by the 
phytohormone, gibberellic acid and secreted enzymes induced by gibberellic acid can be 
found in R.L. Jones and J. MacMillin, Gibberellins: in: Advanced Plant Physiology,. Malcolm 
B. Wilkins, ed., 1984 Pitman Publishing Limited, London, pp. 21-52. References that 
describe other metabolically-regulated genes: Sheen, Plant Cell, 2:1027-1038(1990); Maas et 
al., EMBOJ. 9:3447-3452 (1990); Benkel and Hickey, Proc. Natl Acad. Scl 84:1337-1339 
(1987) 

Typically, using techniques known in the art, a desired polynucleotide sequence is 
inserted into an expression cassette comprising genetic regulatory elements designed for 
operation in plants. The expression cassette is inserted into a desired expression vector with 
companion sequences upstream and downstream from the expression cassette suitable for 
expression in a plant host. The companion sequences will be of plasmid or viral origin and 
provide necessary characteristics to the vector to permit the vectors to move DNA from an 
original cloning host, such as bacteria, to the desired. plant host. The basic bacterial/plant 
vector construct will preferably provide a broad host range prokaryote replication origin; a 
prokaryote selectable marker; and, for Agrobacterium transformations, T DNA sequences for 
Agrobacterium-mediated transfer to plant chromosomes. Where the heterologous gene is not 
readily amenable to detection, the construct will preferably also have a selectable marker gene 
suitable for determining if a plant cell has been transformed. A general review of suitable 
markers, for example for the members of the grass family, is found in Wilmink and Dons, 
1993, Plant Mol Biol Reptr, 1 1(2): 165-185. 
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Sequences suitable for permitting integration of the heterologous sequence into the 
plant genome are also recommended. These might include transposon sequences and the like 
for homologous recombination as well as Ti sequences which permit random insertion of a 
heterologous expression cassette into a plant genome. Suitable prokaryote selectable markers 
include resistance toward antibiotics such as ampicillin or tetracycline. Other DNA sequences 
encoding additional functions may also be present in the vector, as is known in the art. 

The nucleic acid molecules of the subject invention may be included into an 
expression cassette for expression of the protein(s) of interest. Usually, there will be only one 
expression cassette, although two or more are feasible. The recombinant expression cassette 
will contain in addition to the heterologous protein encoding sequence the following 
elements, a promoter region, plant 5 1 untranslated sequences, initiation codon depending upon 
whether or not the structural gene comes equipped with one, and a transcription and 
translation termination sequence. Unique restriction enzyme sites at the 5' and 3' ends of the 
cassette allow for easy insertion into a pre-existing vector. 

A heterologous coding sequence may be for any protein relating to the present 
invention. The sequence encoding the protein of interest will encode a signal peptide which 
allows processing and translocation of the protein, as appropriate, and will usually lack any 
sequence which might result in the binding of the desired protein of the invention to a 
membrane. Since, for the most part, the transcriptional initiation region will be for a gene 
which is expressed and translocated during germination, by employing the signal peptide 
which provides for translocation, one may also provide for translocation of the protein of 
interest. In this way, the protein(s) of interest will be translocated from the cells in which they 
are expressed and may be efficiently harvested. Typically secretion in seeds are across the 
aleurone or scutellar epithelium layer into the endosperm of the seed. While it is not required 
that the protein be secreted from the cells in which the protein is produced, this facilitates the 
isolation and purification of the recombinant protein. 

Since the ultimate expression of the desired gene product will be in a eucaryotic cell it 
is desirable to determine whether any portion of the cloned gene contains sequences which 
will be processed out as introns by the host's splicosome machinery. If so, site-directed 
mutagenesis of the "intron" region may be conducted to prevent losing a portion of the genetic 
message as a false intron code, Reed and Maniatis, Cell 41:95-105, 1985. 
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The vector can be microinjected directly into plant cells by use of micropipettes to 
mechanically transfer the recombinant DNA. Crossway, MoL Gen. Genet, 202:179-185, 
1985. The genetic material may also be transferred into the plant cell by using polyethylene 
glycol, Krens, et al., Nature, 296, 72-74, 1982. Another method of introduction of nucleic 
acid segments is high velocity ballistic penetration by small particles with the nucleic acid 
either within the matrix of small beads or particles, or on the surface, Klein, et al., Nature, 
327, 70-73, 1987 and Knudsen and Muller, 1991, Planta, 185:330-336 teaching particle 
bombardment of barley endosperm to create transgenic barley. Yet another method of 
introduction would be fusion of protoplasts with other entities, either minicells, cells, 
lysosomes or other fusible lipid- surfaced bodies, Fraley, et al., Proc. Natl, Acad. Sci. USA, 79, 
1859-1863, 1982. 

The vector may also be introduced into the plant cells by electroporation. (Fromm et 
al., Proc. Natl Acad. Sci. USA 82:5824, 1985). In this technique, plant protoplasts are 
electroporated in the presence of plasmids containing the gene construct. Electrical impulses 
of high field strength reversibly permeabilize biomembranes allowing the introduction of the 
plasmids. Electroporated plant protoplasts reform the cell wall, divide, and form plant callus. 

All plants from which protoplasts can be isolated and cultured to give whole 
regenerated plants can be transformed by the present invention so that whole plants are 
recovered which contain the transferred gene. It is known that practically all plants can be 
regenerated from cultured cells or tissues, including but not limited to all major species of 
sugarcane, sugar beet, cotton, fruit and other trees, legumes and vegetables. Some suitable 
plants include, for example, species from the genera Fragaria, Lotus, Medicago, Onobrychis, 
Trifolium, Trigonella, Vigna, Citrus, Linum, Geranium, Manihot, Daucus, Arabidopsis, 
Brassica, Raphanus, Sinapis, Atropa, Capsicum, Datura, Hyoscyamus, Lycopersion, 
Nicotiana, Solanum, Petunia, Digitalis, Majorana, Cichorium, Helianthus, Lactuca, Bromus, 
Asparagus, Antirrhinum, Hererocallis, Nemesia, Pelargonium, Panicum, Pennisetum, 
Ranunculus, Senecio, Salpiglossis, Cucumis, Browaalia, Glycine, Lolium, Zea, Triticum, 
Sorghum, and Datura. 

Means for regeneration vary from species to species of plants, but generally a 
suspension of transformed protoplasts containing copies of the heterologous gene is first 
provided. Callus tissue is formed and shoots may be induced from callus and subsequently 
rooted. Alternatively, embryo formation can be induced from the protoplast suspension. 
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These embryos germinate as natural embryos to form plants. The culture media will generally 
contain various amino acids and hormones, such as auxin and cytokinins. It is also 
advantageous to add glutamic acid and proline to the medium, especially for such species as 
corn and alfalfa. Shoots and roots normally develop simultaneously. Efficient regeneration 
will depend on the medium, on the genotype, and on the history of the culture. If these three 
variables are controlled, then regeneration is fully reproducible and repeatable. 

In some plant cell culture systems, the desired protein of the invention may be 
excreted or alternatively, the protein may be extracted from the whole plant. Where the 
desired protein of the invention is secreted into the medium, it may be collected. 
Alternatively, the embryos and embryoless-half seeds or other plant tissue may be 
mechanically disrupted to release any secreted protein between cells and tissues. The mixture 
may be suspended in a buffer solution to retrieve soluble proteins. Conventional protein 
isolation and purification methods will be then used to purify the recombinant protein. 
Parameters of time, temperature pH, oxygen, and volumes will be adjusted through routine 
methods to optimize expression and recovery of heterologous protein. 

iii. Baculovirus Systems 

The polynucleotide encoding the protein can also be inserted into a suitable insect 
expression vector, and is operably linked to the control elements within that vector. Vector 
construction employs techniques which are known in the art. Generally, the components of 
the expression system include a transfer vector, usually a bacterial plasmid, which contains 
both a fragment of the baculovirus genome, and a convenient restriction site for insertion of 
the heterologous gene or genes to be expressed; a wild type baculovirus with a sequence 
homologous to the baculovirus-specific fragment in the transfer vector (this allows for the 
homologous recombination of the heterologous gene in to the baculovirus genome); and 
appropriate insect host cells and growth media. 

After inserting the DNA sequence encoding the protein into the transfer vector, the 
vector and the wild type viral genome are transfected into an insect host cell where the vector 
and viral genome are allowed to recombine. The packaged recombinant virus is expressed 
and recombinant plaques are identified and purified. Materials and methods for 
baculovirus/insect cell expression systems are commercially available in kit form from, inter 
alia, Invitrogen, San Diego CA ("MaxBac" kit). These techniques are generally known to 
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those skilled in the art and fully described in Summers and Smith, Texas Agricultural 
Experiment Station Bulletin No. 1555 (1987) (hereinafter "Summers and Smith"). 

Prior to inserting the DNA sequence encoding the protein into the baculovirus 
genome, the above described components, comprising a promoter, leader (if desired), coding 
sequence of interest, and transcription termination sequence, are usually assembled into an 
intermediate transplacement construct (transfer vector). This construct may contain a single 
gene and operably linked regulatory elements; multiple genes, each with its owned set of 
operably linked regulatory elements; or multiple genes, regulated by the same set of 
regulatory elements. Intermediate transplacement constructs are often maintained in a 
replicon, such as an extrachromosomal element (e.g., plasmids) capable of stable maintenance 
in a host, such as a bacterium. The replicon will have a replication system, thus allowing it to 
be maintained in a suitable host for cloning and amplification. 

Currently, the most commonly used transfer vector for introducing foreign genes into 
AcNPV is pAc373. Many other vectors, known to those of skill in the art, have also been 
designed. These include, for example, pVL985 (which alters the polyhedrin start codon from 
ATG to ATT, and which introduces a BamHI cloning site 32 basepairs downstream from the 
ATT; see Luckow and Summers, Virology (1989) 77:31. 

The plasmid usually also contains the polyhedrin polyadenylation signal (Miller et al. 
(1988) Ann. Rev. Microbiol, 42:111) and a prokaryotic ampicillin-resistance (amp) gene and 
origin of replication for selection and propagation in E. coli. 

Baculovirus transfer vectors usually contain a baculovirus promoter. A baculovirus 
promoter is any DNA sequence capable of binding a baculovirus RNA polymerase and 
initiating the downstream (5* to 3 f ) transcription of a coding sequence (e.g., structural gene) 
into mRNA. A promoter will have a transcription initiation region which is usually placed 
proximal to the 5' end of the coding sequence. This transcription initiation region usually 
includes an RNA polymerase binding site and a transcription initiation site. A baculovirus 
transfer vector may also have a second domain called an enhancer, which, if present, is 
usually distal to the structural gene. Expression may be either regulated or constitutive. 

Structural genes, abundantly transcribed at late times in a viral infection cycle, provide 
particularly useful promoter sequences. Examples include sequences derived from the gene 
encoding the viral polyhedron protein, Friesen et al, (1986) "The Regulation of Baculovirus 
Gene Expression," in: The Molecular Biology of Baculoviruses (ed. Walter Doerfler); EPO 
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Publ. Nos. 127 839 and 155 476; and the gene encoding the plO protein, Vlak et al., (1988), J. 
Gen. Virol 69:765. 

DNA encoding suitable signal sequences can be derived from genes for secreted insect 
or baculovirus proteins, such as the baculovirus polyhedrin gene (Carbonell et al. (1988) 
Gene, 75:409). Alternatively, since the signals for mammalian cell posttranslational 
modifications (such as signal peptide cleavage, proteolytic cleavage, and phosphorylation) 
appear to be recognized by insect cells, and the signals required for secretion and nuclear 
accumulation also appear to be conserved between the invertebrate cells and vertebrate cells, 
leaders of non-insect origin, such as those derived from genes encoding human (alpha) a- 
interferon, Maeda et al., (1985), Nature 375:592; human gastrin-releasing peptide, Lebacq- 
Verheyden et al., (1988), Molec. Cell Biol 5:3129; human IL-2, Smith et al., (1985) Proc. 
NaVlAcad. Set USA, 52:8404; mouse IL-3, (Miyajima et al., (1987) Gene 55:273; and human 
glucocerebrosidase, Martin et al. (1988) DNA, 7:99 ; can also be used to provide for secretion 
in insects. 

A recombinant polypeptide or polyprotein may be expressed intracellularly or, if it is 
expressed with the proper regulatory sequences, it can be secreted. Good intracellular 
expression of nonfused foreign proteins usually requires heterologous genes that ideally have 
a short leader sequence containing suitable translation initiation signals preceding an ATG 
start signal. If desired, methionine at the N-terminus may be cleaved from the mature protein 
by in vitro incubation with cyanogen bromide. 

Alternatively, recombinant polyproteins or proteins which are not naturally secreted 
can be secreted from the insect cell by creating chimeric DNA molecules that encode a fusion 
protein comprised of a leader sequence fragment that provides for secretion of the foreign 
protein in insects. The leader sequence fragment usually encodes a signal peptide comprised 
of hydrophobic amino acids which direct the translocation of the protein into the endoplasmic 
reticulum. 

After insertion of the DNA sequence and/or the gene encoding the expression product 
precursor of the protein, an insect cell host is co-transformed with the heterologous DNA of 
the transfer vector and the genomic DNA of wild type baculovirus — usually by co- 
transfection. The promoter and transcription termination sequence of the construct will 
usually comprise a 2-5kb section of the baculovirus genome. Methods for introducing 
heterologous DNA into the desired site in the baculovirus virus are known in the art. (See 
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Summers and Smith supra; Ju et al. (1987); Smith et al., Mol Cell Biol (1983) 5:2156; and 
Luckow and Summers (1989)). For example, the insertion can be into a gene such as the 
polyhedrin gene, by homologous double crossover recombination; insertion can also be into a 
restriction enzyme site engineered into the desired baculovirus gene. Miller et al., (1989), 
Bioessays 4:91 . The DNA sequence, when cloned in place of the polyhedrin gene in the 
expression vector, is flanked both 5' and 3' by polyhedrin-specific sequences and is positioned 
downstream of the polyhedrin promoter. 

The newly formed baculovirus expression vector is subsequently packaged into an 
infectious recombinant baculovirus. Homologous recombination occurs at low frequency 
(between about 1% and about 5%); thus, the majority of the virus produced after 
cotransfection is still wild-type virus. Therefore, a method is necessary to identify 
recombinant viruses. An advantage of the expression system is a visual screen allowing 
recombinant viruses to be distinguished. The polyhedrin protein, which is produced by the 
native virus, is produced at very high levels in the nuclei of infected cells at late times after 
viral infection. Accumulated polyhedrin protein forms occlusion bodies that also contain 
embedded particles. These occlusion bodies, up to 15 \xm in size, are highly refractile, giving 
them a bright shiny appearance that is readily visualized under the light microscope. Cells 
infected with recombinant viruses lack occlusion bodies. To distinguish recombinant virus 
from wild-type virus, the transfection supernatant is plaqued onto a monolayer of insect cells 
by techniques known to those skilled in the art. Namely, the plaques are screened under the 
light microscope for the presence (indicative of wild-type virus) or absence (indicative of 
recombinant virus) of occlusion bodies. Current Protocols in Microbiology Vol. 2 (Ausubel 
et al. eds) at 16.8 (Supp. 10, 1990); Summers and Smith, supra; Miller et al. (1989). 

Recombinant baculovirus expression vectors have been developed for infection into 
several insect cells. For example, recombinant baculoviruses have been developed for, inter 
alia: Aedes aegypti , Autographa californica, Bombyx mori, Drosophila melanogaster, 
Spodopterafrugiperda, and Trichoplusia ni (PCT Pub. No. WO 89/046699; Carbonell et al., 
(1985) J. Virol 56:153; Wright (1986) Nature 327:718; Smith et al., (1983) Mol Cell Biol 
5:2156; and see generally, Fraser, et al (1989) In Vitro Cell Dev. Biol 25:225). 

Cells and cell culture media are commercially available for both direct and fusion 
expression of heterologous polypeptides in a baculovirus/expression system; cell culture 
technology is generally known to those skilled in the art. See, e.g., Summers and Smith supra. 
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The modified insect cells may then be grown in an appropriate nutrient medium, 
which allows for stable maintenance of the plasmid(s) present in the modified insect host. 
Where the expression product gene is under inducible control, the host may be grown to high 
density, and expression induced. Alternatively, where expression is constitutive, the product 
will be continuously expressed into the medium and the nutrient medium must be 
continuously circulated, while removing the product of interest and augmenting depleted 
nutrients. The product may be purified by such techniques as chromatography, e.g., HPLC, 
affinity chromatography, ion exchange chromatography, etc.; electrophoresis; density gradient 
centrifugation; solvent extraction, or the like. As appropriate, the product may be further 
purified, as required, so as to remove substantially any insect proteins which are also secreted 
in the medium or result from lysis of insect cells, so as to provide a product which is at least 
substantially free of host debris, e.g., proteins, lipids and polysaccharides. 

In order to obtain protein expression, recombinant host cells derived from the 
transformants are incubated under conditions which allow expression of the recombinant 
protein encoding sequence. These conditions will vary, dependent upon the host cell selected. 
However, the conditions are readily ascertainable to those of ordinary skill in the art, based 
upon what is known in the art. 

iv. Bacterial Systems 

Bacterial expression techniques are known in the art. A bacterial promoter is any DNA 
sequence capable of binding bacterial RNA polymerase and initiating the downstream (3') 
transcription of a coding sequence (e.g. structural gene) into mRNA. A promoter will have a 
transcription initiation region which is usually placed proximal to the 5 ! end of the coding 
sequence. This transcription initiation region usually includes an RNA polymerase binding 
site and a transcription initiation site. A bacterial promoter may also have a second domain 
called an operator, that may overlap an adjacent RNA polymerase binding site at which RNA 
synthesis begins. The operator permits negative regulated (inducible) transcription, as a gene 
repressor protein may bind the operator and thereby inhibit transcription of a specific gene. 
Constitutive expression may occur in the absence of negative regulatory elements, such as the 
operator. In addition, positive regulation may be achieved by a gene activator protein binding 
sequence, which, if present is usually proximal (5') to the RNA polymerase binding sequence. 
An example of a gene activator protein is the catabolite activator protein (CAP), which helps 
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initiate transcription of the lac operon in Escherichia coli (E. coli) (Raibaud et al. (1984) 
Annu. Rev, Genet. J 8:173), Regulated expression may therefore be either positive or negative, 
thereby either enhancing or reducing transcription. 

Sequences encoding metabolic pathway enzymes provide particularly useful promoter 
sequences. Examples include promoter sequences derived from sugar metabolizing enzymes, 
such as galactose, lactose (lac) (Chang et al. (1977) Nature 795:1056), and maltose. 
Additional examples include promoter sequences derived from biosynthetic enzymes such as 
tryptophan (trp) (Goeddel et aL (1980) Nuc. Acids Res. 5:4057; Yelverton et al. (1981) Nucl. 
Acids Res. 9:731; U.S. Patent 4,738,921; EPO Publ. Nos. 036 776 and 121 775). The beta- 
lactamase (bid) promoter system (Weissmann (1981) "The cloning of interferon and other 
mistakes. " In Interferon 3 (ed. I. Gresser)), bacteriophage lambda PL (Shimatake et al. (1981) 
Nature 292:128) and T5 (U.S. Patent 4,689,406) promoter systems also provide useful 
promoter sequences. 

In addition, synthetic promoters which do not occur in nature also function as bacterial 
promoters. For example, transcription activation sequences of one bacterial or bacteriophage 
promoter may be joined with the operon sequences of another bacterial or bacteriophage 
promoter, creating a synthetic hybrid promoter (U.S. Patent 4,551,433). For example, the tac 
promoter is a hybrid trp-lac promoter comprised of both trp promoter and lac operon 
sequences that is regulated by the lac repressor (Amann et al. (1983) Gene 25:167; de Boer et 
aL (1983) Proc. Natl. Acad. Sci. 50:21). Furthermore, abacterial promoter can include 
naturally occurring promoters of non-bacterial origin that have the ability to bind bacterial 
RNA polymerase and initiate transcription. A naturally occurring promoter of non-bacterial 
origin can also be coupled with a compatible RNA polymerase to produce high levels of 
expression of some genes in prokaryotes. The bacteriophage T7 RNA polymerase/promoter 
system is an example of a coupled promoter system (Studier et aL (1986) J. MoL Biol 
759:1 13; Tabor et aL (1985) Proc Natl. Acad. Sci. 52:1074). In addition, a hybrid promoter 
can also be comprised of a bacteriophage promoter and an E. coli operator region (EPO Publ. 
No. 267 851). 

In addition to a functioning promoter sequence, an efficient ribosome binding site is 
also useful for the expression of foreign genes in prokaryotes. InE. coli 9 the ribosome binding 
site is called the Shine-Dalgarno (SD) sequence and includes an initiation codon (ATG) and a 
sequence 3-9 nucleotides in length located 3-11 nucleotides upstream of the initiation codon 
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(Shine et al. (1975) Nature 254:34). The SD sequence is thought to promote binding of 
mRNA to the ribosome by the pairing of bases between the SD sequence and the 3' end of 
E. coli 16S rRNA (Steitz et al. (1979) "Genetic signals and nucleotide sequences in 
messenger RNA." In Biological Regulation and Development: Gene Expression (ed. R.F. 
Goldberger)). To express eukaryotic genes and prokaryotic genes with weak ribosome- 
binding site, it is often necessary to optimize the distance between the SD sequence and the 
ATG of the eukaryotic gene (Sambrook et al. (1989) "Expression of cloned genes in 
Escherichia coli." In Molecular Cloning: A Laboratory Manual). 

A DNA molecule may be expressed intracellularly. A promoter sequence may be 
directly linked with the DNA molecule, in which case the first amino acid at the N-terminus 
will always be a methionine, which is encoded by the ATG start codon. If desired, methionine 
at the N-terminus may be cleaved from the protein by in vitro incubation with cyanogen 
bromide or by either in vivo or in vitro incubation with a bacterial methionine N-terminal 
peptidase (EPO Publ. No. 219 237). 

Fusion proteins provide an alternative to direct expression. Usually, a DNA sequence 
encoding the N-terminal portion of an endogenous bacterial protein, or other stable protein, is 
fused to the 5* end of heterologous coding sequences. Upon expression, this construct will 
provide a fusion of the two amino acid sequences. For example, the bacteriophage lambda cell 
gene can be linked at the 5' terminus of a foreign gene and expressed in bacteria. The resulting 
fusion protein preferably retains a site for a processing enzyme (factor Xa) to cleave the 
bacteriophage protein from the foreign gene (Nagai et al (1984) Nature 309:810). Fusion 
proteins can also be made with sequences from the lacZ (Jia et aL (1987) Gene 60:197), trpE 
(Allen et al. (1987) J. Biotechnol. 5:93; Makoff et aL (1989) Gen. Microbiol 135:1 1), and 
Chey (EPO Publ. No. 324 647) genes. The DNA sequence at the junction of the two amino 
acid sequences may or may not encode a cleavable site. Another example is a ubiquitin fusion 
protein. Such a fusion protein is made with the ubiquitin region that preferably retains a site 
for a processing enzyme (e.g. ubiquitin specific processing-protease) to cleave the ubiquitin 
from the foreign protein. Through this method, native foreign protein can be isolated (Miller 
et aL (1989) Bio/Technology 7:698). 

Alternatively, foreign proteins can also be secreted from the cell by creating chimeric 
DNA molecules that encode a fusion protein comprised of a signal peptide sequence fragment 
that provides for secretion of the foreign protein in bacteria (U.S. Patent 4,336,336). The 
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signal sequence fragment usually encodes a signal peptide comprised of hydrophobic amino 
acids which direct the secretion of the protein from the cell. The protein is either secreted into 
the growth media (gram-positive bacteria) or into the periplasmic space, located between the 
inner and outer membrane of the cell (gram-negative bacteria). Preferably there are processing 
sites, which can be cleaved either in vivo or in vitro encoded between the signal peptide 
fragment and the foreign gene. 

DNA encoding suitable signal sequences can be derived from genes for secreted 
bacterial proteins, such as the E. coli outer membrane protein gene (ompA) (Masui et al. 
(1983), in: Experimental Manipulation of Gene Expression ; Ghrayeb et al. (1984) EMBO J, 
3:2437) and the E. coli alkaline phosphatase signal sequence (phoA) (Oka et aL (1985) Proc. 
Natl Acad. Sci. £2:7212). As an additional example, the signal sequence of the alpha-amylase 
gene from various Bacillus strains can be used to secrete heterologous proteins from B. 
subtilis (Palva et aL (1982) Proc. Natl. Acad. Sci. USA 79:5582; EPO Publ. No. 244 042). 

Usually, transcription termination sequences recognized by bacteria are regulatory 
regions located 3* to the translation stop codon, and thus together with the promoter flank the 
coding sequence. These sequences direct the transcription of an mRNA which can be 
translated into the polypeptide encoded by the DNA. Transcription termination sequences 
frequently include DNA sequences of about 50 nucleotides capable of forming stem loop 
structures that aid in terminating transcription. Examples include transcription termination 
sequences derived from genes with strong promoters, such as the trp gene in E. coli as well as 
other biosynthetic genes. 

Usually, the above described components, comprising a promoter, signal sequence (if 
desired), coding sequence of interest, and transcription termination sequence, are put together 
into expression constructs. Expression constructs are often maintained in a replicon, such as 
an extrachromosomal element (e.g., plasmids) capable of stable maintenance in a host, such as 
bacteria. The replicon will have a replication system, thus allowing it to be maintained in a 
prokaryotic host either for expression or for cloning and amplification. In addition, a replicon 
may be either a high or low copy number plasmid. A high copy number plasmid will 
generally have a copy number ranging from about 5 to about 200, and usually about 10 to 
about 150. A host containing a high copy number plasmid will preferably contain at least 
about 10, and more preferably at least about 20 plasmids. Either a high or low copy number 
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vector may be selected, depending upon the effect of the vector and the foreign protein on the 
host. 

Alternatively, the expression constructs can be integrated into the bacterial genome 
with an integrating vector. Integrating vectors usually contain at least one sequence 
homologous to the bacterial chromosome that allows the vector to integrate. Integrations 
appear to result from recombinations between homologous DNA in the vector and the 
bacterial chromosome. For example, integrating vectors constructed with DNA from various 
Bacillus strains integrate into the Bacillus chromosome (EPO Publ. No. 127 328). Integrating 
vectors may also be comprised of bacteriophage or transposon sequences. 

Usually, extrachromosomal and integrating expression constructs may contain 
selectable markers to allow for the selection of bacterial strains that have been transformed. 
Selectable markers can be expressed in the bacterial host and may include genes which render 
bacteria resistant to drugs such as ampicillin, chloramphenicol, erythromycin, kanamycin 
(neomycin), and tetracycline (Davies et aL (1978) Annu, Rev. Microbiol. 52:469). Selectable 
markers may also include biosynthetic genes, such as those in the histidine, tryptophan, and 
leucine biosynthetic pathways. 

Alternatively, some of the above described components can be put together in 
transformation vectors. Transformation vectors are usually comprised of a selectable market 
that is either maintained in a replicon or developed into an integrating vector, as described 
above. 

Expression and transformation vectors, either extra-chromosomal replicons or 
integrating vectors, have been developed for transformation into many bacteria. For example, 
expression vectors have been developed for, inter alia, the following bacteria: Bacillus 
subtilis (Palva et aL (1982) Proa Natl. Acad. Sci. USA 79:5582; EPO Publ. Nos. 036 259 and 
063 953; PCT Publ. No. WO 84/04541), Escherichia coli (Shimatake et aL (1981) Nature 
292:128; Amann et aL (1985) Gene 40:183; Studier et aL (1986) J. Mol. BioL 189:113; EPO 
Publ. Nos. 036 776, 136 829 and 136 907), Streptococcus cremoris (Powell et aL (1988) Appl. 
Environ. Microbiol. 54:655); Streptococcus lividans (Powell et aL (1988) Appl. Environ. 
Microbiol. 54:655), Streptomyces lividans (U.S. Patent 4,745,056). 

Methods of introducing exogenous DNA into bacterial hosts are well-known in the art, 
and usually include either the transformation of bacteria treated with CaCh or other agents, 
such as divalent cations and DMSO. DNA can also be introduced into bacterial cells by 
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electroporation. Transformation procedures usually vary with the bacterial species to be 
transformed. (See e.g., use of Bacillus: Masson et al (1989) FEMS Microbiol Lett. 60:273 ; 
Palva et al. (1982) Proc. Natl Acad. ScL USA 79:5582; EPO Publ. Nos. 036 259 and 063 953; 
PCT Publ. No. WO 84/04541; use of Campylobacter: Miller et al (1988) Proc. Natl Acad. 
Sci. 55:856; and Wang et al (1990) J. Bacteriol 7 72:949; use of Escherichia coli: Cohen et 
al (1973) Proc. Natl Acad. Sci. (59:2110; Dower al (1988) Nucleic Acids Res. 1 6:6127; 
Kushner (1978) "An improved method for transformation of Escherichia coli with ColEl- 
derived plasmids. In Genetic Engineering: Proceedings of the International Symposium on 
Genetic Engineering (eds. H.W. Boyer and S. Nicosia); Mandel et al. (1970) J. Mol Biol 
53:159; Taketo (1988) Biochim. Biophys. Acta 949:318; use of Lactobacillus: Chassy et al. 
(1987) FEMS Microbiol Lett. 44:113; use of Pseudomonas: Fiedler et al (1988) Anal 
Biochem 7 70:38; use of Staphylococcus: Augustin et al. (1990) FEMS Microbiol. Lett. 
66:203; use of Streptococcus: Barany et al. (1980) J. Bacteriol 144:698; Harlander (1987) 
"Transformation of Streptococcus lactis by electroporation, in: Streptococcal Genetics (ed. J. 
Ferretti and R. Curtiss HI); Perry et al (1981) Infect. Immun. 32:1295; Powell et al. (1988) 
Appl Environ. Microbiol. 54:655; Somkuti et al. (1987) Proc. 4th Evr. Cong. Biotechnology 
7:412. 

v. Yeast Expression 

Yeast expression systems are also known to one of ordinary skill in the art. A yeast 
promoter is any DNA sequence capable of binding yeast RNA polymerase and initiating the 
downstream (3 1 ) transcription of a coding sequence (e.g. structural gene) into mRNA. A 
promoter will have a transcription initiation region which is usually placed proximal to the 5' 
end of the coding sequence. This transcription initiation region usually includes an RNA 
polymerase binding site (the "TATA Box") and a transcription initiation site. A yeast 
promoter may also have a second domain called an upstream activator sequence (UAS), 
which, if present, is usually distal to the structural gene. The UAS permits regulated 
(inducible) expression. Constitutive expression occurs in the absence of a UAS. Regulated 
expression may be either positive or negative, thereby either enhancing or reducing 
transcription. 

Yeast is a fermenting organism with an active metabolic pathway, therefore sequences 
encoding enzymes in the metabolic pathway provide particularly useful promoter sequences. 
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Examples include alcohol dehydrogenase (ADH) (EPO Publ. No. 284 044), enolase, 
glucokinase, glucose-6-phosphate isomerase, glyceraldehyde-3-phosphate-dehydrogenase 
(GAP or GAPDH), hexokinase, phosphofructokinase, 3-phosphoglycerate mutase, and 
pyruvate kinase (PyK) (EPO Publ. No. 329 203). The yeast PH05 gene, encoding acid 
phosphatase, also provides useful promoter sequences (Myanohara et al. (1983) Proc. Natl. 
Acad. Sci. USA 80:1). 

In addition, synthetic promoters which do not occur in nature also function as yeast 
promoters. For example, UAS sequences of one yeast promoter may be joined with the 
transcription activation region of another yeast promoter, creating a synthetic hybrid 
promoter. Examples of such hybrid promoters include the ADH regulatory sequence linked to 
the GAP transcription activation region (U.S. Patent Nos. 4,876,197 and 4,880,734). Other 
examples of hybrid promoters include promoters which consist of the regulatory sequences of 
either the ADH2, GAL4, GAL10, OR PHOS genes, combined with the transcriptional 
activation region of a glycolytic enzyme gene such as GAP or PyK (EPO Publ. No. 164 556). 
Furthermore, a yeast promoter can include naturally occurring promoters of non-yeast origin 
that have the ability to bind yeast RNA polymerase and initiate transcription. Examples of 
such promoters include, inter alia, (Cohen et al. (1980) Proc. Natl. Acad. Sci. USA 77: 1 078; 
Henikoff et al. (1981) Nature 255:835; Hollenberg et -a/. (1981) Curr. Topics Microbiol 
Immunol. 95:119; Hollenberg et al. (1979) "The Expression of Bacterial Antibiotic Resistance 
Genes in the Yeast Saccharomyces cerevisiae," in: Plasmids of Medical, Environmental and 
Commercial Importance (eds. K.N. Timmis and A. Puhler); Mercerau-Puigalon et al. (1980) 
Gene 77:163; Panthier et al. (1980) Curr. Genet. 2:109;). 

A DNA molecule may be expressed intracellularly in yeast. A promoter sequence may 
be directly linked with the DNA molecule, in which case the first amino acid at the N- 
terminus of the recombinant protein will always be a methionine, which is encoded by the 
ATG start codon. If desired, methionine at the N-terminus may be cleaved from the protein by 
in vitro incubation with cyanogen bromide. 

Fusion proteins provide an alternative for yeast expression systems, as well as in 
mammalian, plant, baculovirus, and bacterial expression systems. Usually, a DNA sequence 
encoding the N-terminal portion of an endogenous yeast protein, or other stable protein, is 
fused to the 5' end of heterologous coding sequences. Upon expression, this construct will 
provide a fusion of the two amino acid sequences. For example, the yeast or human 
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superoxide dismutase (SOD) gene, can be linked at the 5* terminus of a foreign gene and 
expressed in yeast. The DNA sequence at the junction of the two amino acid sequences may 
or may not encode a cleavable site. See e.g., EPO Publ. No. 196056. Another example is a 
ubiquitin fusion protein. Such a fusion protein is made with the ubiquitin region that 
preferably retains a site for a processing enzyme (e.g. ubiquitin- specific processing protease) 
to cleave the ubiquitin from the foreign protein. Through this method, therefore, native 
foreign protein can be isolated (e.g., WO88/024066). 

Alternatively, foreign proteins can also be secreted from the cell into the growth media 
by creating chimeric DNA molecules that encode a fusion protein comprised of a leader 
sequence fragment that provide for secretion in yeast of the foreign protein. Preferably, there 
are processing sites encoded between the leader fragment and the foreign gene that can be 
cleaved either in vivo or in vitro. The leader sequence fragment usually encodes a signal 
peptide comprised of hydrophobic amino acids which direct the secretion of the protein from 
the cell. 

DNA encoding suitable signal sequences can be derived from genes for secreted yeast 
proteins, such as the yeast invertase gene (EPO Publ. No. 012 873; JPO Publ. No. 
62:096,086) and the A-factor gene (U.S. Patent 4,588,684). Alternatively, leaders of non- 
yeast origin, such as an interferon leader, exist that also provide for secretion in yeast (EPO 
Publ. No. 060 057). 

A preferred class of secretion leaders are those that employ a fragment of the yeast 
alpha-factor gene, which contains both a "pre" signal sequence, and a "pro" region. The types 
of alpha-factor fragments that can be employed include the full-length pre-pro alpha factor 
leader (about 83 amino acid residues) as well as truncated alpha- factor leaders (usually about 
25 to about 50 amino acid residues) (U.S. Patent Nos. 4,546,083 and 4,870,008; EPO Publ. 
No. 324 274). Additional leaders employing an alpha-factor leader fragment that provides for 
secretion include hybrid alpha-factor leaders made with a presequence of a first yeast, but a 
pro-region from a second yeast alphafactor. (See e.g., PCT Publ. No. WO 89/02463.) 

Usually, transcription termination sequences recognized by yeast are regulatory 
regions located 3' to the translation stop codon, and thus together with the promoter flank the 
coding sequence. These sequences direct the transcription of an mRNA which can be 
translated into the polypeptide encoded by the DNA. Examples of transcription terminator 
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sequence and other yeast-recognized termination sequences, such as those coding for 
glycolytic enzymes. 

Usually, the above described components, comprising a promoter, leader (if desired), 
coding sequence of interest, and transcription termination sequence, are put together into 
expression constructs. Expression constructs are often maintained in a replicon, such as an 
extrachromosomal element (e.g., plasmids) capable of stable maintenance in a host, such as 
yeast or bacteria. The replicon may have two replication systems, thus allowing it to be 
maintained, for example, in yeast for expression and in a prokaryotic host for cloning and 
amplification. Examples of such yeast-bacteria shuttle vectors include YEp24 (Botstein et al. 
(1979) Gene 5:17-24), pCl/1 (Brake et al. (1984) Proc. Natl Acad. Sci USA 57:4642-4646), 
and YRpl7 (Stinchcomb et al. (1982) J. Mol. Biol 755:157). In addition, a replicon may be 
either a high or low copy number plasmid. A high copy number plasmid will generally have a 
copy number ranging from about 5 to about 200, and usually about 10 to about 150. A host 
containing a high copy number plasmid will preferably have at least about 10, and more 
preferably at least about 20. Enter a high or low copy number vector may be selected, 
depending upon the effect of the vector and the foreign protein on the host. See e.g., Brake et 
al. 9 supra. 

Alternatively, the expression constructs can be integrated into the yeast genome with 
an integrating vector. Integrating vectors usually contain at least one sequence homologous to 
a yeast chromosome that allows the vector to integrate, and preferably contain two 
homologous sequences flanking the expression construct. Integrations appear to result from 
recombinations between homologous DNA in the vector and the yeast chromosome (Orr- 
Weaver et al. (1983) Methods in Enzymol 707:228-245). An integrating vector may be 
directed to a specific locus in yeast by selecting the appropriate homologous sequence for 
inclusion in the vector. See Orr- Weaver et aL, supra. One or more expression construct may 
integrate, possibly affecting levels of recombinant protein produced (Rine et al. (1983) Proc. 
Natl Acad. Sci. USA 50:6750). The chromosomal sequences included in the vector can occur 
either as a single segment in the vector, which results in the integration of the entire vector, or 
two segments homologous to adjacent segments in the chromosome and flanking the 
expression construct in the vector, which can result in the stable integration of only the 
expression construct. 
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Usually, extrachromosomal and integrating expression constructs may contain 
selectable markers to allow for the selection of yeast strains that have been transformed. 
Selectable markers may include biosynthetic genes that can be expressed in the yeast host, 
such as ADE2, HIS4, LEU2, TRP1, and ALG7, and the G418 resistance gene, which confer 
resistance in yeast cells to tunicamycin and G418, respectively. In addition, a suitable 
selectable marker may also provide yeast with the ability to grow in the presence of toxic 
compounds, such as metal. For example, the presence of CUP1 allows yeast to grow in the 
presence of copper ions (Butt et al (1987) Microbiol, Rev. 57:351). 

Alternatively, some of the above described components can be put together into 
transformation vectors. Transformation vectors are usually comprised of a selectable marker 
that is either maintained in a replicon or developed into an integrating vector, as described 
above. 

Expression and transformation vectors, either extrachromosomal replicons or 
integrating vectors, have been developed for transformation into many yeasts. For example, 
expression vectors and methods of introducing exogenous DNA into yeast hosts have been 
developed for, inter alia, the following yeasts: Candida albicans (Kurtz, et aL (1986) Mol. 
Cell Biol 6:142); Candida maltosa (Kunze, et al (1985) J. Basic Microbiol 25:141); 
Hansenula polymorpha (Gleeson, et al (1986) J. Gen. Microbiol 752:3459; Roggenkamp et 
al (1986) Mol Gen. Genet. 202:302); Kluyveromyces fragilis (Das, et al (1984) J. Bacteriol 
755:1 165); Kluyveromyces lactis (De Louvencourt et al (1983) J. Bacteriol 154:737; Van 
den Berg et al (1990) Bio/Technology 8: 135); Pichia guillerimondii (Kunze et al (1985) J. 
Basic Microbiol 25:141); Pichia pastoris (Cregg, et al (1985) Mol Cell Biol 5:3376; U.S. 
Patent Nos. 4,837,148 and 4,929,555); Saccharomyces cerevisiae (Hinnen et al (1978) Proc. 
Natl Acad. Sci. USA 75:1929; Ito et al (1983) J. Bacteriol. 755:163); Schizosaccharomyces 
pombe (Beach and Nurse (1981) Nature 300:706); and Yarrowia lipolytica (Davidow, et al 
(1985) Curr. Genet. 70:380471 Gaillardin, et al (1985) Curr. Genet. 70:49). 

Methods of introducing exogenous DNA into yeast hosts are well-known in the art, 
and usually include either the transformation of spheroplasts or of intact yeast cells treated 
with alkali cations. Transformation procedures usually vary with the yeast species to be 
transformed. See e.g., [Kurtz et al (1986) Mol Cell Biol 6:142; Kunze et al (1985) J. Basic 
Microbiol 25:141; Candida]; [Gleeson et al. (1986)7. Gen, Microbiol 752:3459; 
Roggenkamp et al (1986) Mol Gen. Genet. 202:302; Hansenula]; [Das et al (1984) J. 
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Bacteriol 755:1 165; De Louvencourt et al (1983) J. Bacteriol 154:1165; Van den Berg et al. 
(1990) Bio/Technology 5:135; Kluyveromyces]; [Gregg et al. (1985) Mol Cell Biol 5:3376; 
Kunze et al (1985) J. Basic Microbiol 25:141; U.S. Patent Nos. 4,837,148 and 4,929,555; 
Pichia]; [Hinnen et al (1978) Proc. Natl Acad. Sci. USA 75;1929; Ito et al (1983) J. 
Bacteriol 753:163 Saccharomyces]; [Beach and Nurse (1981) Nature 500:706; 
Schizosaccharomyces]; [Davidow et al (1985) Curr. Genet. 70:39; Gaillardin et al (1985) 
Curr. Genet. 70:49; Yarrowia]. 

Definitions 

A composition containing X is "substantially free of Y when at least 85% by weight 
of the total X+Y in the composition is X. Preferably, X comprises at least about 90% by 
weight of the total of X+Y in the composition, more preferably at least about 95% or even 
99% by weight. 

A "conserved" Neisseria amino acid fragment or protein is one that is present in a 
particular Neisserial protein in at least x% of Neisseria, The value of x may be 50% or more, 
e.g., 66%, 75%, 80%, 90%, 95% or even 100% (i.e. the amino acid is found in the protein in 
question in all Neisseria). In order to determine whether an animo acid is "conserved" in a 
particular Neisserial protein, it is necessary to compare that amino acid residue in the 
sequences of the protein in question from a plurality of different Neisseria (a reference 
population). The reference population may include a number of different Neisseria species 
or may include a single species. The reference population may include a number of different 
sero groups of a particular species or a single serogroup. A preferred reference population 
consists of the 5 most common Neisseria strains. 

The term "heterologous" refers to two biological components that are not found 
together in nature. The components may be host cells, genes, or regulatory regions, such as 
promoters. Although the heterologous components are not found together in nature, they can 
function together, as when a promoter heterologous to a gene is operably linked to the gene. 
Another example is where a Neisserial sequence is heterologous to a mouse host cell. 

"Epitope" means antigenic determinant, and may elicit a cellular and/or humoral 
response. 

Conditions for "high stringency" are 65 degrees C in 0.1 xSSC 0.5% SDS solution. 
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An "origin of replication" is a polynucleotide sequence that initiates and regulates 
replication of polynucleotides, such as an expression vector. The origin of replication behaves 
as an autonomous unit of polynucleotide replication within a cell, capable of replication under 
its own control. An origin of replication may be needed for a vector to replicate in a particular 
host cell. With certain origins of replication, an expression vector can be reproduced at a high 
copy number in the presence of the appropriate proteins within the cell. Examples of origins 
are the autonomously replicating sequences, which are effective in yeast; and the viral 
T-antigen, effective in COS-7 cells. 

A "mutant" sequence is defined as a DNA, RNA or amino acid sequence differing 
from but having homology with the native or disclosed sequence. Depending on the 
particular sequence, the degree of homology (sequence identity) between the native or 
disclosed sequence and the mutant sequence is preferably greater than 50% (e.g., 60%, 70%, 
80%, 90%, 95%, 99% or more) which is calculated as described above. As used herein, an 
"allelic variant" of a nucleic acid molecule, or region, for which nucleic acid sequence is 
provided herein is a nucleic acid molecule, or region, that occurs at essentially the same locus 
in the genome of another or second isolate, and that, due to natural variation caused by, for 
example, mutation or recombination, has a similar but not identical nucleic acid sequence. A 
coding region allelic variant typically encodes a protein having similar activity to that of the 
protein encoded by the gene to which it is being compared. An allelic variant can also 
comprise an alteration in the 5' or 3' untranslated regions of the gene, such as in regulatory 
control regions, (see, for example, U.S. Patent 5,753,235). 

Antibodies 

As used herein, the term "antibody" refers to a polypeptide or group of polypeptides 
composed of at least one antibody combining site. An "antibody combining site" is the 
three-dimensional binding space with an internal surface shape and charge distribution 
complementary to the features of an epitope of an antigen, which allows a binding of the 
antibody with the antigen. "Antibody" includes, for example, vertebrate antibodies, hybrid 
antibodies, chimeric antibodies, humanized antibodies, altered antibodies, univalent 
antibodies, Fab proteins, and single domain antibodies. 

Antibodies against the proteins of the invention are useful for affinity 
chromatography, immunoassays, and distinguishing/identifying Neisseria menB proteins. 
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Antibodies elicited against the proteins of the present invention bind to antigenic polypeptides 
or proteins or protein fragments that are present and specifically associated with strains of 
Neisseria meningitidis menB. In some instances, these antigens may be associated with 
specific strains, such as those antigens specific for the menB strains. The antibodies of the 
invention may be immobilized to a matrix and utilized in an immunoassay or on an affinity 
chromatography column, to enable the detection and/or separation of polypeptides, proteins or 
protein fragments or cells comprising such polypeptides, proteins or protein fragments. 
Alternatively, such polypeptides, proteins or protein fragments may be immobilized so as to 
detect antibodies bindably specific thereto. 

Antibodies to the proteins of the invention, both polyclonal and monoclonal, may be 
prepared by conventional methods. In general, the protein is first used to immunize a suitable 
animal, preferably a mouse, rat, rabbit or goat. Rabbits and goats are preferred for the 
preparation of polyclonal sera due to the volume of serum obtainable, and the availability of 
labeled anti-rabbit and anti-goat antibodies. Immunization is generally performed by mixing 
or emulsifying the protein in saline, preferably in an adjuvant such as Freund's complete 
adjuvant, and injecting the mixture or emulsion parenterally (generally subcutaneously or 
intramuscularly). A dose of 50-200 |ig/injection is typically sufficient. Immunization is 
generally boosted 2-6 weeks later with one or more injections of the protein in saline, 
preferably using Freund's incomplete adjuvant. One may alternatively generate antibodies by 
in vitro immunization using methods known in the art, which for the purposes of this 
invention is considered equivalent to in vivo immunization. Polyclonal antisera is obtained by 
bleeding the immunized animal into a glass or plastic container, incubating the blood at 25°C 
for one hour, followed by incubating at 4°C for 2-18 hours. The serum is recovered by 
centrifugation (e.g., l,000g for 10 minutes). About 20-50 ml per bleed may be obtained from 
rabbits. 

Monoclonal antibodies are prepared using the standard method of Kohler & Milstein 
{Nature (1975) 256:495-96), or a modification thereof. Typically, a mouse or rat is 
immunized as described above. However, rather than bleeding the animal to extract serum, 
the spleen (and optionally several large lymph nodes) is removed and dissociated into single 
cells. If desired, the spleen cells may be screened (after removal of nonspecifically adherent 
cells) by applying a cell suspension to a plate or well coated with the protein antigen. B-cells 
that express membrane-bound immunoglobulin specific for the antigen bind to the plate, and 
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are not rinsed away with the rest of the suspension. Resulting B-cells, or all dissociated spleen 
cells, are then induced to fuse with myeloma cells to form hybridomas, and are cultured in a 
selective medium (e.g., hypoxanthine, aminopterin, thymidine medium, "HAT"). The 
resulting hybridomas are plated by limiting dilution, and are assayed for the production of 
antibodies which bind specifically to the immunizing antigen (and which do not bind to 
unrelated antigens). The selected MAb-secreting hybridomas are then cultured either in vitro 
(e.g., in tissue culture bottles or hollow fiber reactors), or in vivo (as ascites in mice). 

If desired, the antibodies (whether polyclonal or monoclonal) may be labeled using 
conventional techniques. Suitable labels include fluorophores, chromophores, radioactive 
atoms (particularly 32 P and 125 I), electron-dense reagents, enzymes, and ligands having 
specific binding partners. Enzymes are typically detected by their activity. For example, 
horseradish peroxidase is usually detected by its ability to convert 
3,3',5,5'-tetramethylbenzidine (TMB) to a blue pigment, quantifiable with a 
spectrophotometer. "Specific binding partner" refers to a protein capable of binding a ligand 
molecule with high specificity, as for example in the case of an antigen and a monoclonal 
antibody specific therefor. Other specific binding partners include biotin and avidin or 
streptavidin, IgG and protein A, and the numerous receptor-ligand couples known in the art. It 
should be understood that the above description is not meant to categorize the various labels 
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into distinct classes, as the same label may serve in several different modes. For example, I 
may serve as a radioactive label or as an electron-dense reagent. HRP may serve as enzyme or 
as antigen for a MAb. Further, one may combine various labels for desired effect. For 
example, MAbs and avidin also require labels in the practice of this invention: thus, one 
might label a MAb with biotin, and detect its presence with avidin labeled with 125 I, or with 
an anti-biotin MAb labeled with HRP. Other permutations and possibilities will be readily 
apparent to those of ordinary skill in the art, and are considered as equivalents within the 
scope of the instant invention. 

Antigens, immunogens, polypeptides, proteins or protein fragments of the present 
invention elicit formation of specific binding partner antibodies. These antigens, 
immunogens, polypeptides, proteins or protein fragments of the present invention comprise 
immunogenic compositions of the present invention. Such immunogenic compositions may 
further comprise or include adjuvants, carriers, or other compositions that promote or enhance 
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or stabilize the antigens, polypeptides, proteins or protein fragments of the present invention. 
Such adjuvants and carriers will be readily apparent to those of ordinary skill in the art. 

Pharmaceutical Compositions 

Pharmaceutical compositions can comprise (include) either polypeptides, antibodies, 
or nucleic acid of the invention. The pharmaceutical compositions will comprise a 
therapeutically effective amount of either polypeptides, antibodies, or polynucleotides of the 
claimed invention. 

The term "therapeutically effective amount" as used herein refers to an amount of a 
therapeutic agent to treat, ameliorate, or prevent a desired disease or condition, or to exhibit a 
detectable therapeutic or preventative effect. The effect can be detected by, for example, 
chemical markers or antigen levels. Therapeutic effects also include reduction in physical 
symptoms, such as decreased body temperature, when given to a patient that is febrile. The 
precise effective amount for a subject will depend upon the subject's size and health, the 
nature and extent of the condition, and the therapeutics or combination of therapeutics 
selected for administration. Thus, it is not useful to specify an exact effective amount in 
advance. However, the effective amount for a given situation can be determined by routine 
experimentation and is within the judgment of the clinician. 

For purposes of the present invention, an effective dose will be from about 0.01 mg/ 
kg to 50 mg/kg or 0.05 mg/kg to about 10 mg/kg of the DNA constructs in the individual to 
which it is administered. 

A pharmaceutical composition can also contain a pharmaceutically acceptable carrier. 
The term "pharmaceutically acceptable carrier" refers to a carrier for administration of a 
therapeutic agent, such as antibodies or a polypeptide, genes, and other therapeutic agents. 
The term refers to any pharmaceutical carrier that does not itself induce the production of 
antibodies harmful to the individual receiving the composition, and which may be 
administered without undue toxicity. Suitable carriers may be large, slowly metabolized 
macromolecules such as proteins, polysaccharides, polylactic acids, polyglycolic acids, 
polymeric amino acids, amino acid copolymers, and inactive virus particles. Such carriers are 
well known to those of ordinary skill in the art. 

Pharmaceutically acceptable salts can be used therein, for example, mineral acid salts 
such as hydrochlorides, hydrobromides, phosphates, sulfates, and the like; and the salts of 
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organic acids such as acetates, propionates, malonates, benzoates, and the like. A thorough 
discussion of pharmaceutically acceptable excipients is available in Remington's 
Pharmaceutical Sciences (Mack Pub. Co., NJ. 1991). 

Pharmaceutically acceptable carriers in therapeutic compositions may contain liquids 
such as water, saline, glycerol and ethanol. Additionally, auxiliary substances, such as wetting 
or emulsifying agents, pH buffering substances, and the like, may be present in such vehicles. 
Typically, the therapeutic compositions are prepared as injectables, either as liquid solutions 
or suspensions; solid forms suitable for solution in, or suspension in, liquid vehicles prior to 
injection may also be prepared. Liposomes are included within the definition of a 
pharmaceutically acceptable carrier. 

Delivery Methods 

Once formulated, the compositions of the invention can be administered directly to the 
subject. The subjects to be treated can be animals; in particular, human subjects can be 
treated. 

Direct delivery of the compositions will generally be accomplished by injection, either 
subcutaneously, intraperitoneally, intravenously or intramuscularly or delivered to the 
interstitial space of a tissue. The compositions can also be administered into a lesion. Other 
modes of administration include oral and pulmonary administration, suppositories, and 
transdermal and transcutaneous applications, needles, and gene guns or hyposprays. Dosage 
treatment may be a single dose schedule or a multiple dose schedule. 

Vaccines 

Vaccines according to the invention may either be prophylactic (i.e., to prevent 
infection) or therapeutic (i.e., to treat disease after infection). 

Such vaccines comprise immunizing antigen(s) or immunogen(s), immunogenic 
polypeptide, protein(s) or protein fragments, or nucleic acids (e.g., ribonucleic acid or 
deoxyribonucleic acid), usually in combination with "pharmaceutically acceptable carriers," 
which include any carrier that does not itself induce the production of antibodies harmful to 
the individual receiving the composition. Suitable carriers are typically large, slowly 
metabolized macromolecules such as proteins, polysaccharides, polylactic acids, polyglycolic 
acids, polymeric amino acids, amino acid copolymers, lipid aggregates (such as oil droplets or 
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liposomes), and inactive virus particles. Such carriers are well known to those of ordinary 
skill in the art. Additionally, these carriers may function as immunostimulating agents 
("adjuvants")- Furthermore, the immunogen or antigen may be conjugated to a bacterial 
toxoid, such as a toxoid from diphtheria, tetanus, cholera, K pylori, etc. pathogens. 

Preferred adjuvants to enhance effectiveness of the composition include, but are not 
limited to: (1) aluminum salts (alum), such as aluminum hydroxide, aluminum phosphate, 
aluminum sulfate, etc; (2) oil-in-water emulsion formulations (with or without other specific 
immunostimulating agents such as muramyl peptides (see below) or bacterial cell wall 
components), such as for example (a) MF59 (PCT Publ. No. WO 90/14837), containing 5% 
Squalene, 0.5% Tween 80, and 0.5% Span 85 (optionally containing various amounts of 
MTP-PE (see below), although not required) formulated into submicron particles using a 
microfluidizer such as Model HOY micro fluidizer (Microfluidics, Newton, MA), (b) SAF, 
containing 10% Squalane, 0.4% Tween 80, 5% pluronic-blocked polymer L121, and thr-MDP 
(see below) either microfluidized into a submicron emulsion or vortexed to generate a larger 
particle size emulsion, and (c) Ribi™ adjuvant system (RAS), (Ribi Immunochem, Hamilton, 
MT) containing 2% Squalene, 0.2% Tween 80, and one or more bacterial cell wall 
components from the group consisting of monophosphorylipid A (MPL), trehalose 
dimycolate (TDM), and cell wall skeleton (CWS), preferably MPL + CWS (Detox™); 

(3) saponin adjuvants, such as Stimulon™ (Cambridge Bioscience, Worcester, MA) may be 
used or particles generated therefrom such as ISCOMs (immunostimulating complexes); 

(4) Complete Freund's Adjuvant (CFA) and Incomplete Freund's Adjuvant (IF A); 

(5) cytokines, such as interleukins (e.g., IL-1, DL-2, IL-4, IL-5, IL-6, IL-7, IL-12, etc.), 
interferons (e.g., gamma interferon), macrophage colony stimulating factor (M-CSF), tumor 
necrosis factor (TNF), etc; (6) detoxified mutants of a bacterial ADP-ribosylating toxin such 
as a cholera toxin (CT), a pertussis toxin (PT), or an E. coli heat-labile toxin (LT), particularly 
LT-K63, LT-R72, CT-S109, PT-K9/G129; see, e.g., WO 93/13302 and WO 92/19265; and 
(7) other substances that act as immunostimulating agents to enhance the effectiveness of the 
composition. Alum and MF59 are preferred. 

As mentioned above, muramyl peptides include, but are not limited to, N-acetyl- 
muramyl-L-threonyl-D-isoglutamine (thr-MDP), N-acetyl-normuramyl-L-alanyl-D- 
isoglutamine (nor-MDP), N-acetylmuramyl-L-alanyl-D-isoglutaminyl-L-alanine-2-(r-2 f - 
dipalmitoyl-5/i-glycero-3-huydroxyphosphoryloxy)-ethylamine (MTP-PE), etc. 
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The vaccine compositions comprising immunogenic compositions (e.g., which may 
include the antigen, pharmaceutical^ acceptable carrier, and adjuvant) typically will contain 
diluents, such as water, saline, glycerol, ethanol, etc. Additionally, auxiliary substances, such 
as wetting or emulsifying agents, pH buffering substances, and the like, may be present in 
such vehicles. Alternatively, vaccine compositions comprising immunogenic compositions 
may comprise an antigen, polypeptide, protein, protein fragment or nucleic acid in a 
pharmaceutical^ acceptable carrier. 

More specifically, vaccines comprising immunogenic compositions comprise an 
immunologically effective amount of the immunogenic polypeptides, as well as any other of 
the above-mentioned components, as needed. By "immunologically effective amount", it is 
meant that the administration of that amount to an individual, either in a single dose or as part 
of a series, is effective for treatment or prevention. This amount varies depending upon the 
health and physical condition of the individual to be treated, the taxonomic group of 
individual to be treated (e.g., nonhuman primate, primate, etc.), the capacity of the 
individual's immune system to synthesize antibodies, the degree of protection desired, the 
formulation of the vaccine, the treating doctor's assessment of the medical situation, and other 
relevant factors. It is expected that the amount will fall in a relatively broad range that can be 
determined through routine trials. 

Typically, the vaccine compositions or immunogenic compositions are prepared as 
injectables, either as liquid solutions or suspensions; solid forms suitable for solution in, or 
suspension in, liquid vehicles prior to injection may also be prepared. The preparation also 
may be emulsified or encapsulated in liposomes for enhanced adjuvant effect, as discussed 
above under pharmaceutically acceptable carriers. 

The immunogenic compositions are conventionally administered parenterally, e.g., by 
injection, either subcutaneously or intramuscularly. Additional formulations suitable for other 
modes of administration include oral and pulmonary formulations, suppositories, and 
transdermal and transcutaneous applications. Dosage treatment may be a single dose schedule 
or a multiple dose schedule. The vaccine may be administered in conjunction with other 
immunoregulatory agents. 

As an alternative to protein-based vaccines, DNA vaccination may be employed (e.g., 
Robinson & Torres (1997) Seminars in Immunology 9:271-283; Donnelly et al (1997) Annu 
Rev Immunol 15:617-648). 
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Gene Delivery Vehicles 

Gene therapy vehicles for delivery of constructs, including a coding sequence of a 
therapeutic of the invention, to be delivered to the mammal for expression in the mammal, can 
be administered either locally or systemically. These constructs can utilize viral or non-viral 
vector approaches in in vivo or ex vivo modality. Expression of such coding sequence can be 
induced using endogenous mammalian or heterologous promoters. Expression of the coding 
sequence in vivo can be either constitutive or regulated. 

The invention includes gene delivery vehicles capable of expressing the contemplated 
nucleic acid sequences. The gene delivery vehicle is preferably a viral vector and, more 
preferably, a retroviral, adenoviral, adeno-associated viral (AAV), herpes viral, or alphavirus 
vector. The viral vector can also be an astrovirus, cororlavirus, orthomyxovirus, papovavirus, 
paramyxovirus, parvovirus, picornavirus, poxvirus, or togavirus viral vector. See generally, 
Jolly (1994) Cancer Gene Therapy 1:51-64; Kimura (1994) Human Gene Therapy 5:845-852; 
Connelly (1995) Human Gene Therapy 6:185-193; and Kapiitt (1994) Nature Genetics 
6:148-153. 

Retroviral vectors are well known in the art, including B, C and D type retroviruses, 
xenotropic retroviruses (for example, NZB-X1, NZB-X2 and NZB9-1 (see O'Neill (1985) 7. 
Virol. 53:160) polytropic retroviruses e.g., MCF and MCF-MLV (see Kelly (1983) J. Virol. 
45:291), spumaviruses and lentiviruses. See RNA Tumor Viruses, Second Edition, Cold 
Spring Harbor Laboratory, 1985. 

Portions of the retroviral gene therapy vector may be derived from different 
retroviruses. For example, retrovector LTRs may be derived from a Murine Sarcoma Virus, a 
tRNA binding site from a Rous Sarcoma Virus, a packaging signal from a Murine Leukemia 
Virus, and an origin of second strand synthesis from an Avian Leukosis Virus. 

These recombinant retroviral vectors may be used to generate transduction competent 
retroviral vector particles by introducing them into appropriate packaging cell lines (see US 
patent 5,591,624). Retrovirus vectors can be constructed for site-specific integration into host 
cell DNA by incorporation of a chimeric integrase enzyme into the retroviral particle (see 
W096/37626). It is preferable that the recombinant viral vector is a replication defective 
recombinant virus. 
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Packaging cell lines suitable for use with the above-described retrovirus vectors are 
well known in the art, are readily prepared (see WO95/30763 and WO92/05266), and can be 
used to create producer cell lines (also termed vector cell lines or "VCLs") for the production 
of recombinant vector particles. Preferably, the packaging cell lines are made from human 
parent cells (e.g., HT1080 cells) or mink parent cell lines, which eliminates inactivation in 
human serum. 

Preferred retroviruses for the construction of retroviral gene therapy vectors include 
Avian Leukosis Virus, Bovine Leukemia, Virus, Murine Leukemia Virus, Mink-Cell 
Focus-Inducing Virus, Murine Sarcoma Virus, Reticuloendotheliosis Virus and Rous 
Sarcoma Virus. Particularly preferred Murine Leukemia Viruses include 4070A and 1504A 
(Hartley and Rowe (1976) J Virol 19:19-25), Abelson (ATCC No. VR-999), Friend (ATCC 
No. VR-245), Graffi, Gross (ATCC Nol VR-590), Kirsten, Harvey Sarcoma Virus and 
Rauscher (ATCC No. VR-998) and Moloney Murine Leukemia Virus (ATCC No. VR-190). 
Such retroviruses may be obtained from depositories or collections such as the American 
Type Culture Collection ("ATCC") in Rockville, Maryland or isolated from known sources 
using commonly available techniques. 

Exemplary known retroviral gene therapy vectors employable in this invention include 
those described in patent applications GB2200651, EP04 15731, EP0345242, EP0334301, 
WO89/02468; WO89/05349, WO89/09271, WO90/02806, WO90/07936, WO94/03622, 
W093/25698, W093/25234, WO93/11230, WO93/10218, WO91/02805, WO91/02825, 
WO95/07994, US 5,219,740, US 4,405,712, US 4,861,719, US 4,980,289, US 4,777,127, US 
5,591,624. See also Vile (1993) Cancer Res 53:3860-3864; Vile (1993) Cancer Res 
53:962-967; Ram (1993) Cancer Res 53 (1993) 83-88; Takamiya (1992) JNeurosci Res 
33:493-503; Baba (1993) JNeurosurg 79:729-735; Mann (1983) Cell 33:153; Cane (1984) 
Proc Natl Acad Sci 81 :6349; and Miller (1990) Human Gene Therapy 1. 

Human adenoviral gene therapy vectors are also known in the art and employable in 
this invention. See, for example, Berkner (1988) Biotechniques 6:616 and Rosenfeld (1991) 
Science 252:431, and WO93/07283, WO93/06223, and WO93/07282. Exemplary known 
adenoviral gene therapy vectors employable in this invention include those described in the 
above referenced documents and in W094/12649, WO93/03769, W093/19191, 
W094/28938, W095/11984, WO95/00655, WO95/27071, W095/29993, W095/34671, 
WO96/05320, WO94/08026, WO94/11506, WO93/06223, WQ94/24299, WO95/14102, 
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W095/24297, WO95/02697, W094/28152, W094/24299, WO95/09241, WO95/25807, 
WO95/05835, W094/18922 and WO95/09654. Alternatively, administration of DNA linked 
to killed adenovirus as described in Curiel (1992) Hum. Gene Ther. 3:147-154 may be 
employed. The gene delivery vehicles of the invention also include adenovirus associated 
virus (AAV) vectors. Leading and preferred examples of such vectors for use in this invention 
are the AAV-2 based vectors disclosed in Srivastava, WO93/09239. Most preferred AAV 
vectors comprise the two AAV inverted terminal repeats in which the native D-sequences are 
modified by substitution of nucleotides, such that at least 5 native nucleotides and up to 18 
native nucleotides, preferably at least 10 native nucleotides up to 18 native nucleotides, most 
preferably 10 native nucleotides are retained and the remaining nucleotides of the D-sequence 
are deleted or replaced with non-native nucleotides. The native D-sequences of the AAV 
inverted terminal repeats are sequences of 20 consecutive nucleotides in each AAV inverted 
terminal repeat (i.e., there is one sequence at each end) which are not involved in HP 
formation. The non-native replacement nucleotide may be any nucleotide other than the 
nucleotide found in the native D-sequence in the same position. Other employable exemplary 
AAV vectors are pWP-19, pWN-1, both of which are disclosed in Nahreini (1993) Gene 
124:257-262. Another example of such an AAV vector is psub201 (see Samulski (1987) J. 
Virol 61:3096). Another exemplary AAV vector is the Double-D ITR vector. Construction of 
the Double-D ITR vector is disclosed in US Patent 5,478,745. Still other vectors are those 
disclosed in Carter US Patent 4,797,368 and Muzyczka US Patent 5,139,941, Chartejee US 
Patent 5,474,935, and Kotin W094/288157. Yet a further example of an AAV vector 
employable in this invention is SSV9AFABTKneo, which contains the AFP enhancer and 
albumin promoter and directs expression predominantly in the liver. Its structure and 
construction are disclosed in Su (1996) Human Gene Therapy 7:463-470. Additional AAV 
gene therapy vectors are described in US 5,354,678, US 5,173,414, US 5,139,941, and US 
5,252,479. 

The gene therapy vectors comprising sequences of the invention also include herpes 
vectors. Leading and preferred examples are herpes simplex virus vectors containing a 
sequence encoding a thymidine kinase polypeptide such as those disclosed in US 5,288,641 
and EP0176170 (Roizman). Additional exemplary herpes simplex virus vectors include 
HFEM/ICP6-LacZ disclosed in WO95/04139 (Wistar Institute), pHSVlac described in Geller 
(1988) Science 241:1667-1669 and in WO90/09441 and WO92/07945, HSV Us3::pgC-lacZ 
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described in Fink (1992) Human Gene Therapy 3:1 1-19 and HSV 7134, 2 RH 105 and GAL4 
described in EP 0453242 (Breakefield), and those deposited with the ATCC as accession 
numbers ATCC VR-977 and ATCC VR-260. 

Also contemplated are alpha virus gene therapy vectors that can be employed in this 
invention. Preferred alpha virus vectors are Sindbis viruses vectors. Togaviruses, Semliki 
Forest virus (ATCC VR-67; ATCC VR-1 247), Middleberg virus (ATCC VR-370), Ross 
River virus (ATCC VR-373; ATCC VR-1246), Venezuelan equine encephalitis virus (ATCC 
VR923; ATCC VR-1250; ATCC VR-1249; ATCC VR-532), and those described in US 
patents 5,091,309, 5,217,879, and WO92/10578. More particularly, those alpha virus vectors 
described in U.S. Serial No. 08/405,627, filed March 15, 1 995 ,W094/2 1792, WO92/10578, 
WO95/07994, US 5,091,309 and US 5,217,879 are employable. Such alpha viruses may be 
obtained from depositories or collections such as the ATCC in Rockville, Maryland or 
isolated from known sources using commonly available techniques. Preferably, alphavirus 
vectors with reduced cytotoxicity are used (see USSN 08/679640). 

DNA vector systems such as eukarytic layered expression systems are also useful for 
expressing the nucleic acids of the invention. SeeWO95/07994 for a detailed description of 
eukaryotic layered expression systems. Preferably, the eukaryotic layered expression systems 
of the invention are derived from alphavirus vectors and most preferably from Sindbis viral 
vectors. 

Other viral vectors suitable for use in the present invention include those derived from 
poliovirus, for example ATCC VR-58 and those described in Evans, Nature 339 (1989) 385 
and Sabin (1973) J. Biol. Standardization 1:115; rhinovirus, for example ATCC VR-1 1 10 and 
those described in Arnold (1990) J Cell Biochem L401; pox viruses such as canary pox virus 
or vaccinia virus, for example ATCC VR-1 1 1 and ATCC VR-2010 and those described in 
Fisher-Hoch (1989) Proc Natl Acad Sci 86:317; Flexner (1989) Ann NY Acad Sci 569:86, 
Flexner (1990) Vaccine 8:17; in US 4,603,1 12 and US 4,769,330 and WO89/01973; SV40 
virus, for example ATCC VR-305 and those described in Mulligan (1979) Nature 277:108 
and Madzak (1992) J Gen Virol 73:1533; influenza virus, for example ATCC VR-797 and 
recombinant influenza viruses made employing reverse genetics techniques as described in 
US 5,166,057 and in Enami (1990) Proc Natl Acad Sci 87:3802-3805; Enami & Palese (1991) 
J Virol 65:271 1-2713 and Luytjes (1989) Cell 59:1 10, (see also McMichael (1983) NEJ Med 
309:13, and Yap (1978) Nature 273:238 and Nature (1979) 277:108); human 
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immunodeficiency virus as described in EP-03 86882 and in Buchschacher (1992) J. Virol. 
66:2731; measles virus, for example ATCC VR-67 and VR-1247 and those described in EP- 
0440219; Aura virus, for example ATCC VR-368; Bebaru virus, for example ATCC VR-600 
and ATCC VR-1240; Cabassou virus, for example ATCC VR-922; Chikungunya virus, for 
example ATCC VR-64 and ATCC VR-1241; Fort Morgan Virus, for example ATCC 
VR-924; Getah virus, for example ATCC VR-369 and ATCC VR-1243; Kyzylagach virus, 
for example ATCC VR-927; Mayaro virus, for example ATCC VR-66; Mucambo virus, for 
example ATCC VR-580 and ATCC VR-1244; Ndumu virus, for example ATCC VR-371; 
Pixuna virus, for example ATCC VR-372 and ATCC VR-1245; Tonate virus, for example 
ATCC VR-925; Triniti virus, for example ATCC VR-469; Una virus, for example ATCC 
VR-374; Whataroa virus, for example ATCC VR-926; Y-62-33 virus, for example ATCC 
VR-375; O'Nyong virus, Eastern encephalitis virus, for example ATCC VR-65 and ATCC 
VR-1242; Western encephalitis virus, for example ATCC VR-70, ATCC VR-1251, ATCC 
VR-622 and ATCC VR-1252; and coronavirus, for example ATCC VR-740 and those 
described in Hamre (1966) Proc Soc Exp Biol Med 121 : 190. 

Delivery of the compositions of this invention into cells is not limited to the above 
mentioned viral vectors. Other delivery methods and media may be employed such as, for 
example, nucleic acid expression vectors, polycationic condensed DNA linked or unlinked to 
killed adenovirus alone, for example see US Serial No. 08/366,787, filed December 30, 1994 
and Curiel (1992) Hum Gene Ther 3:147-154 ligand linked DNA, for example see Wu (1989) 
J Biol Chem 264:16985-16987, eucaryotic cell delivery vehicles cells, for example see US 
Serial No.08/240,030 5 filed May 9, 1994, and US Serial No. 08/404,796, deposition of 
photopolymerized hydrogel materials, hand-held gene transfer particle gun, as described in 
US Patent 5,149,655, ionizing radiation as described in US5,206,152 and in W092/1 1033, 
nucleic charge neutralization or fusion with cell membranes. Additional approaches are 
described in Philip (1994) Mol Cell Biol 14:2411-2418 and in Woffendin (1994) Proc Natl 
AcadSci9\:\5%\-\5%5. 

Particle mediated gene transfer may be employed, for example see US Serial No. 
60/023,867. Briefly, the sequence can be inserted into conventional vectors that contain 
conventional control sequences for high level expression, and then incubated with synthetic 
gene transfer molecules such as polymeric DNA-binding cations like polylysine, protamine, 
and albumin, linked to cell targeting ligands such as asialoorosomucoid, as described in Wu & 
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Wu (1987) 7. Biol Chem. 262:4429-4432, insulin as described in Hucked (1990) Biochem 
Pharmacol 40:253-263, galactose as described in Plank (1992) Bioconjugate Chem 
3:533-539, lactose or transferrin. 

Naked DNA may also be employed to transform a host cell. Exemplary naked DNA 
introduction methods are described in WO 90/1 1092 and US 5,580,859. Uptake efficiency 
may be improved using biodegradable latex beads. DNA coated latex beads are efficiently 
transported into cells after endocytosis initiation by the beads. The method may be improved 
further by treatment of the beads to increase hydrophobicity and thereby facilitate disruption 
of the endosome and release of the DNA into the cytoplasm. 

Liposomes that can act as gene delivery vehicles are described in U.S. 5,422,120, 
W095/13796, W094/23697, W091/14445 and EP-524,968. As described inUSSN. 
60/023,867, on non-viral delivery, the nucleic acid sequences encoding a polypeptide can be 
inserted into conventional vectors that contain conventional control sequences for high level 
expression, and then be incubated with synthetic gene transfer molecules such as polymeric 
DNA-binding cations like polylysine, protamine, and albumin, linked to cell targeting ligands 
such as asialoorosomucoid, insulin, galactose, lactose, or transferrin. Other delivery systems 
include the use of liposomes to encapsulate DNA comprising the gene under the control of a 
variety of tissue-specific or ubiquitously-active promoters. Further non- viral delivery suitable 
for use includes mechanical delivery systems such as the approach described in Woffendin et 
al (1994) Proc. Natl. Acad. Sci. USA 91(24):1 1581-1 1585. Moreover, the coding sequence 
and the product of expression of such can be delivered through deposition of 
photopolymerized hydrogel materials. Other conventional methods for gene delivery that can 
be used for delivery of the coding sequence include, for example, use of hand-held gene 
transfer particle gun, as described in U.S. 5,149,655; use of ionizing radiation for activating 
transferred gene, as described in U.S. 5,206,152 and W092/1 1033. 

Exemplary liposome and polycationic gene delivery vehicles are those described in 
US 5,422,120 and 4,762,915; inWO 95/13796; W094/23697; and W091/14445; in EP- 
0524968; and in Stryer, Biochemistry, pages 236-240 (1975) W.H. Freeman, San Francisco; 
Szoka (1980) Biochem Biophys Acta 600:1; Bayer (1979) Biochem Biophys Acta 550:464; 
Rivnay (1987) Meth Enzymol 149:119; Wang (1987) Proc Natl Acad Sci 84:7851; Plant 
(1989) Anal Biochem 176:420. 
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A polynucleotide composition can comprises therapeutically effective amount of a 
gene therapy vehicle, as the term is defined above. For purposes of the present invention, an 
effective dose will be from about 0.01 mg/ kg to 50 mg/kg or 0.05 mg/kg to about 10 mg/kg 
of the DNA constructs in the individual to which it is administered. 

Delivery Methods 

Once formulated, the polynucleotide compositions of the invention can be 
administered (1) directly to the subject; (2) delivered ex vivo, to cells derived from the 
subject; or (3) in vitro for expression of recombinant proteins. The subjects to be treated can 
be mammals or birds. Also, human subjects can be treated. 

Direct delivery of the compositions will generally be accomplished by injection, either 
subcutaneously, intraperitoneally, intravenously or intramuscularly or delivered to the 
interstitial space of a tissue. The compositions can also be administered into a tumor or lesion. 
Other modes of administration include oral and pulmonary administration, suppositories, and 
transdermal applications, needles, and gene guns or hyposprays. Dosage treatment may be a 
single dose schedule or a multiple dose schedule. 

Methods for the ex vivo delivery and reimplantation of transformed cells into a subject 
are known in the art and described in eg. W093/14778. Examples of cells useful in ex vivo 
applications include, for example, stem cells, particularly hematopoetic, lymph cells, 
macrophages, dendritic cells, or tumor cells. 

Generally, delivery of nucleic acids for both ex vivo and in vitro applications can be 
accomplished by the following procedures, for example, dextran-mediated transfection, 
calcium phosphate precipitation, polybrene mediated transfection, protoplast fusion, 
electroporation, encapsulation of the polynucleotide(s) in liposomes, and direct microinjection 
of the DNA into nuclei, all well known in the art. 

Polynucleotide and polypeptide pharmaceutical compositions 

In addition to the pharmaceutically acceptable carriers and salts described above, the 
following additional agents can be used with polynucleotide and/or polypeptide compositions. 
A.Polypeptides 

One example are polypeptides which include, without limitation: asioloorosomucoid 
(ASOR); transferrin; asialoglycoproteins; antibodies; antibody fragments; ferritin; 
interleukins; interferons, granulocyte, macrophage colony stimulating factor (GM-CSF), 
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granulocyte colony stimulating factor (G-CSF), macrophage colony stimulating factor 
(M-CSF), stem cell factor and erythropoietin. Viral antigens, such as envelope proteins, can 
also be used. Also, proteins from other invasive organisms, such as the 17 amino acid peptide 
from the circumsporozoite protein of Plasmodium falciparum known as RII. 

B. Hormones, Vitamins, Etc. 

Other groups that can be included are, for example: hormones, steroids, androgens, 
estrogens, thyroid hormone, or vitamins, folic acid. 

C. Polyalkylenes, Polysaccharides, etc. 

Also, polyalkylene glycol can be included with the desired polynucleotides or 
polypeptides. In a preferred embodiment, the polyalkylene glycol is polyethlylene glycol. In 
addition, mono-, di- ? or polysaccarides can be included. In a preferred embodiment of this 
aspect, the polysaccharide is dextran or DEAE-dextran. Also, chitosan and 
poly(lactide-co-glycolide) 

D. Lipids, and Liposomes 

The desired polynucleotide or polypeptide can also be encapsulated in lipids or 
packaged in liposomes prior to delivery to the subject or to cells derived therefrom. 

Lipid encapsulation is generally accomplished using liposomes which are able to 
stably bind or entrap and retain nucleic acid. The ratio of condensed polynucleotide or 
polypeptide to lipid preparation can vary but will generally be around 1 :1 (mg 
DNA:micromoles lipid), or more of lipid. For a review of the use of liposomes as carriers for 
delivery of nucleic acids, see, Hug and Sleight (1991) Biochim. Biophys. Acta. 1097:1-17; 
Straubinger (1983) Meth. Enzymol 101:512-527. 

Liposomal preparations for use in the present invention include cationic (positively 
charged), anionic (negatively charged) and neutral preparations. Cationic liposomes have 
been shown to mediate intracellular delivery of plasmid DNA (Feigner (1987) Proc. Natl. 
Acad, ScL USA 84:7413-7416); mRNA (Malone (1989) Proc. Nail Acad. Sci. USA 
86:6077-6081); and purified transcription factors (Debs (1990) J. Biol. Chem. 
265:10189-10192), in functional form. 

Cationic liposomes are readily available. For example, 
N[l-2,3-dioleyloxy)propyl]-N,N,N-triethylammonium (DOTMA) liposomes are available 
under the trademark Lipofectin, from GIBCO BRL, Grand Island, NY. (See, also, Feigner 
supra). Other commercially available liposomes include transfectace (DDAB/DOPE) and 



WO 99/057280 



45 



PCT/US99/09346 



DOTAP/DOPE (Boerhinger). Other cationic liposomes can be prepared from readily available 
materials using techniques well known in the art. See, eg. Szoka (1978) Proc. Natl. Acad: ScL 
USA 75:4194-4198; WO90/1 1092 for a description of the synthesis of DOTAP 
(1 ,2-bis(oleoyloxy)-3-(trimethylammonio)propane) liposomes. 

Similarly, anionic and neutral liposomes are readily available, such as from Avanti 
Polar Lipids (Birmingham, AL), or can be easily prepared using readily available materials. 
Such materials include phosphatidyl choline, cholesterol, phosphatidyl ethanolamine, 
dioleoylphosphatidyl choline (DOPC), dioleoylphosphatidyl glycerol (DOPG), 
dioleoylphoshatidyl ethanolamine (DOPE), among others. These materials can also be mixed 
with the DOTMA and DOTAP starting materials in appropriate ratios. Methods for making 
liposomes using these materials are well known in the art. 

The liposomes can comprise multilammelar vesicles (MLVs), small unilamellar 
vesicles (SUVs), or large unilamellar vesicles (LUVs). The various liposome-nucleic acid 
complexes are prepared using methods known in the art. See eg. Straubinger (1983) Meth. 
Immunol 101:512-527; Szoka (1978) Proc, NatL Acad, ScL USA 75:4194-4198; 
Papahadjopoulos (1975) Biochim. Biophys. Acta 394:483; Wilson (1979) Cell 17:77); Deamer 
& Bangham (1976) Biochim. Biophys. Acta 443:629; Ostro (1977) Biochem. Biophys. Res. 
Commun. 76:836; Fraley (1979) Proc. Natl. Acad. Sci. USA 76:3348); Enoch & Strittmatter 
(1979) Proc. Natl. Acad. ScL USA 76:145; Fraley (1980)7. Biol. Chem. (1980) 255:10431; 
Szoka & Papahadjopoulos (1978) Proc. NatL Acad, ScL USA 75:145; and Schaefer-Ridder 
(1982) Science 215:166. 
E.Lipoproteins 

In addition, lipoproteins can be included with the polynucleotide or polypeptide to be 
delivered. Examples of lipoproteins to be utilized include: chylomicrons, HDL, DDL, LDL, 
and VLDL. Mutants, fragments, or fusions of these proteins can also be used. Also, 
modifications of naturally occurring lipoproteins can be used, such as acetylated LDL. These 
lipoproteins can target the delivery of polynucleotides to cells expressing lipoprotein 
receptors. Preferably, if lipoproteins are including with the polynucleotide to be delivered, no 
other targeting ligand is included in the composition. 

Naturally occurring lipoproteins comprise a lipid and a protein portion. The protein 
portion are known as apoproteins. At the present, apoproteins A, B, C, D, and E have been 
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isolated and identified. At least two of these contain several proteins, designated by Roman 
numerals, AI, All, AIV; CI, CII, CIIL 

A lipoprotein can comprise more than one apoprotein. For example, naturally 
occurring chylomicrons comprises of A, B, C, and E, over time these lipoproteins lose A and 
acquire C and E apoproteins. VLDL comprises A, B, C, and E apoproteins, LDL comprises 
apoprotein B; and HDL comprises apoproteins A, C, and E. 

The amino acid of these apoproteins are known and are described in, for example, 
Breslow (1985) Annu Rev. Biochem 54:699; Law (1986) Adv. Exp Med. Biol. 151:162; Chen 
(1986) J Biol Chem 261:12918; Kane (1980) Proc Natl Acad Sci USA 77:2465; and Utermann 
(1984) Hum Genet 65:232. 

Lipoproteins contain a variety of lipids including, triglycerides, cholesterol (free and 
esters), and phopholipids. The composition of the lipids varies in naturally occurring 
lipoproteins. For example, chylomicrons comprise mainly triglycerides. A more detailed 
description of the lipid content of naturally occurring lipoproteins can be found, for example, 
in Meth. Enzymol. 128 (1986). The composition of the lipids are chosen to aid in 
conformation of the apoprotein for receptor binding activity. The composition of lipids can 
also be chosen to facilitate hydrophobic interaction and association with the polynucleotide 
binding molecule. 

Naturally occurring lipoproteins can be isolated from serum by ultracentrifugation, for 
instance. Such methods are described in Meth. Enzymoi (supra); Pitas (1980) J. Biochem. 
255:5454-5460 and Mahey (1979) J Clin. Invest 64:743-750. 

Lipoproteins can also be produced by in vitro or recombinant methods by expression 
of the apoprotein genes in a desired host cell. See, for example, Atkinson (1986) Annu Rev 
Biophys Chem 15:403 and Radding (1958) Biochim Biophys Acta 30: 443. 

Lipoproteins can also be purchased from commercial suppliers, such as Biomedical 
Techniologies, Inc., Stoughton, Massachusetts, USA. 

Further description of lipoproteins can be found in Zuckermann et al., PCT. Appln. 
No. US97/14465. 
F.Polycationic Agents 

Polycationic agents can be included, with or without lipoprotein, in a composition 
with the desired polynucleotide or polypeptide to be delivered. 
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Polycationic agents, typically, exhibit a net positive charge at physiological relevant 
pH and are capable of neutralizing the electrical charge of nucleic acids to facilitate delivery 
to a desired location. These agents have both in vitro, ex vivo, and in vivo applications. 
Polycationic agents can be used to deliver nucleic acids to a living subject either 
intramuscularly, subcutaneously, etc. 

The following are examples of useful polypeptides as polycationic agents: polylysine, 
polyarginine, polyornithine, and protamine. Other examples include histones, protamines, 
human serum albumin, DNA binding proteins, non-histone chromosomal proteins, coat 
proteins from DNA viruses, such as (XI 74, transcriptional factors also contain domains that 
bind DNA and therefore may be useful as nucleic aid condensing agents. Briefly, 
transcriptional factors such as C/CEBP, c-jun, c-fos, AP-1, AP-2, AP-3, CPF, Prot-1, Sp-1, 
Oct-1, Oct-2, CREP, and TFIID contain basic domains that bind DNA sequences. 

Organic polycationic agents include: spermine, spermidine, and purtrescine. 

The dimensions and of the physical properties of a polycationic agent can be 
extrapolated from the list above, to construct other polypeptide polycationic agents or to 
produce synthetic polycationic agents. 

Synthetic Polycationic Agents 

Synthetic polycationic agents which are useful include, for example, DEAE-dextran, 
polybrene. LipofectinD, and lipofectAMINED are monomers that form polycationic 
complexes when combined with polynucleotides or polypeptides. 

Immunodiagnostic Assays 

Neisserial antigens of the invention can be used in immunoassays to detect antibody 
levels (or, conversely, anti-Neisserial antibodies can be used to detect antigen levels). 
Immunoassays based on well defined, recombinant antigens can be developed to replace 
invasive diagnostics methods. Antibodies to Neisserial proteins within biological samples, 
including for example, blood or serum samples, can be detected. Design of the immunoassays 
is subject to a great deal of variation, and a variety of these are known in the art. Protocols for 
the immunoassay may be based, for example, upon competition, or direct reaction, or 
sandwich type assays. Protocols may also, for example, use solid supports, or may be by 
immunoprecipitation. Most assays involve the use of labeled antibody or polypeptide; the 
labels may be, for example, fluorescent, chemiluminescent, radioactive, or dye molecules. 
Assays which amplify the signals from the probe are also known; examples of which are 
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assays which utilize biotin and avidin, and enzyme-labeled and mediated immunoassays, such 
as ELISA assays. 

Kits suitable for immunodiagnosis and containing the appropriate labeled reagents are 
constructed by packaging the appropriate materials, including the compositions of the 
invention, in suitable containers, along with the remaining reagents and materials (for 
example, suitable buffers, salt solutions, etc.) required for the conduct of the assay, as well as 
suitable set of assay instructions. 

Nucleic Acid Hybridisation 

"Hybridization" refers to the association of two nucleic acid sequences to one another 
by hydrogen bonding. Typically, one sequence will be fixed to a solid support and the other 
will be free in solution. Then, the two sequences will be placed in contact with one another 
under conditions that favor hydrogen bonding. Factors that affect this bonding include: the 
type and volume of solvent; reaction temperature; time of hybridization; agitation; agents to 
block the non-specific attachment of the liquid phase sequence to the solid support 
(Denhardt ? s reagent or BLOTTO); concentration of the sequences; use of compounds to 
increase the rate of association of sequences (dextran sulfate or polyethylene glycol); and the 
stringency of the washing conditions following hybridization. See Sambrook et ah [supra] 
Volume 2, chapter 9, pages 9.47 to 9.57. 

"Stringency" refers to conditions in a hybridization reaction that favor association of 
very similar sequences over sequences that differ. For example, the combination of 
temperature and salt concentration should be chosen that is approximately 120 to 200 DC 
below the calculated Tm of the hybrid under study. The temperature and salt conditions can 
often be determined empirically in preliminary experiments in which samples of genomic 
DNA immobilized on filters are hybridized to the sequence of interest and then washed under 
conditions of different stringencies. See Sambrook et ah at page 9.50. 

Variables to consider when performing, for example, a Southern blot are (1) the 
complexity of the DNA being blotted and (2) the homology between the probe and the 
sequences being detected. The total amount of the fragment(s) to be studied can vary a 
magnitude of 10, from 0. 1 to 1 ng for a plasmid or phage digest to 10" 9 to 10" 8 g for a single 
copy gene in a highly complex eukaryotic genome. For lower complexity polynucleotides, 
substantially shorter blotting, hybridization, and exposure times, a smaller amount of starting 
polynucleotides, and lower specific activity of probes can be used. For example, a single-copy 
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yeast gene can be detected with an exposure time of only 1 hour starting with 1 jag of yeast 
DNA, blotting for two hours, and hybridizing for 4-8 hours with a probe of 10 8 cpm/jag. For a 
single-copy mammalian gene a conservative approach would start with 10 fig of DNA, blot 
overnight, and hybridize overnight in the presence of 10% dextran sulfate using a probe of 
greater than 10 8 cpm/jag, resulting in an exposure time of -24 hours. 

Several factors can affect the melting temperature (Tm) of a DNA-DNA hybrid 
between the probe and the fragment of interest, and consequently, the appropriate conditions 
for hybridization and washing. In many cases the probe is not 100% homologous to the 
fragment. Other commonly encountered variables include the length and total G+C content of 
the hybridizing sequences and the ionic strength and formamide content of the hybridization 
buffer. The effects of all of these factors can be approximated by a single equation: 

Tm- 81 + 16.6(logi 0 Ci) + 0.4[%(G + C)]-0.6(%formamide) - 600/n-1.5(%mismatch). 

where Ci is the salt concentration (monovalent ions) and n is the length of the hybrid 
in base pairs (slightly modified from Meinkoth & Wahl (1984) Anal Biochem. 138: 267-284). 

In designing a hybridization experiment, some factors affecting nucleic acid 
hybridization can be conveniently altered. The temperature of the hybridization and washes 
and the salt concentration during the washes are the simplest to adjust. As the temperature of 
the hybridization increases (ie. stringency), it becomes less likely for hybridization to occur 
between strands that are nonhomologous, and as a result, background decreases. If the 
radiolabeled probe is not completely homologous with the immobilized fragment (as is 
frequently the case in gene family and interspecies hybridization experiments), the 
hybridization temperature must be reduced, and background will increase. The temperature of 
the washes affects the intensity of the hybridizing band and the degree of background in a 
similar manner. The stringency of the washes is also increased with decreasing salt 
concentrations. 

In general, convenient hybridization temperatures in the presence of 50% formamide 
are 42 DC for a probe with is 95% to 100% homologous to the target fragment, 37 DC for 90% 
to 95% homology, and 32 DC for 85% to 90% homology. For lower homologies, formamide 
content should be lowered and temperature adjusted accordingly, using the equation above. If 
the homology between the probe and the target fragment are not known, the simplest 
approach is to start with both hybridization and wash conditions which are nonstringent. If 
non-specific bands or high background are observed after autoradiography, the filter can be 
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washed at high stringency and reexposed. If the time required for exposure makes this 
approach impractical, several hybridization and/or washing stringencies should be tested in 
parallel. 

Nucleic Acid Probe Assays 

Methods such as PCR, branched DNA probe assays, or blotting techniques utilizing 
nucleic acid probes according to the invention can determine the presence of cDNA or 
mRNA. A probe is said to "hybridize" with a sequence of the invention if it can form a duplex 
or double stranded complex, which is stable enough to be detected. 

The nucleic acid probes will hybridize to the Neisseria! nucleotide sequences of the 
invention (including both sense and antisense strands). Though many different nucleotide 
sequences will encode the amino acid sequence, the native Neisserial sequence is preferred 
because it is the actual sequence present in ceils. mRNA represents a coding sequence and so 
a probe should be complementary to the coding sequence; single-stranded cDNA is 
complementary to mRNA, and so a cDNA probe should be complementary to the non-coding 
sequence. 

The probe sequence need not be identical to the Neisserial sequence (or its 
complement) — some variation in the sequence and length can lead to increased assay 
sensitivity if the nucleic acid probe can form a duplex with target nucleotides, which can be 
detected. Also, the nucleic acid probe can include additional nucleotides to stabilize the 
formed duplex. Additional Neisserial sequence may also be helpful as a label to detect the 
formed duplex. For example, a non-complementary nucleotide sequence may be attached to 
the 5' end of the probe, with the remainder of the probe sequence being complementary to a 
Neisserial sequence. Alternatively, non-complementary bases or longer sequences can be 
interspersed into the probe, provided that the probe sequence has sufficient complementarity 
with the a Neisserial sequence in order to hybridize therewith and thereby form a duplex 
which can be detected. 

The exact length and sequence of the probe will depend on the hybridization 
conditions, such as temperature, salt condition and the like. For example, for diagnostic 
applications, depending on the complexity of the analyte sequence, the nucleic acid probe 
typically contains at least 10-20 nucleotides, preferably 15-25, and more preferably at least 30 
nucleotides, although it may be shorter than this. Short primers generally require cooler 
temperatures to form sufficiently stable hybrid complexes with the template. 
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Probes may be produced by synthetic procedures, such as the triester method of 
Matteucci et al [J. Am, Chem. Soc. (1981) 103:3185], or according to Urdea et al [Proa 
Natl. Acad Sci. USA (1983) 80: 7461], or using commercially available automated 
oligonucleotide synthesizers. 

The chemical nature of the probe can be selected according to preference. For certain 
applications, DNA or RNA are appropriate. For other applications, modifications may be 
incorporated eg. backbone modifications, such as phosphorothioates or methylphosphonates, 
can be used to increase in vivo half-life, alter RNA affinity, increase nuclease resistance etc. 
[eg. see Agrawal & Iyer (1995) Curr Opin Biotechnol 6:12-19; Agrawal (1996) TIBTECH 
14:376-387]; analogues such as peptide nucleic acids may also be used [eg. see Corey (1997) 
TIBTECH 15:224-229; Buchardt et al (1993) TIBTECH 1 1:384-386]. 

One example of a nucleotide hybridization assay is described by Urdea et al in 
international patent application WO92/02526 [see also US patent 5,124,246]. 

Alternatively, the polymerase chain reaction (PCR) is another well-known means for 
detecting small amounts of target nucleic acids. The assay is described in: Mullis et al [Meth. 
Enzymol (1987) 155: 335-350]; US patent 4,683,195; and US patent 4,683,202. Two 
"primer" nucleotides hybridize with the target nucleic acids and are used to prime the 
reaction. The primers can comprise sequence that does not hybridize to the sequence of the 
amplification target (or its complement) to aid with duplex stability or, for example, to 
incorporate a convenient restriction site. Typically, such sequence will flank the desired 
Neisserial sequence. 

A thermostable polymerase creates copies of target nucleic acids from the primers 
using the original target nucleic acids as a template. After a threshold amount of target nucleic 
acids are generated by the polymerase, they can be detected by more traditional methods, such 
as Southern blots. When using the Southern blot method, the labelled probe will hybridize to 
the Neisserial sequence (or its complement). 

Also, mRNA or cDNA can be detected by traditional blotting techniques described in 
Sambrook et al [supra], mRNA, or cDNA generated from mRNA using a polymerase 
enzyme, can be purified and separated using gel electrophoresis. The nucleic acids on the gel 
are then blotted onto a solid support, such as nitrocellulose. The solid support is exposed to a 
labelled probe and then washed to remove any unhybridized probe. Next, the duplexes 
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containing the labeled probe are detected. Typically, the probe is labelled with a radioactive 
moiety. 

EXAMPLES 

The examples describe nucleic acid sequences which have been identified in 
N. meningitidis, and N. gonorrhoeae along with their respective and putative translation 
products. Not all of the nucleic acid sequences are complete ie. they encode less than the full- 
length wild-type protein. 

The examples are generally in the following format: 

• a nucleotide sequence which has been identified in N. meningitidis 

• the putative translation product of said N. meningitidis sequence 

• a computer analysis of said translation product based on database comparisons 

• a corresponding nucleotide sequence identified from N. gonorrhoeae 

• the putative translation product of said N. gonorrhoeae sequence 

• a comparision of the percentage of identity between the translation product of the 
N. meningitidis sequence and the N. gonorrhoeae sequence. 

• a corresponding nucleotide sequence identified from strain A of N. meningitidis 

• the putative translation product of said N. meningitidis strain A sequence 

• a comparision of the percentage of identity between the translation product of the 
N. meningitidis sequence and the N. gonorrhoeae sequence. 

• a description of the characteristics of the protein which indicates that it might be 
suitably antigenic or immunogenic. 

Sequence comparisons were performed at NCBI (http://www.ncbi.nlm.nih.gov) using 
the algorithms BLAST, BLAST2, BLASTn, BLASTp, tBLASTn, BLASTx, & tBLASTx [eg. 
see also Altschul et al (1997) Gapped BLAST and PSI-BLAST: a new generation of protein 
database search programs. Nucleic Acids Research 25:2289-3402]. Searches were performed 
against the following databases: non-redundant GenBank+EMBL+DDBJ+PDB sequences 
and non-redundant GenBank CDS translations+PDB+SwissProt+SPupdate+PLR sequences. 

Dots within nucleotide sequences represent nucleotides which have been arbitrarily 
introduced in order to maintain a reading frame. In the same way, double-underlined 
nucleotides were removed . Lower case letters represent ambiguities which arose during 
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alignment of independent sequencing reactions (some of the nucleotide sequences in the 
examples are derived from combining the results of two or more experiments). 

Nucleotide sequences were scanned in all six reading frames to predict the presence of 
hydrophobic domains using an algorithm based on the statistical studies of Esposti et al. 
[Critical evaluation of the hydropathy of membrane proteins (1990) Eur J Biochem 190:207- 
219]. These domains represent potential transmembrane regions or hydrophobic leader 
sequences. 

Open reading frames were predicted from fragmented nucleotide sequences using the 
program ORFFINDER (NCBI). 

Underlined amino acid sequences indicate possible transmembrane domains or leader 
sequences in the ORFs, as predicted by the PSORT algorithm (http://www.psort.nibb.ac.jp). 
Functional domains were also predicted using the MOTIFS program (GCG Wisconsin & 
PROSITE). 

For each of the following examples: based on the presence of a putative leader 
sequence and/or several putative transmembrane domains (single-underlined) in the 
gonococcal protein, it is predicted that the proteins from N. meningitidis and N. gonorrhoeae, 
and their respective epitopes, could be useful antigens or immunogenic compositions for 
vaccines or diagnostics. 

The standard techniques and procedures which may be employed in order to perform 
the invention {e.g. to utilize the disclosed sequences for vaccination or diagnostic purposes) 
were summarized above. This summary is not a limitation on the invention but, rather, gives 
examples that may be used, but are not required. 

In particular, the following methods were used to express, purify and biochemically 
characterize the proteins of the invention. 
Chromosomal DNA Preparation 

N. meningitidis strain 2996 was grown to exponential phase in 100ml of GC medium, 
harvested by centrifugation, and resuspended in 5ml buffer (20%(w/v) Sucrose, 50mM Tris- 
HC1, 50mM EDTA, pH8). After 10 minutes incubation on ice, the bacteria were lysed by 
adding 10ml of lysis solution (50mM NaCl, 1% Na-Sarkosyl, 50|ag/ml Proteinase K), and the 
suspension incubated at 37°C for 2 hours. Two phenol extractions (equilibrated to pH 8) and 
one CHCb/isoamylalcohol (24:1) extraction were performed. DNA was precipitated by 
addition of 0.3M sodium acetate and 2 volumes of ethanol, and collected by centrifugation. 
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The pellet was washed once with 70%(v/v) ethanol and redissolved in 4.0ml TE buffer 
(lOmM Tris-HCl, ImM EDTA, pH 8.0). The DNA concentration was measured by reading 
the OD at 260 nm. 

Oligonucleotide design 

Synthetic oligonucleotide primers were designed on the basis of the coding sequence 
of each ORF, using (a) the meningococcus B sequence when available, or (b) the 
gonocbccus/meningococcus A sequence, adapted to the codon preference usage of 
meningococcus as necessary. Any predicted signal peptides were omitted, by designing the 5 ' 
primers to sequence immediately downstream from the predicted leader sequence. 

For most ORFs, the 5' primers included two restriction enzyme recognition sites 
(BamHl-Ndel, BamHl-Nhel, EcoRl-Ndel or EcoRl-Nhel), depending on the restriction pattern 
of the gene of interest. The 3' primers included a Xhol or a Hindlll restriction site (table 1). 
This procedure was established in order to direct the cloning of each amplification product 
(corresponding to each ORF) into two different expression systems: pGEX-KG (using 
BamHl-Xhol, BamHl-Hindlll, EcoRl-Xhol or EcoRVHindlll), and pET2 lb+ (using Ndel- 
Xhol, Nhel-Xhol, Ndel-Hindlll or Nhel-Hindm). 

5'-end primer tail: CGC GGATCCCATATG (BamHl-Ndel) 

CGC GGATCCGCTAGC (BamHl-Nhel) 
C C G GAATTC TA CATATG (EcoRl-Ndel) 
CCG GAATTC TA GCTAGC (EcoRI-Nhel) 

3'-end primer tail: CCCG CTCGAG (Xhol) 

CCCGCTCGAG (Hindlll) 

For cloning ORFs into the pGEX-His vector, the 5' and 3' primers contained only one 
restriction enzyme site (EcoRl, Kpn\ or Sail for the 5 ' primers and Psil, Xbal, Sphl or Sail for 
the 3' primers). Again restriction sites were chosen according to the particular restriction 
pattern of the gene (table 1). 

5'-end primer tail: (AAA) AAA GAATTC (EcoRl ) 

( AAA ) AAAGGTACC (Kpnl) 
3'-end primer tail: (AAA) AAA CTGCAG (Pstl) 

( AAA ) AAATCTAGA (Xbal) 
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AAA GCATGC (SphJ) 
5' or 3'-end primer tail: AAAAAA GTCGAC (Sa/I) 

As well as containing the restriction enzyme recognition sequences, the primers 
included nucleotides which hybridized to the sequence to be amplified. The melting 
temperature depended on the number and type of hybridising nucleotides in the whole primer, 
and was determined for each primer using the formulae: 

T m = 4 (G+C)+ 2 (A+T) (tail excluded) 

T m = 64.9 + 0.41 (% GC) - 600/N (whole primer) 

The melting temperatures of the selected oligonucleotides were usually 65-70°C for 
the whole oligo and 50-55°C for the hybridising region alone. 

Table 1 shows the forward and reverse primers used for each amplification. In certain 
cases, the sequence of the primer does not exactly match the sequence of the predicted ORF. 
This is because when initial amplifications were performed, the complete 5' and/or 3 5 
sequences for some meningococcal B ORFs were not known. However the corresponding 
sequences had been identified in Gonococcus or in Meningoccus A. Hence, when the 
Meningoccus B sequence was incomplete or uncertain, Gonococcal or Meningococcal A 
sequences were used as the basis for primer design. These sequences were altered to take 
account of codon preference. It can be appreciated that, once the complete sequence is 
identified, this approach will no longer be necessary. 

Oligonucleotides were synthesized using a Perkin Elmer 394 DNA/RNA Synthesizer, 
eluted from the columns in 2.0ml NH 4 OH, and deprotected by 5 hours incubation at 56°C. 
The oligos were precipitated by addition of 0.3M Na- Acetate and 2 volumes ethanol. The 
samples were centrifuged and the pellets resuspended in either 100|il or 1 .0ml of water. The 
OD260 was determined using a Perkin Elmer Lambda Bio spectophotometer and the 
concentration adjusted to 2-10pmol/nl. 

Amplification 

The standard PCR protocol was as follows: 50-200ng of genomic DNA was used as a 
template in the presence of 20-40|iM of each oligonucletide primer, 400-800|iM dNTPs 
solution, lx PCR buffer (including 1.5mM MgCl 2 ), 2.5 units TaqI DNA polymerase (using 
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Perkin-Elmer AmpliTaQ, GIBCO Platinum, Pwo DNA polymerase, or Tahara Shuzo Taq 
polymerase). In some cases, PCR was optimsed by the addition of 10|il DMSO or 50|al 2M 
Betaine. 

After a hot start (adding the polymerase during a preliminary 3 minute incubation of 
the whole mix at 95°C), each sample underwent a two-step amplification. The first 5 cycles 
were performed using the hybridization temperature that excluded the restriction enzyme tail 
of the primer (see above). This was followed by 30 cycles using the hybridization temperature 
calculated for the whole length oligos. The cycles were completed with a 10 minute extension 
step at 72°C. The standard cycles were as follows: 





Denaturation 


Hybridisation 


Elongation 


First 5 cycles 


30 seconds 
95°C 


30 seconds 
50-55°C 


30-60 seconds 
72°C 


Last 30 cycles 


30 seconds 
95°C 


30 seconds 
65-70°C 


30-60 seconds 
72°C 



Elongation times varied according to the length of the ORF to be amplified. 
Amplifications were performed using either a 9600 or a 2400 Perkin Elmer GeneAmp PCR 
System. To check the results, 1/10 of the amplification volume was loaded onto a 1-1.5% 
(w/v) agarose gel and the size of each amplified fragment compared with a DNA molecular 
weight marker. 

The amplified DNA was either loaded directly on a 1% agarose gel or first 
precipitated with ethanol and resuspended in a volume suitable to be loaded on a 1 .0% 
agarose gel. The DNA fragment corresponding to the band of correct size was purified using 
the Qiagen Gel Extraction Kit, following the manufacturer's protocol. DNA fragments were 
eluted in a volume of 30^1 or 50^1 with either H20 or lOmM Tris, pH 8.5. 

Digestion of PCR fragments 

The purified DNA corresponding to the amplified fragment was doubly-digested with 
the appropriate restriction enzymes for; cloning into pET-21b+ and expressing the protein as a 
C-terminus His-tagged fusion, for cloning into pGEX-KG and expressing the protein as a N- 
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terminus GST-fusion, and for cloning into pGEX-His and expressing the protein as a 
N-terminus GST-His tagged fusion. 

Each purified DNA fragment was incubated at 37°C for 3 hours to overnight with 20 
units of appropriate restriction enzyme (New England Biolabs) in a volume of either 30 or 
40|al in the presence of suitable digestion buffer. Digested fragments were purified using the 
QIAquick PCR purification kit (following the manufacturer's instructions) and eluted in a 
volume of 30|il or 50(al with either H20 or lOmM Tris, pH 8.5. The DNA concentration was 
determined by quantitative agarose gel electrophoresis (1.0% gel) in the presence of a titrated 
molecular weight marker. 

Digestion of the cloning vectors (pET22B, pGEX-KG, pTRC-His A, pET21b+, pGEX- 
KG, and pGEX-His) 

The vector pGEX-His is a modified pGEX-2T vector carrying a region encoding six 
histidine residues upstream of the thrombin cleavage site and containing the multiple cloning 
site of the vector pTRC99 (Pharmacia). 10 \ig plasmid was double-digested with 50 units of 
each restriction enzyme in 200 \x\ reaction volume in the presence of appropriate buffer by 
overnight incubation at 37°C After loading the whole digestion on a 1% agarose gel, the 
band corresponding to the digested vector was purified from the gel using the Qiagen 
QIAquick Gel Extraction Kit and the DNA was eluted in 50 \x\ of 10 mM Tris-HCl, pH 8.5. 
The DNA concentration was evaluated by measuring OD 2 6o of the sample, and adjusted to 50 
(ig/jxl. 1 \x\ of plasmid was used for each cloning procedure. 

10|ig of plasmid vector was doubly-digested with 50 units of each restriction enzyme 
in a volume of 200jal with the appropriate buffer overnight at 37°C. The digest was loaded 
onto a 1.0% agarose gel and the band corresponding to the digested vector purified using the 
Qiagen QIAquick Gel Extraction Kit. DNA was eluted in 50|il of lOmM Tris-HCl, pH 8.5. 
The DNA concentration was evaluated by measuring OD 2 60nm and the concentration adjusted 
to SOjag/jal. 1^1 of plasmid was used for each cloning procedure. 

Cloning 

For some ORFs, the fragments corresponding to each ORF, previously digested and 
purified, were ligated in both pET22b and pGEX-KG. In a final volume of 20 a molar 



WO 99/057280 



PCT/US99/09346 



58 

ratio of 3:1 fragment/vector was ligated using 0.5 [\\ of NEB T4 DNA ligase (400 units/jj.1), in 
the presence of the buffer supplied by the manufacturer. The reaction was incubated at room 
temperature for 3 hours. In some experiments, ligation was performed using the Boheringer 
"Rapid Ligation Kit", following the manufacturer's instructions. 

In order to introduce the recombinant plasmid in a suitable strain, 100 [il E. coli DH5 
competent cells were incubated with the ligase reaction solution for 40 minutes on ice, then at 
37°C for 3 minutes, then, after adding 800 |Lil LB broth, again at 37°C for 20 minutes. The 
cells were then centrifuged at maximum speed in an Eppendorf micro fuge and resuspended in 
approximately 200 |al of the supernatant. The suspension was then plated on LB ampicillin 
(100 mg/ml ). 

The screening of the recombinant clones was performed by growing 5 
randomly-chosen colonies overnight at 37 °C in either 2 ml (pGEX or pTC clones) or 5ml 
(pET clones) LB broth + 100 jag/ml ampicillin. The cells were then pelletted and the DNA 
extracted using the Qiagen QIAprep Spin Miniprep Kit, following the manufacturer's 
instructions, to a final volume of 30 |il 5 (il of each individual miniprep (approximately lg ) 
were digested with either NdeVXhol or BamHIIXhol and the whole digestion loaded onto a 1- 
1.5% agarose gel (depending on the expected insert size), in parallel with the molecular 
weight marker (1Kb DNA Ladder, GIBCO). The screening of the positive clones was made 
on the base of the correct insert size. 

For other ORFs, the fragments corresponding to each ORF, previously digested and 
purified, were ligated into both pET21b+ and pGEX-KG. A molar ratio of of 3:1 
fragment/vector was used in a final volume of 20(^1, that included 0.5|al T4 DNA ligase (400 
units/jxl, NEB) and ligation buffer supplied by the manufacturer. The reaction was performed 
at room temperature for 3 hours. In some experiments, ligation was performed using the 
Boheringer "Rapid Ligation Kit" and the manufacturer's protocol. 

Recombinant plasmid was transformed into 100p.l of competent E. coli DH5 or 
HB101 by incubating the ligase reaction solution and bacteria for 40 minutes on ice then at 
37°C for 3 minutes. This was followed by the addition of 800jal LB broth and incubation at 
37°C for 20 minutes. The cells were centrifuged at maximum speed in an Eppendorf 
microfuge, resuspended in approximately 200|il of the supernatant and plated onto LB 
ampicillin (lOOmg/ml ) agar. 
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Screening for recombinant clones was performed by growing 5 randomly selected 
colonies overnight at 37°C in either 2.0ml (pGEX-KG clones) or 5.0ml (pET clones) LB 
broth + lOOj-ig/ml ampicillin. Cells were pelleted and plasmid DNA extracted using the 
Qiagen QIAprep Spin Miniprep Kit, following the manufacturer's instructions. 
Approximately ljj.g of each individual miniprep was digested with the appropriate restriction 
enzymes and the digest loaded onto a 1 -1 .5% agarose gel (depending on the expected insert 
size), in parallel with the molecular weight marker (lkb DNA Ladder, GIBCO). Positive 
clones were selected on the basis of the size of insert. 

ORFs were cloned into PGEX-His, by doubly-digesting the PCR product and ligating 
into similarly digested vector. After cloning, recombinant plasmids were transformed into the 
E.coli host W3 1 10. Individual clones were grown overnight at 37°C in LB broth with 50^ig/ml 
ampicillin. 

Certain ORFs may be cloned into the pGEX-HIS vector using EcoRI-Pstl cloning 
sites, or EcoRI-SaR, or Sall-Pstl. After cloning, the recombinant plasmids may be introduced 
in the £xoli host W31 10. 

Expression 

Each ORF cloned into the expression vector may then be transformed into the strain 
suitable for expression of the recombinant protein product. 1 of each construct was used to 
transform 30 >il of E.coli BL21 (pGEX vector), E.coli TOP 10 (pTRC vector) or E.coli BL21- 
DE3 (pET vector), as described above. In the case of the pGEX-His vector, the same E.coli 
strain (W31 10) was used for initial cloning and expression. Single recombinant colonies were 
inoculated into 2ml LB+Amp (100 |ig/ml), incubated at 37°C overnight, then diluted 1 :30 in 
20 ml of LB+Amp (100 \xg/ml) in 100 ml flasks, making sure that the OD 6 oo ranged between 
0.1 and 0.15. The flasks were incubated at 30°C into gyratory water bath shakers until OD 
indicated exponential growth suitable for induction of expression (0.4-0.8 OD forpET and 
pTRC vectors; 0.8-1 OD for pGEX and pGEX-His vectors). For the pET, pTRC and pGEX- 
His vectors, the protein expression was induced by addiction of ImM IPTG, whereas in the 
case of pGEX system the final concentration of IPTG was 0.2 mM. After 3 hours incubation 
at 30°C, the final concentration of the sample was checked by OD. In order to check 
expression, 1ml of each sample was removed, centrifuged in a micro fuge, the pellet 
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resuspended in PBS, and analysed by 12% SDS-PAGE with Coomassie Blue staining. The 
whole sample was centrifuged at 6000# and the pellet resuspended in PBS for further use. 

GST-fusion proteins large-scale purification. 

For some ORFs, a single colony was grown overnight at 37°C on LB+Amp agar plate. 
The bacteria were inoculated into 20 ml of LB+Amp liquid colture in a water bath shaker and 
grown overnight. Bacteria were diluted 1 :30 into 600 ml of fresh medium and allowed to 
grow at the optimal temperature (20-37°C) to OD 55 o 0.8-1. Protein expression was induced 
with 0.2mM IPTG followed by three hours incubation. The culture was centrifuged at 8000 
rpm at 4°C. The supernatant was discarded and the bacterial pellet was resuspended in 7.5 ml 
cold PBS. The cells were disrupted by sonication on ice for 30 sec at 40W using a Branson 
sonifier B-15, frozen and thawed two times and centrifuged again. The supernatant was 
collected and mixed with 150|al Glutatione-Sepharose 4B resin (Pharmacia) (previously 
washed with PBS) and incubated at room temperature for 30 minutes. The sample was 
centrifuged at 700g for 5 minutes at 4C. The resin was washed twice with 10 ml cold PBS for 
10 minutes, resuspended in 1ml cold PBS, and loaded on a disposable column. The resin was 
washed twice with 2ml cold PBS until the flow-through reached OD280 of 0.02-0.06. The 
GST-fusion protein was eluted by addition of 700|nl cold Glutathione elution buffer lOmM 
reduced glutathione, 50mM Tris-HCl) and fractions collected until the OD 2 go was 0.1. 21 jal of 
each fraction were loaded on a 12% SDS gel using either Biorad SDS-PAGE Molecular 
weight standard broad range (Ml) (200, 1 16.25, 97.4, 66.2, 45, 31, 21.5, 14.4, 6.5 kDa) or 
Amersham Rainbow Marker (M") (220, 66, 46, 30, 21.5, 14.3 kDa) as standards. As the MW 
of GST is 26kDa, this value must be added to the MW of each GST-fusion protein. 

For other ORFs, for each clone to be purified as a GST- fusion, a single colony was 
streaked out and grown overnight at 37°C on a LB/Amp. (100)ig/ml) agar plate. An isolated 
colony from this plate was inoculated into 20ml of LB/Amp (100 |ag/ml) liquid medium and 
grown overnight at 37°C with shaking. The overnight culture was diluted 1 :30 into 600ml 
LB/Amp (100|ag/ml) liquid medium and allowed to grow at the optimal temperature (20- 
37°C) until the OD 55 onm reached 0.6-0.8. Recombinant protein expression was induced by 
addition of IPTG (final concentration 0.2mM) and the culture incubated for a further 3 hours. 
Bacteria were harvested by centrifugation at 8000xg for 15 min at 4°C. 
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The bacterial pellet was resuspended in 7.5ml cold PBS. Cells were disrupted by 
sonication on ice four times for 30 sec at 40W using a Branson sonifier 450 and centrifuged at 
13 OOOxg for 30 min at 4°C. The supernatant was collected and mixed with 150|al Glutatione- 
Sepharose 4B resin (Pharmacia), previously equilibrated with PBS, and incubated at room 
temperature with gentle agitation for 30 min. The batch-wise preparation was centrifuged at 
700xg for 5 min at 4°C and the supernatant discarded. The resin was washed twice 
(batchwise) with 10ml cold PBS for 10 min, resuspended in 1ml cold PBS, and loaded onto a 
disposable column. The resin continued to be washed with cold PBS, until the OD 2 80nm of the 
flow-through reached 0.02-0.01. The GST-fusion protein was eluted by addition of 700|j.l cold 
glutathione elution buffer (lOmM reduced glutathione, 50mM Tris-HCl pH 8.0) and fractions 
collected, until the OD 2 80nm of the eluate indicated all the recombinant protein was obtained. 
20|al aliquots of each elution fraction were analyzed by SDS-PAGE using a 12% gel. The 
molecular mass of the purified proteins was determined using either the Bio-Rad broad range 
molecular weight standard (Ml) (200, 116, 97.4, 66.2, 45.0, 31.0, 21.5, 14.4, 6.5 kDa) or the 
Amersham Rainbow Marker (M2) (220, 66.2, 46.0, 30.0, 21.5, 14.3 kDa). The molecular 
weights of GST-fusion proteins are a combination of the 26 kDa GST protein and its fusion 
partner. Protein concentrations were estimated using the Bradford assay. 

His-fusion soluble proteins large-scale purification. 

For some ORFs, a single colony was grown overnight at 37°C on a LB + Amp agar 
plate. The bacteria were inoculated into 20ml of LB+Amp liquid culture and incubated 
overnight in a water bath shaker. Bacteria were diluted 1:30 into 600ml fresh medium and 
allowed to grow at the optimal temperature (20-37°C) to OD 550 0.6-0.8. Protein expression 
was induced by addition of 1 mM EPTG and the culture further incubated for three hours. The 
culture was centrifuged at 8000 rpm at 4°C, the supernatant was discarded and the bacterial 
pellet was resuspended in 7.5ml cold lOmM imidazole buffer (300 mM NaCl, 50 mM 
phosphate buffer, 10 mM imidazole, pH 8). The cells were disrupted by sonication on ice for 
30 sec at 40W using a Branson sonifier B-15, frozen and thawed two times and centrifuged 
again. The supernatant was collected and mixed with 1 50jxl Ni 2+ -resin (Pharmacia) 
(previously washed with lOmM imidazole buffer) and incubated at room temperature with 
gentle agitation for 30 minutes. The sample was centrifuged at 700g for 5 minutes at 4°C. 
The resin was washed twice with 10 ml cold lOmM imidazole buffer for 10 minutes, 
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resuspended in 1ml cold lOmM imidazole buffer and loaded on a disposable column. The 
resin was washed at 4°C with 2ml cold lOmM imidazole buffer until the flow-through 
reached the O.D28O of 0.02-0.06. The resin was washed with 2ml cold 20mM imidazole 
buffer (300 mM NaCl, 50 mM phosphate buffer, 20 mM imidazole, pH 8) until the flow- 
through reached the O.D 2 so of 0.02-0.06. The His-fusion protein was eluted by addition of 
700|il cold 250mM imidazole buffer (300 mM NaCl, 50 mM phosphate buffer, 250 mM 
imidazole, pH 8) and fractions collected until the O.D 2 go was 0.1. 21|al of each fraction were 
loaded on a 12% SDS gel. 

His-fusion insoluble proteins large-scale purification. 

A single colony was grown overnight at 37 °C on a LB + Amp agar plate. The 
bacteria were inoculated into 20 ml of LB+Amp liquid culture in a water bath shaker and 
grown overnight. Bacteria were diluted 1 :30 into 600ml fresh medium and let to grow at the 
optimal temperature (37°C) to O.D550 0.6-0.8. Protein expression was induced by addition 
of 1 mM DPTG and the culture further incubated for three hours. The culture was centrifiiged 
at 8000rpm at 4°C. The supernatant was discarded and the bacterial pellet was resuspended in 
7.5 ml buffer B (urea 8M, lOmM Tris-HCl, lOOmM phosphate buffer, pH 8.8). The cells 
were disrupted by sonication on ice for 30 sec at 40W using a Branson sonifier B- 15, .frozen 
and thawed twice and centrifiiged again. The supernatant was stored at -20°C, while the 
pellets were resuspended in 2 ml guanidine buffer (6M guanidine hydrochloride, lOOmM 
phosphate buffer, 10 mM Tris-HCl, pH 7.5) and treated in a homogenizer for 10 cycles. The 
product was centrifiiged at 13000 rpm for 40 minutes. The supernatant was mixed with 150^1 
Ni 2+ -resin (Pharmacia) (previously washed with buffer B) and incubated at room temperature 
with gentle agitation for 30 minutes. The sample was centrifiiged at 700 g for 5 minutes at 
4°C. The resin was washed twice with 10 ml buffer B for 10 minutes, resuspended in 1ml 
buffer B, and loaded on a disposable column. The resin was washed at room temperature 
with 2ml buffer B until the flow-through reached the OD 2 go of 0.02-0.06. The resin was 
washed with 2ml buffer C (urea 8M, lOmM Tris-HCl, lOOmM phosphate buffer, pH 6.3) until 
the flow-through reached the O.D280 of 0.02-0.06. The His-fiision protein was eluted by 
addition of 700|il elution buffer (urea 8M, lOmM Tris-HCl, lOOmM phosphate buffer, pH 
4.5) and fractions collected until the OD280 was 0.1. 21 jal of each fraction were loaded on a 
12% SDS gel. 
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Purification of His-fusion proteins. 

For each clone to be purified as a His- fusion, a single colony was streaked out and 
grown overnight at 37°C on a LB/Amp (100 |ag/ml) agar plate. An isolated colony from this 
plate was inoculated into 20ml of LB/Amp (100 |ig/ml) liquid medium and grown overnight 
at 37°C with shaking. The overnight culture was diluted 1:30 into 600ml LB/Amp (100 
|ig/ml) liquid medium and allowed to grow at the optimal temperature (20-3 7°C) until the 
OD 5 50nm reached 0.6-0.8. Expression of recombinant protein was induced by addition of IPTG 
(final concentration l.OmM) and the culture incubated for a further 3 hours. Bacteria were 
harvested by centrifiigation at 8000xg for 15 min at 4°C. 

The bacterial pellet was resuspended in 7.5ml of either (i) cold buffer A (300mM 
NaCl, 50mM phosphate buffer, lOmM imidazole, pH 8.0) for soluble proteins or (ii) buffer B 
(8M urea, lOmM Tris-HCl, lOOmM phosphate buffer, pH 8.8) for insoluble proteins. Cells 
were disrupted by sonication on ice four times for 30 sec at 40W using a Branson sonifier 450 
and centrifuged at 13 OOOxg for 30 min at 4°C. For insoluble proteins, pellets were 
resuspended in 2.0 ml buffer C (6M guanidine hydrochloride, lOOmM phosphate buffer, 
lOmM Tris-HCl, pH 7.5) and treated with a Dounce homogenizer for 10 cycles. The 
homogenate was centrifuged at 13 OOOxg for 40 min and the supernatant retained. 

Supernatants for both soluble and insoluble preparations were mixed with 150jal Ni 2+ - 
resin (previously equilibrated with either buffer A or buffer B, as appropriate) and incubated 
at room temperature with gentle agitation for 30 min. The resin was Chelating Sepharose Fast 
Flow (Pharmacia), prepared according to manufacturers protocol. The batch-wise preparation 
was centrifuged at 700xg- for 5 min at 4°C and the supernatant discarded. The resin was 
washed twice (batch-wise) with 10ml buffer A or B for 10 min, resuspended in 1.0 ml buffer 
A or B and loaded onto a disposable column. The resin continued to be washed with either (i) 
buffer A at 4°C or (ii) buffer B at room temperature, until the OD280nm of the flow-through 
reached 0.02-0.01. The resin was further washed with either (i) cold buffer C (300rnM NaCl, 
50mM phosphate buffer, 20mM imidazole, pH 8.0) or (ii) buffer D (8M urea, lOmM Tris- 
HCl, lOOmM phosphate buffer, pH 6.3) until the the OD280n m of the flow-through reached 
0.02-0.01. The His-fusion protein was eluted by addition of 700|j,l of either (i) cold elution 
buffer A (300mM NaCl, 50mM phosphate buffer, 250mM imidazole, pH 8.0) or (ii) elution 
buffer B (8 M urea, lOmM Tris-HCl, lOOmM phosphate buffer, pH 4.5) and fractions 
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collected until the O.D 2 80nm indicated all the recombinant protein was obtained. 20(al aliquots 
of each elution fraction were analyzed by SDS-PAGE using a 12% gel. Protein concentrations 
were estimated using the Bradford assay. 

His-fusion proteins renaturation 

In the cases where denaturation was required to solubilize proteins, a renaturation step 
was employed prior to immunization. Glycerol was added to the denatured fractions obtained 
above to give a final concentration of 10%(v/v). The proteins were diluted to 200^ig/ml using 
dialysis buffer I (10% (v/v) glycerol, 0.5M arginine, 50mM phosphate buffer, 5.0mM reduced 
glutathione, 0.5mM oxidised glutathione, 2.0M urea, pH 8.8) and dialysed against the same 
buffer for 12-14 hours at 4°C. Further dialysis was performed with buffer II (10% (v/v) 
glycerol, 0.5M arginine, 50mM phosphate buffer, 5.0mM reduced glutathione, 0.5mM 
oxidised glutathione, pH 8.8) for 12-14 hours at 4°C. 

Alternatively, 10% glycerol was added to the denatured proteins. The proteins were 
then diluted to 20jxg/ml using dialysis buffer I (10% glycerol, 0.5M arginine, 50mM 
phosphate buffer, 5mM reduced glutathione, 0.5mM oxidised glutathione, 2M urea, pH 8.8) 
and dialysed against the same buffer at 4°C for 12-14 hours. The protein was further dialysed 
against dialysis buffer II (10% glycerol, 0.5M arginine, 50mM phosphate buffer, 5mM 
reduced glutathione, 0.5mM oxidised glutathione, pH 8.8) for 12-14 hours at 4°C. 

Protein concentration was evaluated using the formula: 

Protein (mg/ml) = (1.55 x OD 28 o) - (0.76 x OD 26 o) 

Purification of proteins 

To analyse the solubility, pellets obtained from 3.0ml cultures were resuspended in 
500^1 buffer Ml (PBS pH 7.2). 25|al of lysozyme (lOmg/ml) was added and the bacteria 
incubated for 15 min at 4°C. Cells were disrupted by sonication on ice four times for 30 sec at 
40W using a Branson sonifier 450 and centrifuged at 13 OOOxg for 30 min at 4°C. The 
supernatant was collected and the pellet resuspended in buffer M2 [8M urea, 0.5M NaCl, 
20mM imidazole and 0.1M NaH 2 P0 4 ] and incubated for 3 to 4 hours at 4°C. After 
centrifugation, the supernatant was collected and the pellet resuspended in buffer M3 [6M 
guanidinium-HCl, 0.5M NaCl, 20mM imidazole and 0.1M NaH 2 P0 4 ] overnight at 4°C. The 
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supernatants from all steps were analysed by SDS-PAGE. Some proteins were found to be 
soluble in PBS, others need urea or guanidium-HCl for solubilization. 

For preparative scale purifications, 500ml cultures were induced and fusion proteins 
solubilized in either buffer Ml, M2 or M3 using the procedure described above. Crude 
extracts were loaded onto a Ni-NTA superflow column (Quiagen) equilibrated with buffer 
Ml, M2 or M3 depending on the solubilization buffer employed. Unbound material was 
eluted by washing the column with the same buffer. The recombinant fusion protein was 
eluted with the corresponding buffer containing 500mM imidazole then dialysed against the 
same buffer in the absence of imidazole. 

Mice immunisations 

20|i,g of each purified protein are used to immunise mice intraperitoneal^. In the case 
of some ORFs, Balb-C mice were immunised with Al(OH) 3 as adjuvant on days 1,21 and 42, 
and immune response was monitored in samples taken on day 56. For other ORFs, CD1 mice 
could be immunised using the same protocol. For ORFs 25 and 40, CD1 mice were 
immunised using Freund's adjuvant, and the same immunisation protocol was used, except 
that the immune response was measured on day 42, rather than 56. Similarly, for still other 
ORFs, CD1 mice were immunised with Freund's adjuvant, but the immune response was 
measured on day 49. Alternatively, 20)ig of each purified protein was mixed with Freund's 
adjuvant and used to immunise CD1 mice intraperitoneally. For many of the proteins, the 
immunization was performed on days 1,21 and 35, and immune response was monitored in 
samples taken on days 34 and 49. For some proteins, the third immunization was performed 
on day 28, rather than 35, and the immune response was measured on days 20 and 42, rather 
than 34 and 49. 

ELISA assay (sera analysis) 

The acapsulated MenB M7 strain was plated on chocolate agar plates and incubated 
overnight at 37°C Bacterial colonies were collected from the agar plates using a sterile 
dracon swab and inoculated into 7ml of Mueller-Hinton Broth (Difco) containing 0.25% 
Glucose. Bacterial growth was monitored every 30 minutes by following OD 6 2o- The 
bacteria were let to grow until the OD reached the value of 0.3-0.4. The culture was 
centrifuged for 10 minutes at 10000 rpm. The supernatant was discarded and bacteria were 
washed once with PBS, resuspended in PBS containing 0.025% formaldehyde, and incubated 
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for 2 hours at room temperature and then overnight at 4°C with stirring. 100|j.l bacterial cells 
were added to each well of a 96 well Greiner plate and incubated overnight at 4°C. The wells 
were then washed three times with PBT washing buffer (0.1% Tween-20 in PBS). 200 jil of 
saturation buffer (2.7% Polyvinylpyrrolidone 10 in water) was added to each well and the 
plates incubated for 2 hours at 37°C. Wells were washed three times with PBT. 200 |il of 
diluted sera (Dilution buffer: 1% BSA, 0.1% Tween-20, 0.1% NaN 3 in PBS) were added to 
each well and the plates incubated for 90 minutes at 37°C. Wells were washed three times 
with PBT. 100 jal of HRP-conjugated rabbit anti-mouse (Dako) serum diluted 1 :2000 in 
dilution buffer were added to each well and the plates were incubated for 90 minutes at 37°C. 
Wells were washed three times with PBT buffer. 100 jal of substrate buffer for HRP (25 ml of 
citrate buffer pH5, 10 mg of O-phenildi amine and 10 |il of H 2 0) were added to each well and 
the plates were left at room temperature for 20 minutes. 100 (il H2SO4 was added to each 
well and OD490 was followed. The ELISA was considered positive when OD490 was 2.5 
times the respective pre-immune sera. 

Alternatively, The acapsulated MenB M7 strain was plated on chocolate agar plates 
and incubated overnight at 37°C. Bacterial colonies were collected from the agar plates using 
a sterile dracon swab and inoculated into Mueller-Hinton Broth (Difco) containing 0.25% 
Glucose. Bacterial growth was monitored every 30 minutes by following OD620. The bacteria 
were let to grow until the OD reached the value of 0.3-0.4. The culture was centrifuged for 10 
minutes at 10 OOOrpm. The supernatant was discarded and bacteria were washed once with 
PBS, resuspended in PBS containing 0.025% formaldehyde, and incubated for 1 hour at 37°C 
and then overnight at 4°C with stirring. 100p.l bacterial cells were added to each well of a 96 
well Greiner plate and incubated overnight at 4°C. The wells were then washed three times 
with PBT washing buffer (0.1% Tween-20 in PBS). 200|^1 of saturation buffer (2.7% 
Polyvinylpyrrolidone 10 in water) was added to each well and the plates incubated for 2 hours 
at 37°C. Wells were washed three times with PBT. 200|al of diluted sera (Dilution buffer: 1% 
BSA, 0.1% Tween-20, 0.1% NaN 3 in PBS) were added to each well and the plates incubated 
for 2 hours at 37°C. Wells were washed three times with PBT. 100|il of HRP-conjugated 
rabbit anti-mouse (Dako) serum diluted 1 :2000 in dilution buffer were added to each well and 
the plates were incubated for 90 minutes at 37°C. Wells were washed three times with PBT 
buffer. 100|il of substrate buffer for HRP (25ml of citrate buffer pH5, lOmg of O- 
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phenildiamine and lOjil of H 2 0 2 ) were added to each well and the plates were left at room 
temperature for 20 minutes. lOO^il of 12.5% H 2 S0 4 was added to each well and OD490 was 
followed. The ELISA titers were calculated abitrarely as the dilution of sera which gave an 
OD490 value of 0.4 above the level of preimmune sera. The ELISA was considered positive 
when the dilution of sera with OD490 of 0.4 was higher than 1 :400. 

FACScan bacteria Binding Assay procedure. 

The acapsulated MenB M7 strain was plated on chocolate agar plates and incubated 
overnight at 37°C. Bacterial colonies were collected from the agar plates using a sterile 
dracon swab and inoculated into 4 tubes containing 8ml each Mueller-Hinton Broth (Difco) 
containing 0.25% glucose. Bacterial growth was monitored every 30 minutes by following 
OD 6 2o- The bacteria were let to grow until the OD reached the value of 0.35-0.5. The culture 
was centrifiiged for 10 minutes at 4000rpm. The supernatant was discarded and the pellet was 
resuspended in blocking buffer (1% BSA in PBS, 0.4% NaN 3 ) and centrifiiged for 5 minutes 
at 4000rpm. Cells were resuspended in blocking buffer to reach OD 6 2o of 0.07. lOOjal bacterial 
cells were added to each well of a Costar 96 well plate. lOOjal of diluted (1:100, 1:200, 1:400) 
sera (in blocking buffer) were added to each well and plates incubated for 2 hours at 4°C. 
Cells were centrifiiged for 5 minutes at 4000rpm, the supernatant aspirated and cells washed 
by addition of 200|al/well of blocking buffer in each well. lOOjal of R-Phicoerytrin conjugated 
F(ab) 2 goat anti-mouse, diluted 1:100, was added to each well and plates incubated for 1 hour 
at 4°C. Cells were spun down by centrifugation at 4000rpm for 5 minutes and washed by 
addition of 200|al/well of blocking buffer. The supernatant was aspirated and cells 
resuspended in 200^1/well of PBS, 0.25% formaldehyde. Samples were transferred to 
FACScan tubes and read. The condition for FACScan (Laser Power 15mW) setting were: FL2 
on; FSC-H threshold:92; FSC PMT Voltage: E 01; SSC PMT: 474; Amp. Gains 6.1; FL-2 
PMT: 586; compensation values: 0. 

OMV preparations 

Bacteria were grown overnight on 5 GC plates, harvested with a loop and resuspended 
in 10 ml 20mM Tris-HCl. Heat inactivation was performed at 56°C for 30 minutes and the 
bacteria disrupted by sonication for 10' on ice ( 50% duty cycle, 50% output ). Unbroken 
cells were removed by centrifugation at 5000# for 10 minutes and the total cell envelope 
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fraction recovered by centrifugation at 50000g at 4°C for 75 minutes. To extract cytoplasmic 
membrane proteins from the crude outer membranes, the whole fraction was resuspended in 
2% sarkosyl (Sigma) and incubated at room temperature for 20 minutes. The suspension was 
centrifuged at 1 0000# for 1 0 minutes to remove aggregates, and the supernatant further 
ultracentrifiiged at 50000^ for 75 minutes to pellet the outer membranes. The outer 
membranes were resuspended in lOmM Tris-HCl, pH8 and the protein concentration 
measured by the Bio-Rad Protein assay, using BSA as a standard. 

Whole Extracts preparation 

Bacteria were grown overnight on a GC plate, harvested with a loop and resuspended 
in 1ml of 20mM Tris-HCl. Heat inactivation was performed at 56°C for 30' minutes. 

Western blotting 

Purified proteins (500ng/lane), outer membrane vesicles (5\xg) and total cell extracts 
(25 ^ig) derived from MenB strain 2996 were loaded onto a 12% SDS-polyacrylamide gel and 
transferred to a nitrocellulose membrane. The transfer was performed for 2 hours at 150mA at 
4°C, using transfer buffer (0.3 % Tris base, 1.44 % glycine, 20% (v/v) methanol). The 
membrane was saturated by overnight incubation at 4°C in saturation buffer (10% skimmed 
milk, 0.1% Triton X100 in PBS). The membrane was washed twice with washing buffer (3% 
skimmed milk, 0.1% Triton XI 00 in PBS) and incubated for 2 hours at 37°C with mice sera 
diluted 1 :200 in washing buffer. The membrane was washed twice and incubated for 90 
minutes with a 1 :2000 dilution of horseradish peroxidase labelled anti-mouse Ig. The 
membrane was washed twice with 0.1% Triton XI 00 in PBS and developed with the Opti- 
4CN Substrate Kit (Bio-Rad). The reaction was stopped by adding water. 

Bactericidal assay 

MC58 and 2996 strains were grown overnight at 37°C on chocolate agar plates. 5-7 
colonies were collected and used to inoculate 7ml Mueller-Hinton broth. The suspension was 
incubated at 37°C on a nutator and let to grow until OD620 was in between 0.5-0.8. The 
culture was aliquoted into sterile 1.5ml Eppendorf tubes and centrifuged for 20 minutes at 
maximum speed in a microfiige. The pellet was washed once in Gey's buffer (Gibco) and 
resuspended in the same buffer to an OD 6 20 of 0.5, diluted 1 :20000 in Gey's buffer and stored 
at 25°C. 
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50(il of Gey's buffer/1% BSA was added to each well of a 96-well tissue culture plate. 
25(il of diluted (1:100) mice sera (dilution buffer: Gey's buffer/0.2% BSA) were added to 
each well and the plate incubated at 4°C. 25|al of the previously described bacterial 
suspension were added to each well. 25|il of either heat-inactivated (56°C waterbath for 30 
minutes) or normal baby rabbit complement were added to each well. Immediately after the 
addition of the baby rabbit complement, 22\i\ of each sample/well were plated on Mueller- 
Hinton agar plates (time 0). The 96-well plate was incubated for 1 hour at 37°C with rotation 
and then 22|il of each sample/well were plated on Mueller-Hinton agar plates (time 1). After 
overnight incubation the colonies corresponding to time 0 and time lh were counted. 

Gene Variability 

The ORF4 and 919 genes were amplified by PCR on chromosomal DNA extracted 
from various Neisseria strains (see list of strains). The following oligonucleotides used as 
PCR primers were designed in the upstream and downstream regions of the genes: 

orf 4.1 (forward) CGAATCCGGACGGCAGGACTC 

orf4.3 (reverse) GGCAGGGAATGGCGGATTAAAG 

919.1 (forward) AAAATGCCTCTCCACGGCTG or 

CTGCGCCCTGTGTTAAAATCCCCT 
919.6 (reverse) C AAAT AAG AAAGG AAT T T T G or 

GGTATCGCAAAACTTCGCCTTAATGCG 

The PCR cycling conditions were: 

1 cycle 2 min. at 94° 

30 cycles 30 sec. at 94° 

30 sec. at ~ 54° or ~ 60° ( in according to Tm of the primers) 

40 sec. at 72° 
1 cycle 7 min. at 72° 

The PCR products were purified from 1 % agarose gel and sequenced using the 
following primers: 

orf 4.1 (forward) CGAATCCGGACGGCAGGACTC 
orf 4.2 (forward) CGACCGCGCCTTTGGGACTG 
orf 4.3 (reverse) GGCAGGGAATGGCGGATTAAAG 
orf 4.4 (reverse) TCTTTGAGTTTGATCCAACC 
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919.1 (forward) AAAATGCCTCTCCACGGCTG or 

CTGCGCCCTGTGTTAAAATCCCCT 

919.2 (forward) ATCCTTCCGCCTCGGCTGCG 

919.3 (forward) AAAACAGCGGCACAATCGAC 

919.4 (forward) ATAAGGGCTACCTCAAACTC 

919.5 (forward) GCGCGTGGATTATTTTTGGG 

919.6 (reverse) CAAATAAGAAAGGAATTTTG or 

GGTATCGCAAAACTTCGCCTTAATGCG 

919.7 (reverse) CCCAAGGTAATGTAGTGCCG 

919.8 (reverse) TAAAAAAAAGTTCGACAGGG 

919.9 (reverse) CCGTCCGCCTGTCGTCGCCC 

919.10 (reverse) TCGTTCCGGCGGGGTCGGGG 

All documents cited herein are incorporated by reference in their entireties. 
The following Examples are presented to illustrate, not limit, the invention 

EXAMPLE 1 

Using the above-described procedures, the following oligonucleotide primers were 
employed in the polymerase chain reaction (PCR) assay in order to clone the ORFs as 
indicated: 



Table 1: Oligonucleotides used for PCR for Examples 2-10 



ORF 


Primer 


Sequence 


Restriction sites 


279 


Forward 
Reverse 


CGCGGATCCCATATG-TTGCCTGCAATCACGATT 
<SEQ ID 3021 > 

CCCGCTCGAG-TTTAGAAGCGGGCGGCAA <SEQ 
ID 3022> 


BamHI-Ndel 
Xhol 


519 


Forward 
Reverse 


CGCGGATCCCATATG-TTCAAATCCTTTGTCGTCA 
<SEQ ID 3023> 

CCCGCTCGAG-I I I GGCGG I I I I GCTGC <SEQ ID 
3024> 


BamHI-Ndel 
Xhol 


576 


Forward 
Reverse 


CGCGGATCCCATATG-GCCGCCCCCGCATCT 
<SEQ ID 3025> 

CCCGCTCGAG-AI I I AC I I I I I I GATGTCGAC 
<SEQ ID 3026> 


BamHI-Ndel 
Xhol 


919 


Forward 
Reverse 


CGCGGATCCCATATG-TGCCAAAGCAAGAGCATC 
<SEQ ID 3027> 

CCCGCTCGAG-CGGGCGGTATTCGGG <SEQ ID 
3028> 


BamHI-Ndel 
Xhol 


121 


Forward 


CGCGGATCCCATATG-GAAACACAGCTTTACAT 
<SEQ ID 3029> 


BamHI-Ndel 
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Reverse 


CCCGCTCGAG-ATAATAATATCCCGCGCCC <SEQ 
ID 3030> 


Xhol 


128 


Forward 
Reverse 


CGCGGATCCCATATG-ACTGACAACGCACT <SEQ 
ID 3031 > 

CCCGCTCGAG-GACCGCGTTGTCGAAA <SEQ ID 
3032> 


BamHI-Ndel 
Xhol 


206 


Forward 
Reverse 


CGCGGATCCCATATG-AAACACCGCCAACCGA 
<SEQ ID 3033> 

CCCGCTCGAG-TTCTGTAAAAAAAGTATGTGC 
<SEQ ID 3034> 


BamHI-Ndel 
Xhol 


287 


Forward 
Reverse 


CCGGAATTCTAGCTAGC-C I I I CAGCCTGCGGG 
<SEQ ID 3035> 

CCCGCTCGAG-ATCCTGCTC I till IGCC <SEQ ID 
3036> 


EcoRI-Nhel 
Xhol 


406 


Forward 
Reverse 


CGCGGATCCCATATG-TGCGGGACACTGACAG 
<SEQ ID 3037> 

CCCGCTCGAG-AGGTTGTCCTTGT CTATG <SEQ 
ID 3038> 


BamHI-Ndel 
Xhol 



Localization of the ORFs 

The following DNA and amino acid sequences are identified by titles of the following 
form: [g, m, or a] [#].[seq or pep], where "g" means a sequence from N. gonorrhoeae, M m" 
means a sequence from N. meningitidis B, and "a" means a sequence from N, meningitidis A; 
"#" means the number of the sequence; "seq ,f means a DNA sequence, and "pep" means an 
amino acid sequence. For example, "gOOl.seq" refers to an N. gonorrohoeae DNA sequence, 
number 1 . The presence of the suffix "-1" to these sequences indicates an additional sequence 
found for the same ORF, thus, data for an ORF having both an unsuffixed and a suffixed 
sequence designation applies to both such designated sequences. Further, open reading 
frames are identified as ORF #, where "#" means the number of the ORF, corresponding to 
the number of the sequence which encodes the ORF, and the ORF designations may be 
suffixed with ".ng" or ".a", indicating that the ORF corresponds to a N. gonorrhoeae sequence 
or a N. meningitidis A sequence, respectively. The word "partial" before a sequence indicates 
that the sequence may be a partial or a complete ORF. Computer analysis was performed for 
the comparisons that follow between "g", "m", and "a" peptide sequences; and therein the 
"pep" suffix is implied where not expressly stated. Further, in the event of a conflict between 
the text immediately preceding and describing which sequences are being compared, and the 
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designated sequences being compared, the designated sequence controls and is the actual 
sequence being compared. 



ORF: 


contig: 










279 


gnm4.seq 










The following partial DNA sequence was identified in N. meningitidis <SEQ 


m279.seq 












i 


ATAACGCGGA 


TTTGCGGCTG 


CTTGATTTCA 


ACGGTTTTCA 


GGGCTTCGGC 


51 


AAGTTTGTCG 


GCGGCGGGTT 


TCATCAGGCT 


GCAATGGGAA 


GGTACGGACA 


101 


CGGGCAGCGG 


CAGGGCGCGT 


TTGGCACCGG 


CTTCTTTGGC 


GGCAGCCATG 


151 


GCGCGTCCGA 


CGGCGGCGGC 


GTTGCCTGCA 


ATCACGATTT 


GTCCGGGTGA 


201 


GTTGAAGTTG 


ACGGC TTCG A 


CCACTTCGCT 


TTGGGCGGCT 


TCGG C AC AAA 


251 


TGGCTTTAAC 


CTGCTCATCT 


TCCAAGCCGA 


GAATCGCCGC 


CATTGCGCCC 


301 


ACGCCTTGCG 


GTACGGCGGA 


CTGCATCAGT 


TCGGCGCGCA 


GGCGCACGAG 


351 


TTTGACCGCG 


TCGGCAAAAT 


TCAATGCGCC 


GGCGGCAACG 


AGTGCGGTGT 


401 


ATTCGCCGAG 


GCTGTGTCCG 


GCAACGGCGG 


CAGGCGTTTT 


GCCGCCCGCT 


451 


TCTAAATAG 










This corresponds to the amino acid 


sequence <SEQ ID 3040; ORF 279>: 


m279.pep 












i 


ITRICGCLIS 


TVFRASASLS 


AAGFIRLQWE 


GTDTGSGRAR 


LAPASLAAAM 


51 


ARPTAAALPA 


ITICPGELKL 


TASTTSLWAA 


SAQMALTCSS 


SKPRIAAIAP 


101 


TPCGTADCIS 


SARRRTSLTA 


SAKFNAPAAT 


SAVYSPRLCP 


ATAAGVLPPA 


151 


SK* 











The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 304 1>: 

g279 . seq 

1 atgacgcgga tttgcggctg cttgatttca acggttttga gtgtttcggc 

51 aagtttgtcg gcggcgggtt tcatcaggct gcaatgggaa ggaacggata 

101 ccggcagcgg cagggcgcgt ttggctccgg cttctttggc ggcagccatg 

151 gtgcgtccga cggcggcggc gttgcctgca atcacgactt gtccgggcga 

201 gttgaagttg acggcttcga ccacttcgcc ctgtgcggat tcggcacaaa 

251 tctgcctgac ctgttcatct tccaaaccca aaatggccgc cattgcgcct 

3 01 acgccttgcg gtacggcgga ctgcatcagt tcggcgcgca ggcggacgag 

3 51 tttgacggca tcggcaaaat ccaatgcttc ggcggcgaca agcgcggtgt 

401 attcgccgag gctgtgtccg gcaacggcgg caggcgtttt gccgcccact 

451 tccaaatag 

This corresponds to the amino acid sequence <SEQ ID 3042; ORF 279.ng>: 

g2 7 9 -pep 

1 MTRICGCLIS TVLSVSASLS AAGFIRLQWE GTDTGSGRAR LAPASLAAAM 
51 VRPTAAA LPA ITTCPGELKL TASTTSPCAD SAQICLTCSS SKPKMAAIAP 
101 TPCGTADCIS SARRRTSLTA SAKSNASAAT SAVYSPRLCP ATAAGVL PPT 
151 SK* 

ORF 279 shows 89.5% identity over a 152 aa overlap with a predicted ORF (ORF 279.ng) 
from N, gonorrhoeae: 

10 20 30 40 50 60 

m279 . pep ITRICGCLISTVFRASASLSAAGFIRLQWEGTDTGSGRARLAPASLAAAMARPTAAALPA 

MINIMUM: M Ml IM MMMI Mi M M M MM I IMM MM M I MM I 

g2 7 9 MTRICGCLISTVLSVSASLSAAGFIRLQWEGTDTGSGRARLAPASLAAAMVRPTAAALPA 

10 20 30 40 50 60 
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70 80 90 100 110 120 

m279 . pep ITICPGELKLTASTTSLWAASAQMALTCSSSKPRIAAIAPTPCGTADCISSARRRTSLTA 

II. II MM II I MM I llh MMM I hMI III II I II Ml MM II I III M 

g2 79 ITTCPGELKLTASTTSPCADSAQICLTCSSSKPKMAAIAPTPCGTADCISSARRRTSLTA 

70 80 90 100 110 120 



130 140 150 

m2 79 . pep SAKFNAPAATSAVYSPRLCPATAAGVLPPASKX 

Ml M IMIIMIMMIMIIIIIIhIM 

g2 79 SAKSNASAATSAVYSPRLCPATAAGVLPPTSKX 

130 140 150 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3043>: 

a279 . seq 

1 ATGACNCNGA TTTGCGGCTG CTTGATTTCA ACGGTTTNNA GGGCTTCGGC 

51 GAGTTTGTCG GCGGCGGGTT TCATGAGGCT GC7AATGGGAA GGTACNGACA 

101 CNGGCAGCGG CAGGGCGCGT TTGGCGCCGG CTTCTTTGGC GGCAAGCATA 

151 GCGCGCTCGA CGGCGGCGGC ATTGCCTGCA ATCACGACTT GTCCGGGCGA 

2 01 GTTG7AAGTTG ACGGCTTCAA CCACTTCATC CTGTGCGGAT TCGGCGCAAA 
251 TTTGTTTTAC CTGTTCATCT TCCAAGCCGA GAATCGCCGC CATTGCGCCC 
301 ACGCCTTGCG GTACGGCGGA CTGCATCAGT TCGGCGCGCA NGCGCACGAG 

3 51 TTTGACCGCG TCGGCAAAAT CCAATGCGCC GGCGGCAACN AGTGCGGTGT 

4 01 ATTCGCCGAN GCTGTGTCCG GCAACGGCGG CAGGCGTTTT GCCGCCCGCT 
4 51 TCCGAATAG 

This corresponds to the amino acid sequence <SEQ ID 3044; ORF 279. a>: 

a279.pep 

1 MTXICGCLIS TVXRASASLS AAGFMRLQWE GTDTGSGRAR LAPASLAAS1 
51 ARS T AAAL PA ITTCPGELKL TASTTSSCAD SAQICFTCSS SKPRIAAIAP 
101 TPCGTADCIS SARXRTSLTA SAKSNAPAAT SAVYSPXLCP ATAAGVLPPA 
151 SE* 

m279/a279 ORFs 279 and 279.a showed a 88.2% identity in 152 aa overlap 

10 20 30 40 50 60 

m27 9.pep ITRICGCLISTVFRASASLSAAGFIRLQWEGTDTGSGRARLAPASLAAAMARPTAAALPA 
: I I I I I I I I M I I I I I I I I I I I : M II I i II I I I I I I M I I I I I II : : I I I I M I I I 
a27 9 MTXICGCLISTVXRASASLSAAGFMRLQWEGTDTGSGRARLAPASLAASIARSTAAALPA 

10 20 30 40 50 60 



70 80 90 100 110 120 

m27 9 . pep ITICPGELKLTASTTSLWAASAQMALTCSSSKPRIAAIAPTPCGTADCISSARRRTSLTA 
II I I I I t I I I I I I I I I 111: : I M II I I I I M I II I I II I I I II I I II I I I I I I 
a27 9 I TTCPGELKLTASTTSSCADSAQICFTCSSSKPRIAAIAPTPCGTADCIS SARXRTSLTA 

70 80 90 100 110 120 



130 140 150 

m2 7 9 . pe p SAKFNAPAAT S AV Y S PRLC PATAAG VLP PAS KX 

III I I II I I I I I I M II M I I I I I I I I II : I 
a 2 7 9 SAKSNAPAAT SAVY S PXLC PATAAG VLP PAS EX 

130 140 150 



519 and 519-1 gnm7.seq 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3045>: 

m519.seq (partial) 

1 . . TCCGTTATCG GGCGTATGGA GTTGGACAAA ACGTTTGAAG AACGCGACGA 

51 AATCAACAGT ACTGTTGTTG CGGCTTTGGA CGAGGCGGCC GGGgCTTgGG 

101 GTGTGAAGGT TTTGCGTTAT GAGATTAAAG ACTTGGTTCC GCCGCAAGAA 

151 ATCCTTCGCT CAATGCAGGC GCAAATTACT GCCGAACGCG AAAAACGCGC 

201 CCGTATCGCC GAATCCGAAG GT CGT AAAAT CGAACAAATC AACCTTGCCA 
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2 51 GTGGTCAGCG CGAAGCCGAA 

3 01 GCGGTCAATG CGTCAAATGC 

3 51 AGGTGAAGCG GAATCCTTGC 

4 01 TCCGTCAAAT TGCCGCCGCC 
451 AATCTGAAGA TTGCGGAACA 

5 01 AGAAAGCAAT ACGCTGATTA 
5 51 TGATTTCTGC CGG T ATG AAA 



ATCCAACAAT CCGAAGGCGA GGCTCAGGCT 
CGAGAAAATC GCCCGCATCA ACCGCGCCAA 
GCCTTGTTGC CGAAGCCAAT GCCGAAGCCA 
CTTCAAACCC AAGGCGGTGC GGATGCGGTC 
ATACGTCGCT GCGTTCAACA ATCTTGCCAA 
TGCCCGCCAA TGTTGCCGAC ATCGGCAGCC 
ATTATCGACA GCAGCAAAAC CGCCAAaTAA 



This corresponds to the amino acid sequence <SEQ ID 3046; ORF 519>: 

m519.pep (partial) 

1 . . SVIGRMELDK TFEERDEINS TWAALDEAA GAWGVKVLRY EIKDLVPPQE 

51 ILRSMQAQIT AEREKRARIA ESEGRKIEQI NLASGQREAE IQQSEGEAQA 

101 AVNASNAEKI ARINRAKGEA ESLRLVAEAN AEAIRQIAAA LQTQGGADAV 

151 NLKIAEQYVA AFNNLAKESN TL I MPANVAD IGSLISAGMK I IDSSKTAK* 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 3047>: 

g519 . seq 

1 atggaatttt tcattatctt gttggcagcc gtcgccgttt tcggcttcaa 

51 atcctttgtc gtcatccccc agcaggaagt ccacgttgtc gaaaggctcg 

101 ggcgtttcca tcgcgccctg acggccggtt tgaatatttt gattcccttt 

151 atcgaccgcg tcgcctaccg ccattcgctg aaagaaatcc ctttagacgt 

201 acccagccag gtctgcatca cgcgcgataa tacgcaattg actgttgacg 

251 gcatcatcta tttccaagta accgatccca aactcgcctc atacggttcg 

3 01 agcaactaca ttatggcaat tacccagctt gcccaaacga cgctgcgttc 

3 51 cgttatcggg cgtatggagt tggacaaaac gtttgaagaa cgcgacgaaa 

4 01 tcaacagtac cgtcgtctcc gccctcgatg aagccgccgg ggcttggggt 
4 51 gtgaaagtcc tccgttacga aatcaaggat ttggttccgc cgcaagaaat 
501 ccttcgcgca atgcaggcac aaattaccgc cgaacgcgaa aaacgcgccc 
551 gtattgccga atccgaaggc cgtaaaatcg aacaaatcaa ccttgccagt 
601 ggtcagcgtg aagccgaaat ccaacaatcc gaaggcgagg ctcaggctgc 
651 ggtcaatgcg tccaatgccg agaaaatcgc ccgcatcaac cgcgccaaag 
701 gcgaagcgga atccctgcgc cttgttgccg aagccaatgc cgaagccaac 
751 cgtcaaattg ccgccgccct tcaaacccaa agcggggcgg atgcggtcaa 
801 tctgaagatt gcgggacaat acgttaccgc gttcaaaaat cttgccaaag 
851 aagacaatac gcggattaag cccgccaagg ttgccgaaat cgggaaccct 
901 aattttcggc ggcatgaaaa attttcgcca gaagcaaaaa cggccaaata 
951 a 

This corresponds to the amino acid sequence <SEQ ID 3048; ORF 519.ng>: 

g519 .pep 

1 MEFFIILLAA VAVFG FKSFV VIPQQEVHW ERLGRFHRAL TAGLNILIPF 

51 I DRVAYRH S L» KEIPLDVPSQ VCITRDNTQL TVDGI IYFQV TDPKLASYGS 

101 SNYIMAITQL AQTTLRSVIG RMELDKTFEE RDEINSTWS ALDEAAGAWG 

151 VKVLRYEIKD LVPPQEILRA MQAQITAERE KRARIAESEG RKIEQINLAS 

2 01 GQREAEIQQS EGEAQAAVNA SNAEKIARIN RAKGEAESLR LVAEANAEAN 

2 51 RQIAAALQTQ SGADAVNLKI AGQYVTAFKN LAKEDNTRIK PAKVAEIGNP 

3 01 NFRRHEKFSP EAKTAK* 



ORF 519 shows 87.5% identity over a 200 aa overlap with a predicted ORF (ORF 519.ng) 
from N. gonorrhoeae: 

m519/g519 

10 20 30 

m519 .pep SVIGRMELDKTFEERDEINSTWAALtDEAA 

I I I I M I M I I I ! I I I I I I I I I h I I II I I 

g519 YFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIGRMELDKTFEERDEINSTVVSALDEAA 
90 100 110 120 130 140 



40 50 60 70 80 90 

m5 19 . pep GAWGVKVLRYEIKDLVPPQEILRSMQAQITAEREKRARIAESEGRKIEQINLASGQREAE 
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gS19 



MIIIIMIIMMIIIIIIIIhlMIMIIIIIIIIIIIMIIMIIMIIMIIIM 

GAWGVKVLRYE I KDLVP PQE I LRAMQAQ I TAEREKRARI AESEGRK I EQI NLASGQREAE 
150 160 170 180 190 200 



100 110 120 130 140 150 

m519 . pep I QQ S EG E AQ AAVNAS N AE K I AR I NRAKG E AE S LRLV AEANAE A I RQ I AAALQ TQGGAD AV 

I 1 { I I 1 1 I I I 1 I I 1 I I t I I I i { I t 1 1 I I I I I I 1 1 I I I I 1 1 1 1 I I !l I I I Mlhl I II I 

g519 IQQSEGEAQAAVNASNAEKIARINRAKGEAESLRLVAEANAEANRQIAAALQTQSGADAV 
210 220 230 240 250 260 

160 170 180 190 200 

m519 .pep NLKIAEQWAAFNNLAKESNTL I MPANVADIGSL- ISAGMKI IDSSKTAK 

Mill MhlhlMlhll I Ihlhlh = h HIM 

g519 NLKIAGQYVTAFKNLiAKEDNTRIKPAKVAEIGNPNFRRHEKFSPEAKTAK 
270 280 290 300 310 

The following partial DNA sequence was identified in N. meningitidis <SEQ ED 3049>: 

a519 . seq 

1 ATGGAATTTT TCATTATCTT GCTGGCAGCC GTCGTTGTTT TCGGCTTCAA 

51 ATCCTTTGTT GTCATCCCAC AGCAGGAAGT CCACGTTGTC GAAAGGCTCG 

101 GGCGTTTCCA TCGCGCCCTG ACGGCCGGTT TGAATATTTT GATTCCCTTT 

151 ATCGACCGCG TCGCCTACCG CCATTCGCTG AAAGAAATCC CTTTAGACGT 

2 01 ACCCAGCCAG GTCTGCATCA CGCGCGACAA TACGCAGCTG ACTGTTGACG 

2 51 GTATCATCTA TTTCCAAGTA ACCGACCCCA AACTCGCCTC ATACGGTTCG 

301 AGCAACTACA TTATGGCGAT TACCCAGCTT GCCCAAACGA CGCTGCGTTC 

351 CGTTATCGGG CGTATGGAAT TGGACAAAAC GTTTGAAGAA CGCGACGAAA 

4 01 TCAACAGCAC CGTCGTCTCC GCCCTCGATG AAGCCGCCGG AGCTTGGGGT 

4 51 GTGAAGGTTT TGCGTTATGA GATTAAAGAC TTGGTTCCGC CGCAAGAAAT 

501 CCTTCGCTCA ATGCAGGCGC AAATTACTGC TGAACGCGAA AAACGCGCCC 

551 GTATCGCCGA ATCCGAAGGT CGTAAAATCG AACAAATCAA CCTTGCCAGT 

601 GGTCAGCGCG AAGCCGAAAT CCAACAATCC GAAGGCGAGG CTCAGGCTGC 

651 GGTCAATGCG TCAAATGCCG AGAAAATCGC CCGCATCAAC CGCGCCAAAG 

7 01 GTGAAGCGGA ATCCTTGCGC CTTGTTGCCG AAGCCAATGC CGAAGCCATC 

7 51 CGTCAAATTG CCGCCGCCCT TCAAACCCAA GGCGGTGCGG ATGCGGTCAA 
801 TCTGAAGATT GCGGAACAAT ACGTCGCCGC GTTCAACAAT CTTGCCAAAG 

8 51 AAAGCAATAC GCTGATTATG CCCGCCAATG TTGCCGACAT CGGCAGCCTG 
901 ATTTCTGCCG GTATGAAAAT TATCGACAGC AGCAAAACCG CCAAATAA 

This corresponds to the amino acid sequence <SEQ ID 3050; ORF 519.a>: 

a519 .pep 

1 MEFFIILLAA WVFG FKSFV VIPQQEVHVV ERLGRFHRAL TAGLNILIPF 

51 IDRVAYRHSL KEIPLDVPSQ VCITRDNTQL TVDGIIYFQV TDPKLASYGS 

101 SNYIMAITQL AQTTLRSVIG RMELDKT FEE RDEINSTWS ALDEAAGAWG 

151 VKVLRYEIKD LVPPQEILRS MQAQITAERE KRARIAESEG RKIEQINLAS 

201 GQREAEIQQS EGEAQAAVNA SNAEKIARIN RAKGEAESLR LVAEANAEAI 

2 51 RQIAAALQTQ GGADAVNLKI AEQYVAAFNN LAKESNTLIM PANVADIGSL 

301 ISAGMKI IDS SKTAK* 

m519/a519 ORFs 519 and 519. a showed a 99.5% identity in 199 aa overlap 

10 20 30 

m519 . pep SVIGRMELDKT FEERDEINSTWAALDEAA 

I I M I 1 I I I M I It I I I ! I I I II : I I I I I I 
a519 YFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIGRMELDKTFEERDEINSTWSALDEAA 
90 100 110 120 130 140 

40 50 60 70 80 90 

m519 .pep GAWGVKVLRYE IKDLVPPQE I LRSMQAQITAEREKRARIAESEGRKIEQINLASGQREAE 

M I I I I I I I M I I I I I M I 1 1 I I II I t I I I I I I I I I I M I II I I II M I I I I I I I I I I I I 
a519 GAWGVKVLRYE I KDLVPPQEILRSMQAQI TAEREKRARI AESEGRKIEQINLASGQREAE 

150 160 170 180 190 200 



100 



110 



120 



130 



140 



150 
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mS19 . pep 



a519 



IQQSEGEAQAAVNASNAEKIARINRAKGEAESLRLVAEANAEAIRQIAAALQTQGGADAV 
I I I M I I I I I I I I I I M I M I I I I I I I I II M I I I I I I I I i I I 11 I I II I II I! I I M I I 
IQQSEGEAQAAVNAS>3AEKIARINRAKGEAESLRLVAEANAEAIRQIAAALQTQGGADAV 
210 220 230 240 250 260 



m519 . pep 



a519 



160 170 180 190 200 

NLKIAEQYVAAFNNLAKESNTLIMPANVADIGSLISAGMKIIDSSKTAKX 

I I I I I t I I I 1 I II I I M I I I I I I I I M I if M I I I I I I I M i I i I I I I I I 
NLKIAEQYVAAFNNLAKESNTLIMPANVADIGSLISAGMKIIDSSKTAKX 
270 280 290 300 310 



Further work revealed the DNA sequence identified in N. meningitidis <SEQ ID 305 1>: 

m519-l . seq 



1 ATGGAATTTT TCATTATCTT GTTGGTAGCC GTCGCCGTTT TCGGTTTCAA 

51 ATCCTTTGTT GTCATCCCAC AACAGGAAGT CCACGTTGTC GAAAGGCTGG 

101 GGCGTTTCCA TCGCGCCCTG ACGGcCGGTT TGAATATTTT GATTCCCTTT 

151 ATCGACCGCG TCGCCTACCG CCATTCGCTG AAAGAAATCC CTTTAGACGT 

201 ACCCAGCCAG GTCTGCATCA CGCGCGACAA TACGCAGCTG ACTGTTGACG 

251 GCATCATCTA TTTCCAAGTA ACCGACCCCA AACTCGCCTC ATACGGTTCG 

301 AGCAACTACA TTATGGCGAT TACCCAGCTT GCCCAAACGA CGCTGCGTTC 

351 CGTTATCGGG CGTATGGAGT TGGACAAAAC GTTTGAAGAA CGCGACGAAA 

4 01 TCAACAGTAC TGTTGTTGCG GCTTTGGACG AGGCGGCCGG GGCTTGGGGT 

4 51 GTGAAGGTTT TGCGTTATGA GATTAAAGAC TTGGTTCCGC CGCAAGAAAT 

501 CCTTCGCTCA ATGCAGGCGC AAATTACTGC CGAACGCGAA AAACGCGCCC 

551 GTATCGCCGA ATCCGAAGGT CGTAAAATCG AACAAATCAA CCTTGCCAGT 

601 GGTCAGCGCG AAGCCGAAAT CCAACAATCC GAAGGCGAGG CTCAGGCTGC 

651 GGTCAATGCG TCAAATGCCG AGAAAATCGC CCGCATCAAC CGCGCCAAAG 

7 01 GTGAAGCGGA ATCCTTGCGC CTTGTTGCCG AAGCCAATGC CGAAGCCATC 
751 CGTCAAATTG CCGCCGCCCT TCAAACCCAA GGCGGTGCGG ATGCGGTCAA 

8 01 TCTGAAGATT GCGGAACAAT ACGTCGCTGC G T T C AAC AAT CTTGCCAAAG 
8 51 AAAGCAATAC GCTGATTATG CCCGCCAATG TTGCCGACAT CGGCAGCCTG 
901 ATTTCTGCCG GTATGAAAAT TATCGACAGC AGCAAAACCG CCAAATAA 



This corresponds to the amino acid sequence <SEQ ID 3052; ORF 519-1>: 

m519-l. 



1 MEFFIILLVA VAVFG FKSFV VIPQQEVHW ERLGRFHRAL TAGLNILIPF 

51 IDRVAYRHSL KEIPLDVPSQ VCITRDNTQL TVDGIIYFQV TDPKLASYGS 

101 SNYIMAITQL AQTTLRSVIG RMELDKTFEE RDEINSTWA ALDEAAGAWG 

151 VKVLRYEIKD LVPPQEILRS MQAQITAERE KRARIAESEG RKIEQINLAS 

201 GQREAEIQQS EGEAQAAVNA SNAEKIARIN RAKGEAESLR LVAEANAEAI 

2 51 RQIAAALQTQ GGADAVNLKI AEQYVAAFNN LAKESNTLIM PANVADIGSL 

301 ISAGMKIIDS SKTAK* 



The following DNA sequence was identified in N. gonorrhoeae <SEQ ID 3053>: 

g519-l . seq 



1 ATGGAATTTT TCATTATCTT GTTGGCAGCC GTCGCCGTTT TCGGCTTCAA 

51 ATCCTTTGTC GTCATCCCCC AGCAGGAAGT CCACGTTGTC GAAAGGCTGG 

101 GGCGTTTCCA TCGCGCCCTG ACGGCCGGTT TGAATATTTT GATTCCCTTT 

151 ATCGACCGCG TCGCCTACCG CCATTCGCTG AAAGAAATCC CTTTAGACGT 

201 ACCCAGCCAG GTCTGCATCA CGCGCGATAA TACGCAATTG ACTGTTGACG 

2 51 GCATCATCTA TTTCCAAGTA ACCGATCCCA AACTCGCCTC ATACGGTTCG 

301 AGCAACTACA TTATGGCAAT TACCCAGCTT GCCCAAACGA CGCTGCGTTC 

351 CGTTATCGGG CGTATGGAGT TGGACAAAAC GTTTGAAGAA CGCGACGAAA 

4 01 TCAACAGTAC CGTCGTCTCC GCCCTCGATG AAGCCGCCGG GGCTTGGGGT 

4 51 GTGAAAGTCC TCCGTTACGA AATCAAGGAT TTGGTTCCGC CGCAAGAAAT 

501 CCTTCGCGCA ATGCAGGCAC AAATTACCGC CGAACGCGAA AAACGCGCCC 

551 GTATTGCCGA ATCCGAAGGC CGTAAAATCG AACAAATCAA CCTTGCCAGT 

601 GGTCAGCGTG AAGCCGAAAT CCAACAATCC GAAGGCGAGG CTCAGGCTGC 

651 GGTCAATGCG TCCAATGCCG AGAAAATCGC CCGCATCAAC CGCGCCAAAG 

7 01 GCGAAGCGGA ATCCCTGCGC CTTGTTGCCG AAGCCAATGC CGAAGCCATC 

7 51 CGTCAAATTG CCGCCGCCCT TCAAACCCAA GGCGGGGCGG ATGCGGTCAA 

801 TCTGAAGATT GCGGAACAAT ACGTAGCCGC GTTCAACAAT CTTGCCAAAG 
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851 AAAGCAATAC GCTGATTATG CCCGCCAATG TTGCCGACAT CGGCAGCCTG 
901 ATTTCTGCCG GCATGAAAAT TATCGACAGC AGCAAAACCG CCAAATAA 

This corresponds to the amino acid sequence <SEQ ID 3054; ORF 519-1 ng>: 

g519-l .pep 

1 MEFFIILLAA VAVFG FKSFV VIPQQEVHW ERLGRFHRAL TAGLNILIPF 

51 IDRVAYRHSL KEIPLDVPSQ VCITRDNTQL TVDGIIYFQV TDPKLASYGS 

101 SNYIMAITQL AQTTLRSVIG RMELDKTFEE RDEINSTWS ALDEAAGAWG 

151 VKVLRYEIKD LVPPQEILRA MQAQITAERE KRARIAESEG RKIEQINLAS 

201 GQREAEIQQS EGEAQAAVNA SNAEKIARIN RAKGEAESLR LVAEANAEAI 

251 RQIAAALQTQ GGADAVNLKI AEQYVAAFNN LAKESNTLIM PANVADIGSL 

301 ISAGMKIIDS SKTAK* 



m519-l/g519-l ORFs 519-1 and 519-1. ng showed a 99.0% identity in 315 aa 
overlap 

10 20 30 40 50 60 

g519-l .pep MEFFIILLAAVAVFGFKSFVVIPQQEVHVVERLGRFHRALTAGLNILIPFIDRVAYRHSL 
M I I I I I I : M I I I II M I I I I 1 M I I 1 I I M I M I I i I I I I I I i M t I I I I I I I II I I I 
m519-l MEFFIILLVAVAVFGFKSFWIPQQEVHWERLGRFHRALTAGLNILIPFIURVAYRHSL 

10 20 30 40 50 60 

70 80 90 100 110 120 

g519-l .pep KEIPLDVPSQVCITRDNTQLTVDGI IYFQVTDPKIASYGSSNYIMAITQLAQTTLRSVIG 
M I I I II I I I I I I I I I I II I I I t 1 II M I M I I I I I I I I I I I I I I I 1 I I I I II I M I I I I 
m519-l KEIPLDVPSQVCITRDNTQLTVDGI I YFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 

70 80 90 100 110 120 

130 140 150 160 170 180 

g519-l .pep RMELDKT FEERDEINSTVVSALDEAAGAWGVKVLRYEIKDLVPPQEILRAMQAQITAERE 
11 M I i I I I I I II I I I I I i : I I II I I I i I I I t I I I I I I I I I ] I I j I I I I : I I I I I 11 I I I 
m519-l RMELDKTFEERDEINSTWAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 

130 140 150 160 170 180 

190 200 210 220 230 240 

g519-l.pep KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
M t I I I II I I I I I I I I I I t M I I I I I I I I I I I I I I 11 I I I I I 1 I I I I I I I I I I I I II I I I 
m519-l KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 

190 200 210 220 230 240 

250 260 270 280 290 300 

g519-l . pep LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
I I I I I I II I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 
m519-l LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 

250 260 270 280 290 300 

310 

g519-l .pep ISAGMKIIDSSKTAKX 
I I I I I I I M I I I I I I I 
m519-l ISAGMKIIDSSKTAKX 

310 

The following DNA sequence was identified in N. meningitidis <SEQ ID 3055>: 

a519-l .seq 

1 ATGGAATTTT TCATTATCTT GCTGGCAGCC GTCGTTGTTT TCGGCTTCAA 

51 ATCCTTTGTT GTCATCCCAC AGCAGGAAGT CCACGTTGTC GAAAGGCTCG 

101 GGCGTTTCCA TCGCGCCCTG ACGGCCGGTT TGAATATTTT GATTCCCTTT 

151 ATCGACCGCG TCGCCTACCG CCATTCGCTG AAAGAAATCC CTTTAGACGT 

201 ACCCAGCCAG GTCTGCATCA CGCGCGACAA TACGCAGCTG ACTGTTGACG 

2 51 GTATCATCTA TTTCCAAGTA ACCGACCCCA AACTCGCCTC ATACGGTTCG 
301 AGCAACTACA TTATGGCGAT TACCCAGCTT GCCCAAACGA CGCTGCGTTC 

3 51 CGTTATCGGG CGTATGGAAT TGGACAAAAC GTTTGAAGAA CGCGACGAAA 
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4 01 TCAACAGCAC CGTCGTCTCC GCCCTCGATG AAGCCGCCGG AGCTTGGGGT 

4 51 GTGAAGGTTT TGCGTTATGA GATTAAAGAC TTGGTTCCGC CGCAAGAAAT 

501 CCTTCGCTCA ATGCAGGCGC AAATTACTGC TGAACGCGAA AAACGCGCCC 

551 GTATCGCCGA ATCCGAAGGT CGTAAAATCG AACAAATCAA CCTTGCCAGT 

601 GGTCAGCGCG AAGCCGAAAT CCAACAATCC GAAGGCGAGG CTCAGGCTGC 

651 GGTCAATGCG TCAAATGCCG AGAAAATCGC CCGCATCAAC CGCGCCAAAG 

701 GTGAAGCGGA ATCCTTGCGC CTTGTTGCCG AAGCCAATGC CGAAGCCATC 

7 51 CGTCAAATTG CCGCCGCCCT TCAAACCCAA GGCGGTGCGG ATGCGGTCAA 
801 TCTGAAGATT GCGGAACAAT ACGTCGCCGC GTTCAACAAT CTTGCCAAAG 

8 51 AAAGCAATAC GCTGATTATG CCCGCCAATG TTGCCGACAT CGGCAGCCTG 
901 ATTTCTGCCG GTATGAAAAT TATCGACAGC AGCAAAACCG CCAAATAA 

This corresponds to the amino acid sequence <SEQ ID 3056; ORF 519-l.a>: 

a519-l . pep. 

1 MEFFIILLAA VWFG FK5FV VIPQQEVHW ERLGRFHRAL TAGLNILIPF 

51 IDRVAYRHSL KEIPLDVPSQ VCITRDNTQL TVDGIIYFQV TDPKLASYGS 

101 SNYIMAITQL AQTTLRSVIG RMELDKTFEE RDEINSTWS ALDEAAGAWG 

151 VKVLRYEIKD LVPPQEILRS MQAQITAERE KRARIAESEG RKIEQINLAS 

201 GQREAEIQQS EGEAQAAVNA SNAEKIARIN RAKGEAESLR LVAEANAEAI 

251 RQIAAALQTQ GGADAVNLKI AEQYVAAFNN LAKESNTLIM PANVADIGSL 

301 ISAGMKIIDS SKTAK* 

m519-l/a519-l ORFs 519-1 and 519-1. a showed a 99.0% identity in 315 

overlap 

10 20 30 40 50 60 

a519-l . pep MEFFIILLAAVWFGFKSFVVIPQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSL 
I M | I I I I : I I : I I I 1 I I I I I II I I I II I I i I I ! I I I I I M I I I I t I f I I I I I M I I I I I 
m519-l MEFFIILLVAVAVFGFKSFWIPQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSL 

10 20 30 40 50 60 



70 80 90 100 110 120 

a519-l.pep KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
I I t I I I I I I I I I I I I I I I I I I I I I I I I M 1 I M M I I I I I I I I I I I I M I I I M I 11 11 I 
m519-l KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 

70 80 90 100 110 120 



130 140 150 160 170 180 

a519-l.pep RMELDKTFEERDEINSTWSALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
II I 1 I I I I I M I 1 I I II t I : 1 I II I I I I I I I I I 11 I M I I I II I I I I I M I I II I M 1 I I 
m519_l RMELDKTFEERDEINSTVVAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 

130 140 150 160 170 180 



190 200 210 220 230 240 

a519-l . pep KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
I I I I I I I 11 I I M I I I I I I I I I I I I II I 1 I I I I I E I M I I I I I I I I I I I M I I I II I I I I 
m519-l KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 

190 200 210 220 230 240 

250 260 270 280 290 300 

a519-l . pep LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
I M M I I M I I I I I I I I I I M I I I I 1 II I I I M II I M I I I I I I I I I I It I I I I I I M 11 
m519-l LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 

250 260 270 280 290 300 



310 

a519-l .pep ISAGMKIIDSSKTAKX 
I I I I I! I I I I II I I I I 
m5 1 9 - 1 I SAGMKI I D SSKTAKX 

310 
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576 and 576-1 gnm22.seq 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3057>: 

m576.seq.. (partial) 

1 . . ATGCAGCAGG CAAGCTATGC GATGGGCGTG GACATCGGAC GCTCCCTGAA 

51 GCAAATGAAG GAACAGGGCG CGGAAATCGA TTTGAAAGTC TTTACCGAAG 

101 CCATGCAGGC AGTGTATGAC GGCAAAGAAA TCAAAATGAC CGAAGAGCAG 

151 GCTCAGGAAG TCATGATGAA ATTCCTTCAG GAACAACAGG CTAAAGCCGT 

201 AGAAAAACAC AAGGCGGACG CGAAGGCCAA TAAAGAAAAA GGCGAAGCCT 

2 51 TTCTGAAAGA AAATGCCGCC AAAGACGGCG TGAAGACCAC TGCTTCCGGC 

301 CTGCAATACA AAATCACCAA ACAGGGCGAA GGCAAACAGC CGACCAAAGA 

351 CGACATCGTT ACCGTGGAAT ACGAAGGCCG CCTGATTGAC GGTACGGTAT 

4 01 TCGACAGCAG CAAAGCCAAC GGCGGCCCGG TCACCTTCCC TTTGAGCCAA 

4 51 GTGATTCCGG GTTGGACCGA AGgCGTACAG CTTCTGAAAG AAGGCGGCGA 

501 AGCCACGTTC TACATCCCGT CCAACCTTGC CTACCGCGAA CAGGGTGCGG 

551 GCGACAAAAT CGGTCCGAAC GCCACTTTGG TATTTGATGT GAAACTGGTC 

601 AAAATCGGCG CACCCGAAAA CGCGCCCGCC AAGCAGCCGG CTCAAGTCGA 

651 CATCAAAAAA GTAAATTAA 

This corresponds to the amino acid sequence <SEQ ID 3058; ORF 576>: 

rn.576.pep-. (partial) 

1 ..MQQASYAMGV DIGRSLKQMK EQGAEIDLKV FTEAMQAVYD GKEIKMTEEQ 

51 AQEVMMKFLQ EQQAKAVEKH KADAKANKEK GEAFLKENAA KDGVKTTASG 

101 LQYKITKQGE GKQPTKDDIV TVEYEGRLID GTVFDSSKAN GGPVTFPLSQ 

151 VIPGWTEGVQ LLKEGGEATF YIPSNLAYRE QGAGDKIGPN ATLVFDVKLV 

201 KIGAPENAPA KQPAQVDIKK VN* 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 3059>: 

g576.seq. . (partial) 

1 . . atgggcgtgg acatcggacg ctccctgaaa caaatgaagg aacagggcgc 

51 ggaaatcgat ttgaaagtct ttaccgatgc catgcaggca gtgtatgacg 

101 gcaaagaaat caaaatgacc gaagagcagg cccaggaagt gatgatgaaa 

151 ttcctgcagg agcagcaggc taaagccgta gaaaaacaca aggcggatgc 

201 gaaggccaac aaagaaaaag gcgaagcctt cctgaaggaa aatgccgccg 

251 aagacggcgt gaagaccact gcttccggtc tgcagtacaa aatcaccaaa 

301 cagggtgaag gcaaacagcc gacaaaagac gacatcgtta ccgtggaata 

351 cgaaggccgc ctgattgacg gtaccgtatt cgacagcagc aaagccaacg 

401 gcggcccggc caccttccct ttgagccaag tgattccggg ttggaccgaa 

4 51 ggcgtacggc ttctgaaaga aggcggcgaa gccacgttct acatcccgtc 

501 caaccttgcc taccgcgaac agggtgcggg cgaaaaaatc ggtccgaacg 

551 ccactttggt atttgacgtg aaactggtca aaatcggcgc acccgaaaac 

601 gcgcccgcca agcagccgga tcaagtcgac atcaaaaaag taaattaa 

This corresponds to the amino acid sequence <SEQ ID 3060; ORF 576.ng>: 

g576.pep. . (partial) 

1 . . MGVDIGRSLK QMKEQGAEID LKVFTDAMQA VYDGKEIKMT EEQAQEVMMK 
51 FLQEQQAKAV EKHKADAKAN KEKGEAFLKE NAAE DGVKTT ASGLQYKITK 

101 QGEGKQPTKD DIVTVEYEGR LIDGTVFDSS KANGGPATFP LSQVIPGWTE 

151 GVRLLKEGGE ATFYIPSNLA YREQGAGEKI GPNATLVFDV KLVKIGAPEN 

201 APAKQPDQVD IKKVN* 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

m576/g576 ORFs 576 and 576. ng showed a 97.2% identity in 215 aa overlap 

10 20 30 40 50 60 

m57 6 . pep MQQASYAMGVDIGRSLKQMKEQGAEIDLKVFTEAMQAVYDGKEIKMTEEQAQEVMMKFLQ 

I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I 1 I I I I I I M I I M I I I I I I I I 
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g 57 6 MGVDIGRSLKQMKEQGAEIDLKVFTDAMQAVYDGKEIKMTEEQAQEVMMKFLQ 

10 20 30 40 50 

70 80 90 100 110 120 

m57 6 . pep EQQAKAVEKHKADAKANKEKGEAFLKENAAKDGVKTTASGLQYKITKQGEGKQPTKDDIV 
I M II I I I I I I I I I I I I M I M I I I I I I I I : I I I I M I I I I I ! I I M I I I I I I I I I M 1 I 
g57 6 EQQAKAVEKHKADAKANKEKGEAFLKENAAEDGVKTTASGLQYKITKQGEGKQPTKDDIV 
60 70 80 90 100 110 



130 140 150 160 170 180 

m57 6 .pep TVEYEGRLIDGTVFDSSKANGGPVTFPLSQVIPGWTEGVQLLKEGGEAT FYIPSNLAYRE 

M I M I I I I I I I II I M i I I I I I : N I I I M I I II i I M : I I I I If I I I M I M I I I I M 
g57 6 TVEYEGRLIDGTVFDSSKANGGPATFPLSQVIPGWTEGVRLLKEGGEATFYIPSNLAYRE 
120 130 140 150 160 170 



190 200 210 220 

lt\57 6 . pep QGAGDKIGPNATLVFDVKLVKIGAPENAPAKQPAQVDIKKVNX 
I I I I : I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I 
g 5 7 6 QGAGEKIG PNATLV FDVKLVKI GA PENAPAKQPDQVD I KKVNX 

180 190 200 210 



The following partial DNA sequence was identified in K meningitidis <SEQ ID 3061>: 

a576.seq 

1 ATGAACACCA TTTTCAAAAT CAGCGCACTG ACCCTTTCCG CCGCTTTGGC 

51 ACTTTCCGCC TGCGGCAAAA AAGAAGCCGC CCCCGCATCT GCATCCGAAC 

101 CTGCCGCCGC TTCTTCCGCG CAGGGCGACA CCTCTTCGAT CGGCAGCACG 

151 ATGCAGCAGG CAAGCTATGC GATGGGCGTG GACATCGGAC GCTCCCTGAA 

201 GCAAATGAAG GAACAGGGCG CGGAAATCGA TTTGAAAGTC TTTACCGAAG 

2 51 CCATGCAGGC AGTGTATGAC GGCAAAGAAA TCAAAATGAC CGAAGAGCAG 

301 GCTCAGGAAG TCATGATGAA ATTCCTTCAG GAACAACAGG CTAAAGCCGT 

351 AGAAAAACAC AAGGCGGACG CGAAGGCCAA TAAAGAAAAA GGCGAAGCCT 

4 01 TTCTGAAAGA AAATGCCGCC AAAGACGGCG TGAAGACCAC TGCTTCCGGC 

451 CTGCAATACA AAATCACCAA ACAGGGCGAA GGCAAACAGC CGACCAAAGA 

501 CGACATCGTT ACCGTGGAAT ACGAAGGCCG CCTGATTGAC GGTACGGTAT 

551 TCGACAGCAG CAAAGCCAAC GGCGGCCCGG TCACCTTCCC TTTGAGCCAA 

601 GTGATTCTGG GTTGGACCGA AGGCGTACAG CTTCTGAAAG AAGGCGGCGA 

651 AGCCACGTTC TACATCCCGT CCAACCTTGC CTACCGCGAA CAGGGTGCGG 

701 GCGACAAAAT CGGCCCGAAC GCCACTTTGG TATTTGATGT GAAACTGGTC 

751 AAAATCGGCG CACCCGAAAA CGCGCCCGCC AAGCAGCCGG CTCAAGTCGA 

801 CATCAAAAAA GTAAATTAA 



This corresponds to the amino acid sequence <SEQ ID 3062; ORF 576. a>: 

a576 . pep 

1 MNTIFKISAL TLSAALALSA CGKKEAAPAS ASEPAAASSA QGDTSSIGST 

51 MQQASYAMGV DIGRSLKQMK EQGAEIDLKV FTEAMQAVYD GKEIKMTEEQ 

101 AQEVMMKFLQ EQQAKAVEKH KADAKANKEK GEAFLKENAA KDGVKTTASG 

151 LQYKITKQGE GKQPTKDDIV TVEYEGRLID GTVFDSSKAN GGPVTFPLSQ 

201 VILGWTEGVQ LLKEGGEATF YIPSNLAYRE QGAGDKIGPN ATLVFDVKLV 

251 KIGAPENAPA KQPAQVDIKK VN* 

m576/a576 ORFs 576 and 576. a showed a 99.5% identity in 222 aa overlap 

10 20 30 

m57 6 . pep MQQASYAMGV D I GRS LKQMKEQGAE I DLKV 

I I I II I M I I I I I I I ! I I I I I II I I I 1 I I I 
a57 6 CGKKEAAPAS ASE PAAAS S AQG DT S S I G STMQQASYAMGVD I GRS LKQMKEQGAE I DLKV 

30 40 50 60 70 80 



40 50 60 70 80 90 

m57 6 . pep FTEAMQAVYDGKEIKMTEEQAQEVMMKFLQEQQAKAVEKHKADAKANKEKGEAFLKENAA 
I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 
a57 6 FTEAMQAVYDGKEIKMTEEQAQEVMMKFLQEQQAKAVEKHKADAKANKEKGEAFLKENAA 

90 100 110 120 130 140 
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100 110 120 130 140 150 

m57 6 . pep KDGVKTTASGLQYKITKQGEGKQPTKDDIVTVEYEGRLI DGTVFDSSKANGGPVTFPLSQ 

I I I M I ! I I I I I I M I I I I I I I I I 1 I I I I I I I M M I I I M I M I II I I II I I I I I I II I 
a57 6 KDGVKTTASGLQYKITKQGEGKQPTKDDIVTVEYEGRLI DGTVFDSSKANGGPVTFPLSQ 

150 160 170 180 190 200 

160 170 180 190 200 210 

m57 6.pep VIPGWTEGVQLLKEGGEATFYIPSNLAYREQGAGDKIGPNATLVFDVKLVKIGAPENAPA 

II I I I I I I I I I M I I I I I I I I I M I I I I I I I I I I I I I I I I I 1 I I M I I M I I I I I I I i I 
a57 6 VILGWTEGVQLLKEGGEATFYIPSNLAYREQGAGDKIGPNATLVFDVKLVKIGAPENAPA 

210 220 230 240 250 260 

220 

m57 6 . pep KQPAQVDIKKVNX 
I I I I II I I I I I I I 
a57 6 KQPAQVDIKKVNX 

270 



Further work revealed the DNA sequence identified in N. meningitidis <SEQ ID 3063>: 

m57 6-1 . seq 

1 ATGAACACCA TTTTCAAAAT CAGCGCACTG ACCCTTTCCG CCGCTTTGGC 

51 ACTTTCCGCC TGCGGCAAAA AAGAAGCCGC CCCCGCATCT GCATCCGAAC 

101 CTGCCGCCGC TTCTTCCGCG CAGGGCGACA CCTCTTCGAT CGGCAGCACG 

151 ATGCAGCAGG CAAGCTATGC GATGGGCGTG GACATCGGAC GCTCCCTGAA 

2 01 GCAAATGAAG GAACAGGGCG CGGAAATCGA TTTGAAAGTC TTTACCGAAG 

2 51 CCATGCAGGC AGTGTATGAC GGCAAAGAAA TCAAAATGAC CGAAGAGCAG 

301 GCTCAGGAAG TCATGATGAA ATTCCTTCAG GAACAACAGG CTAAAGCCGT 

351 AGAAAAACAC AAGGCGGACG CGAAGGCCAA TAAAGAAAAA GGCGAAGCCT 

4 01 TTCTGAAAGA AAATGCCGCC AAAGACGGCG TGAAGACCAC TGCTTCCGGC 

4 51 CTGCAATACA AAATCACCAA ACAGGGCGAA GGCAAACAGC CGACCAAAGA 

501 CGACATCGTT ACCGTGGAAT ACGAAGGCCG CCTGATTGAC GGTACGGTAT 

551 TCGACAGCAG CAAAGCCAAC GGCGGCCCGG TCACCTTCCC TTTGAGCCAA 

601 GTGATTCCGG GTTGGACCGA AGGCGTACAG CTTCTGAAAG AAGGCGGCGA 

651 AGCCACGTTC TACATCCCGT CCAACCTTGC CTACCGCGAA CAGGGTGCGG 

701 GCGACAAAAT CGGTCCGAAC GCCACTTTGG TATTTGATGT GAAACTGGTC 

7 51 AAAATCGGCG CACCCGAAAA CGCGCCCGCC AAGCAGCCGG CTCAAGTCGA 

801 CATCAAAAAA GTAAATTAA 

This corresponds to the amino acid sequence <SEQ ID 3064; ORF 576-l>: 

m57 6-1 . pep 

1 MNTIFKI SAL TLSAALALS A CGKKEAAPAS ASEPAAASSA QGDTSSIGST 

51 MQQASYAMGV DIGRSLKQMK EQGAEIDLKV FTEAMQAVYD GKEIKMTEEQ 

101 AQEVMMKFLQ EQQAKAVEKH KADAKANKEK GEAFLKENAA KDGVKTTASG 

151 LQYKITKQGE GKQPTKDDIV TVEYEGRLID GTVFDSSKAN GGPVTFPLSQ 

201 VIPGWTEGVQ LLKEGGEATF YIPSNLAYRE QGAGDKIGPN ATLVFDVKLV 

251 KIGAPENAPA KQPAQVDIKK VN* 

The following DNA sequence was identified in N. gonorrhoeae <SEQ ID 3065>: 

g576-l . seq 

1 ATGAACACCA TTTTCAAAAT CAGCGCACTG ACCCTTTCCG CCGCTTTGGC 

51 ACTTTCCGCC TGCGGCAAAA AAGAAGCCGC CCCCGCATCT GCATCCGAAC 

101 CTGCCGCCGC TTCTGCCGCG CAGGGCGACA CCTCTTCAAT CGGCAGCACG 

151 ATGCAGCAGG CAAGCTATGC AATGGGCGTG GACATCGGAC GCTCCCTGAA 

201 ACAAATGAAG GAACAGGGCG CGGAAATCGA TTTGAAAGTC TTTACCGATG 

251 CCATGCAGGC AGTGTATGAC GGCAAAGAAA TCAAAATGAC CGAAGAGCAG 

301 GCCCAGGAAG TGATGATGAA ATTCCTGCAG GAGCAGCAGG CTAAAGCCGT 

351 AGAAAAACAC AAGGCGGATG CGAAGGCCAA CAAAGAAAAA GGCGAAGCCT 

4 01 TCCTGAAGGA AAATGCCGCC AAAGACGGCG TGAAGACCAC TGCTTCCGGT 

4 51 CTGCAGTACA AAATCACCAA ACAGGGTGAA GGCAAACAGC CGACAAAAGA 

501 CGACATCGTT ACCGTGGAAT ACGAAGGCCG CCTGATTGAC GGTACCGTAT 

551 TCGACAGCAG CAAAGCCAAC GGCGGCCCGG CCACCTTCCC TTTGAGCCAA 

601 GTGATTCCGG GTTGGACCGA AGGCGTACGG CTTCTGAAAG AAGGCGGCGA 

651 AGCCACGTTC TACATCCCGT CCAACCTTGC CTACCGCGAA CAGGGTGCGG 
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7 01 GCGAAAAAAT CGGTCCGAAC GCCACTTTGG TATTTGACGT GAAACTGGTC 
751 AAAATCGGCG CACCCGAAAA CGCGCCCGCC AAGCAGCCGG ATCAAGTCGA 
801 CATCAAAAAA GTAAATTAA 

This corresponds to the amino acid sequence <SEQ ID 3066; ORF 576-1. ng>: 

g57 6-1 . pep 

1 MNTIFKISAL TLSAALALS A CGKKEAAPAS ASEPAAASAA QGDTSSIGST 

51 MQQASYAMGV DIGRSLKQMK EQGAEIDLKV FTDAMQAVYD GKEIKMTEEQ 

101 AQEVMMKFLQ EQQAKAVEKH KADAKANKEK GEAFLKENAA KDGVKTTASG 

151 LQYKITKQGE GKQPTKDDIV TVEYEGRLID GTVFDSSKAN GGPATFPLSQ 

201 VIPGWTEGVR LLKEGGEATF YIPSNLAYRE QGAGEKIGPN ATLVFDVKLV 

251 KIGAPENAPA KQPDQVDIKK VN + 



g576-l/m576-l ORFs 576-1 and 576-1. ng showed a 97.8% identity in 272 aa 

overlap 

10 20 30 40 50 60 

g57 6-l .pep MNTIFKISALTLSAALALSACGKKEAAPASASEPAAASAAQGDTSSIGSTMQQASYAMGV 
I I I I I I I I I I I I II I I I I I I I I I I I I I I I f I I I I I I I I : I 1 I I II I I M I I! II II I M I 
m57 6-l MNTIFKISALTLSAALALSACGKKEAAPASASEPAAASSAQGDTSSIGSTMQQASYAMGV 

10 20 30 40 50 60 

70 80 90 100 110 120 

g57 6-1 . pep D I GR S LKQMKEQG AE I DLKV FT DAMQAV Y DGKE I KMTEEQAQE VMMK FLQEQQAKAVEKH 
I I I I I I I I I M I II M M I I I I : I I I I I I II M I I I I I I I I I M I I I I I I I I I I I I 1 I I I 
m57 6-l D I GRSLKQMKEQGAE I DLKV FT EAMQAVY DGKE I KMTEEQAQE VMMKFLQEQQAKAVEKH 

70 80 90 100 110 120 

130 140 150 160 170 180 

g57 6-l .pep KADAKANKEKGEAFLKENAAKDGVKTTASGLQYKITKQGEGKQPTKDDIVTVEYEGRLID 
I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I 
m57 6-l KADAKANKEKGEAFLKENAAKDGVKTTASGLQYKITKQGEGKQPTKDDIVTVEYEGRLID 
* 130 140 150 160 170 180 

190 200 210 220 230 240 

g57 6-l .pep GTVFDSSKANGGPATFPLSQVIPGWTEGVRLLKEGGEAT FYIPSNLAYREQGAGEKIGPN 
I I I I I II I I I I I I : I I I I M II I I I II I I : II I I I I I I II ! I I I M I I I I I II I : I I I I I 
m57 6-l GTVFDSSKANGGPVTFPLSQVIPGWTEGVQLLKEGGEAT FYIPSNLAYREQGAGDKIGPN 

190 200 210 220 230 240 

250 260 270 

g 57 6-1 .pep ATLVFDVKLVKIGAPENAPAKQPDQVDIKKVNX 
I I I f I I I I I I I I I I I II M I II I MINIMI 
m57 6-1 ATLVFDVKLVKIGAPENAPAKQPAQVDIKKVNX 

250 260 270 

The following DNA sequence was identified in TV. meningitidis <SEQ ID 3067>: 

a576-l . seq 

1 ATGAACACCA TTTTCAAAAT CAGCGCACTG ACCCTTTCCG CCGCTTTGGC 

51 ACTTTCCGCC TGCGGCAAAA AAGAAG CCGC CCCCGCATCT GCATCCGAAC 

101 CTGCCGCCGC TTCTTCCGCG CAGGGCGACA CCTCTTCGAT CGGCAGCACG 

151 ATGCAGCAGG CAAGCTATGC GATGGGCGTG GACATCGGAC GCTCCCTGAA 

2 01 GCAAATGAAG GAACAGGGCG CGGAAATCGA TTTGAAAGTC TTTACCGAAG 

2 51 CCATGCAGGC AGTGTATGAC GGCAAAGAAA TCAAAATGAC CGAAGAGCAG 
301 GCTCAGGAAG T CAT GAT G AA ATTCCTTCAG GAACAACAGG CTAAAGCCGT 

3 51 AGAAAAACAC AAGGCGGACG CGAAGGCCAA TAAAGAAAAA GGCGAAGCCT 

4 01 TTCTGAAAGA AAATGCCGCC AAAGACGGCG TGAAGACCAC TGCTTCCGGC 
4 51 CTGCAATACA AAATCACCAA ACAGGGCGAA GGCAAACAGC CGACCAAAGA 
501 CGACATCGTT ACCGTGGAAT ACGAAGGCCG CCTGATTGAC GGTACGGTAT 
551 TCGACAGCAG CAAAGCCAAC GGCGGCCCGG TCACCTTCCC TTTGAGCCAA 
601 GTGATTCTGG GTTGGACCGA AGGCGTACAG CTTCTGAAAG AAGGCGGCGA 
651 AGCCACGTTC TACATCCCGT CCAACCTTGC CTACCGCGAA CAGGGTGCGG 
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7 01 GCGACAAAAT CGGCCCGAAC GCCACTTTGG TATTTGATGT GAAACTGGTC 

7 51 AAAATCGGCG CACCCGAAAA CGCGCCCGCC AAGCAGCCGG CTCAAGTCGA 

8 01 CATCAAAAAA GTAAATTAA 

This corresponds to the amino acid sequence <SEQ ID 3068; ORF 576-l.a>: 

a57 6-1 . pep 

1 MNTIFKISAL TLSAALALS A CGKKEAAPAS ASEPAAASSA QGDTSSIGST 

51 MQQASYAMGV DIGRSLKQMK EQGAEIDLKV FTEAMQAVYD GKEIKMTEEQ 

101 AQEVMMKFLQ EQQAKAVEKH KADAKANKEK GEAFLKENAA KDGVKTTASG 

151 LQYKITKQGE GKQPTKDDIV TVEYEGRLID GTVFDSSKAN GGPVTFPLSQ 

201 VILGWTEGVQ LLKEGGEATF YIPSNLAYRE QGAGDKIGPN ATLVFDVKLV 

2 51 KIGAPENAPA KQPAQVDIKK VN* 

a576-l/m576-l ORFs 576-1 and 576-1. a showed a 99.6% identity in 272 

overlap 

10 20 30 40 50 60 

a57 6-l .pep MNTIFKISALTLSAALALSACGKKEAAPASASEPAAASSAQGDTSSIGSTMQQASYAMGV 
M I I I I I I I I I I I I I I I 1 I I I II i M i 1 M I I I M I M I M I I I i I I ! I j I I i I I I I I I I 
m57 6-l MNTIFKISALTLSAALALSACGKKEAAPASASEPAAASSAQGDTSSIGSTMQQASYAMGV 

10 20 30 40 50 60 

70 80 90 100 110 120 

a57 6-l .pep DIGRSLKQMKEQGAEIDLKVFTEAMQAVYDGKEIKMTEEQAQEVMMKFLQEQQAKAVEKH 
I I I I I I I I I I I I i I I I t I I I I I I t I I I I I I I I I I I I t I I I I I I I I I I I I I t I I I It I I I I 
m57 6-l DIGRSLKQMKEQGAEIDLKVFTEAMQAVYDGKEIKMTEEQAQEVMMKFLQEQQAKAVEKH 

70 80 90 100 110 120 

130 140 150 160 170 180 

a57 6-1 . pep KADAKANKEKGEAFLKENAAKDGVKTTASGLQYKITKQGEGKQPTKDDIVTVEYEGRLID 
I II I I ! M M I I I I I I I I I I M I I I I I I I I I M I I I I I I I ! I I I I I I I M I I I I I I I I I I 
m57 6-l KADAKANKEKGEAFLKENAAKDGVKTTASGLQYKITKQGEGKQPTKDDIVTVEYEGRLID 

130 140 150 160 170 180 

190 200 210 220 230 240 

a57 6-1 . pep GTVFDSSKANGGPVTFPLSQV I LGWTEGVQLLKEGGEATFY I PSNLAYRE QGAGDKIGPN 
I I I I I I 1 I I II I II I I It I I I I I M I I I I I I It I I I I II It I I I II I I I I I I I I I I I I I 
IU57 6-1 GTVFDSSKANGGPVTFPLSQVIPGWTEGVQLLKEGGEAT FYIPSNLAYREQGAGDKIGPN 

190 200 210 220 230 240 

250 260 270 

a57 6-l . pep ATLVFDVKLVKIGAPENAPAKQPAQVDIKKVNX 
I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 
m57 6-l ATLVFDVKLVKIGAPENAPAKQPAQVDIKKVNX 

250 260 270 



919 gnm43.seq 



The following partial DNA sequence was identified in N, meningitidis <SEQ ID 3069>: 

m919 . seq 

1 ATGAAAAAAT ACCTATTCCG CGCCGCCCTG TACGGCATCG CCGCCGCCAT 

51 CCTCGCCGCC TGCCAAAGCA AGAG CATC C A AACCTTTCCG CAACCCGACA 

101 CATCCGTCAT CAACGGCCCG GACCGGCCGG TCGGCATCCC CGACCCCGCC 

151 GGAACGACGG TCGGCGGCGG CGGGGCCGTC TATACCGTTG TACCGCACCT 

2 01 GTCCCTGCCC CACTGGGCGG CGCAGGATTT CGCCAAAAGC CTGCAATCCT 
251 TCCGCCTCGG CTGCGCCAAT TTGAAAAACC GCCAAGGCTG GCAGGATGTG 

3 01 TGCGCCCAAG CCTTTCAAAC CCCCGTCCAT TCCTTTCAGG CAAAACAGTT 

3 51 TTTTGAACGC TATTTCACGC CGTGGCAGGT TGCAGGCAAC GGAAGCCTTG 

4 01 CCGGTACGGT TACCGGCTAT TACGAACCGG TGCTGAAGGG CGACGACAGG 
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4 51 CGGACGGCAC AAGCCCGCTT CCCGATTTAC GGTATTCCCG ACGATTTTAT 

501 CTCCGTCCCC CTGCCTGCCG GTTTGCGGAG CGGAAAAGCC CTTGTCCGCA 

551 TCAGGCAGAC GGGAAAAAAC AGCGGCACAA TCGACAATAC CGGCGGCACA 

6 01 CATACCGCCG ACCTCTCCcG ATTCCCCATC ACCGCGCGCA CAACAGCAAT 

6 51 CAAAGGCAGG TTTGAAGGAA GCCGCTTCCT CCCCTACCAC ACGCGCAACC 

701 AAATCAACGG CGGCGCGCTT GACGGCAAAG CCCCGATACT CGGTTACGCC 

751 GAAGACCCTG TCGAACTTTT TTTTATGCAC ATCCAAGGCT CGGGCCGTCT 

801 GAAAACCCCG TCCGGCAAAT ACATCCGCAT CGGCTATGCC GACAAAAACG 

851 AACATCCyTA CGTTTCCATC GGACGCTATA TGGCGGATAA GGGCTACCTC 

901 AAACTCGGAC AAACCTCCAT GCAGGGCATT AAGTCTTATA TGCGGCAAAA 

951 TCCGCAACGC CTCGCCGAAG TTTTGGGTCA AAACCCCAGC TATATCTTTT 

1001 TCCGCGAGCT TGCCGGAAGC AGCAATGACG GCCCTGTCGG CGCACTGGGC 

1051 ACGCCGCTGA TGGGGGAATA TGCCGGCGCA GTCGACCGGC ACTACATTAC 

1101 CTTGGGTGCG CCCTTATTTG TCGCCACCGC CCATCCGGTT ACCCGCAAAG 

1151 CCCTCAACCG CCTGATTATG GCGCAGGATA CCGGCAGCGC GATTAAAGG C 

1201 GCGGTGCGCG TGGATTATTT TTGGGGATAC GGCGACGAAG CCGGCGAACT 

1251 TGCCGGCAAA CAGAAAACCA CGGGATATGT CTGGCAGCTC CTACCCAACG 

13 01 GTATGAAGCC CGAATACCGc CCGTAA 



This corresponds to the amino acid sequence <SEQ ID 3070; ORF 919>: 

m919 .pep 

1 MKKYLFRAAL YGIAAAILAA CQSKSIQTFP QPDTSVINGP DRPVGIPDPA 

51 GTTVGGGGAV YTWPHLSLP HWAAQD FAKS LQSFRLGCAN LKNRQGWQDV 

101 CAQAFQTPVH SFQAKQFFER YFTPWQVAGN GSLAGTVTGY YEPVLKGDDR 

151 RTAQARFPIY GIPDDFISVP LPAGLRSGKA LVRIRQTGKN SGTIDNTGGT 

201 HTADLSRFPI TARTTAI KGR FEGSRFLPYH TRNQINGGAL DGKAP I LGYA 

251 EDPVELFFMH IQGSGRLKTP SGKYIRIGYA DKNEHPYVSI GRYMADKGYL 

3 01 KLGQTSMQGI KSYMRQNPQR LAEVLGQNPS YIFFRELAGS SNDGPVGALG 
351 TPLMGEYAGA VDRHYITLGA P L FV AT AH P V TRKALNRL I M AQDTGS AI KG 

4 01 AVRVDYFWGY GDE AGE LAG K QKTTGYVWQL LPNGMKPEYR P* 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 307 1>: 

g919 . seq 

1 ATGAAAAAAC ACCTGCTCCG CTCCGCCCTG TACGGcatCG CCGCCgccAT 

51 CctcgCCGCC TGCCAAAgca gGAGCATCCA AAC CTTTCCG CAACCCGACA 

101 CATC CGT CAT CAACGGCCCG GACCGGCCGG CCGGCATCCC CGACCCCGCC 

151 GGAACGACGG TTGCCGGCGG CGGGGCCGTC TAT AC CGTT G TGCCGCACCT 

201 GTCCATGCCC CACTGGGCGG CGCaggATTT TGCCAAAAGC CTGCAATCCT 

251 TCCGCCTCGG CTGCGCCAAT TTGAAAAACC GCCAAGGCTG GCAGGATGTG 

301 TGCGCCCAAG CCTTTCAAAC CCCCGTGCAT TCCTTTCAGG CAAAGcGgTT 

351 TTTTGAACGC TATTTCACGC cgtGGCaggt tgcaggcaAC GGAAGcCTTG 

4 01 Caggtacggt TACCGGCTAT TACGAACCGG TGCTGAAGGG CG AC GGCAGG 

4 51 CGGACGGAAC GGGCCCGCTT CCCGATTTAC GGTATTCCCG ACGATTTTAT 

501 CTCCGTCCCG CTGCCTGCCG GTTTGCGGGG CGGAAAAAAC CTTGTCCGCA 

551 TCAGGCAGac ggGGAAAAAC AG CGGCACGA TCGACAATGC CGGCGGCACG 

601 CATACCGCCG ACCTCTCCCG ATTCCCCATC ACCGCGCGCA CAACGGcaat 

651 caaaGGCAGG TTTGAaggAA GCCGCTTCCT CCCTTACCAC ACGCGCAACC 

701 AAAtcaacGG CGGCgcgcTT GACGGCAAag cccCCATCCT CggttacgcC 

751 GAagaccCcG tcgaacttTT TTTCATGCAC AtccaaggCT CGGGCCGCCT 

801 GAAAACCCcg tccggcaaat acatCCGCAt cggaTacgcc gacAAAAACG 

851 AACAtccgTa tgtttccatc ggACGctaTA TGGCGGACAA AGGCTACCTC 

901 AAGctcgggc agACCTCGAT GCAGGgcatc aaagcCTATA TGCGGCAAAA 

951 TCCGCAACGC CTCGCCGAAG TTTTGGGTCA AAACCCCAGC TATATCTTTT 

1001 TCCGCGAGCT TGCCGGAAGC GGCAATGAGG GCCCCGTCGG CGCACTGGGC 

1051 ACGCCACTGA TGGGGGAATA CGCCGGCGCA ATCGACCGGC ACTACATTAC 

1101 CTTGGGCGCG CCCTTATTTG TCGCCACCGC CCATCCGGTT ACCCGCAAAG 

1151 CCCTCAACCG CCTGATTATG GCGCAGGATA CAGGCAGCGC GATCAAAGGC 

1201 GCGGTGCGCG TGGATTATTT TTGGGGTTAC GGCGACGAAG CCGGCGAACT 

1251 TGCCGGCAAA CAGAAAACCA CGGGATACGT CTGGCAGCTC CTGCCCAACG 

1301 GCATGAAGCC CGAATACCGC CCGTGA 
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This corresponds to the amino acid sequence <SEQ ID 3072; ORF 919.ng>: 

g9l9 . pep 

1 MKKHLLRSAL YGIAAAILAA CQSRSIQTFP QPDTSVINGP DRPAGIPDPA 

51 GTTVAGGGAV YTWPHLSMP HWAAQDFAKS LQSFRLGCAN L KNRQGWQ D V 

101 CAQAFQTPVH SFQAKRFFER YFTPWQVAGN GSLAGTVTGY YEPVLKGDGR 

151 RTERARFPIY GIPDDFISVP LPAGLRGGKN LVRIRQTGKN SGTIDNAGGT 

201 HTADLSRFPI TARTTAI KGR FEGSRFLPYH ' TRNQINGGAL DGKAPILGYA 

2 51 EDPVELFFMH IQGSGRLKTP SGKYIRIGYA DKNEHPYVSI GRYMADKGYL 

3 01 KLGQTSMQGI KAYMRQNPQR LAEVLGQNPS YIFFRELAGS GNEGPVGALG 

3 51 TPLMGEYAGA IDRHYITLGA P L FV AT AH P V TRKALNRLIM AQDTGS AI KG 

4 01 AVRVDYFWGY GD E AG E LAG K QKTTGYVWQL LPNGMKPEYR P* 



ORF 919 shows 95.9 % identity over a 441 aa overlap with a predicted ORF (ORF 919.ng) 
from N. gonorrhoeae: 

m919/g919 

10 20 30 > 40 50 60 

m919 . pep MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 

MMMIMMIMMIMMIMMMMIMMMMIMIMIIMIMIMIMM 
g919 MKKHLLRSALYGIAAAILAACQSRSIQTFPQPDTSVINGPDRPAGIPDPAGTTVAGGGAV 

10 20 30 40 50 60 



70 80 90 100 110 120 

m919 . pep YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 

MMMMMIIMMIMMMMMMMMMIMMMMIIIIIIMMMMM 
g 9 1 9 YTWPHLSMPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHS FQAKRFFER 

70 80 90 100 110 120 



130 140 150 160 170 180 

m919 . pep YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPI YGI PDDFI SVPLPAGLRSGKA 

IMIIIMIMIMIIMIIIIMMII III MMIMIIIIMIIIIIIMIMM 

g919 YFTPWQVAGNGSLiAGTVTGYYEPVLKGDGRRTERARFPI YGI PDDFI SVPLPAGLRGGKN 

130 140 150 160 170 180 



190 200 210 220 230 240 

m919 . pep LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

1 1 1 1 1 1 1 1 1 1 i I ^ i 1 1 = i : i i i 1 1 1 1 1 i ; 1 1 1 1 1 : 1 1 1 : 1 1 1 1 1 : 1 1 1 1 i I ; 1 1 1 ; ! i 1 1 

g919 LVRIRQTGKNSGTIDNAGGTHTADLSRFP I TARTTAI KGRFEGSRFLPYHTRNQINGGAL 

190 200 210 220 230 240 



250 260 270 280 290 300 

m919 .pep DGKAPILGYAEDPVELFFMHIQGSGRIjKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 

I IMM II Ml! I I I I I I I IIIMil MM I : IMII I INI II I III M I MM I : I I I 

g 9 1 9 DGKAPI LGY AEDPVELFFMHIQGSGRLKTPSGKYI RI GYADKNEHPYVS I GRYMADKGYL 

250 260 270 280 290 300 



310 320 330 340 350 360 

m919 . pep KLGQTSMQGI KS YMRQNPQRLAEVLGQNPSY I FFRELAGSSNDGPVGALGTPLMGEYAGA 

MINI Ml Ihl MINI II I Ml 1 1 MM MINI llhhl I MM I MINI 1 1 II 

g 9 1 9 KLGQTSMQGI KA YMRQN P QRLAE VLGQN P S Y I FFRELAGSGNEGPVGALGTPLMGEYAGA 

310 320 330 340 350 360 



370 380 390 400 410 420 

m9 1 9 - pep TORHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAI KGAVRVDYFWGYGDEAGELAGK 

= 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 

g 9 1 9 I DRHYI TLGAPLFVATAHPVTRKALNRLIMAQDTGS AI KGAVRVDYFWGYGDEAGELAGK 

370 380 390 400 410 420 
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430 440 
m919 .pep QKTTGYWQLLPNGMKPEYRPX 

i Mil Mill! MM I I I MM 

g919 QKTTGYWQLLPNGMKPEYRPX 

430 440 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3073>: 

a919 . seq 

1 ATGAAAAAAT ACCTATTCCG CGCCGCCCTG TGCGGCATCG CCGCCGCCAT 

51 CCTCGCCGCC TGCCAAAGCA AGAGCATCCA AACCTTTCCG CAACCCGACA 

101 CATCCGTCAT CAACGGCCCG GACCGGCCGG TCGGCATCCC CGACCCCGCC 

151 GGAACGACGG TCGGCGGCGG CGGGGCCGTT TATACCGTTG TGCCGCACCT 

201 GTCCCTGCCC CACTGGGCGG CGCAGGATTT CGCCAAAAGC CTGCAATCCT 

2 51 TCCGCCTCGG CTGCGCCAAT TTGAAAAACC GCCAAGGCTG GCAGGATGTG 

301 TGCGCCCAAG CCTTTCAAAC CCCCGTCCAT TCCGTTCAGG CAAAACAGTT 

351 TTTTGAACGC TATTTCACGC CGTGGCAGGT TGCAGGCAAC GGAAGCCTTG 

4 01 CCGGTACGGT TACCGGCTAT TACGAGCCGG TGCTGAAGGG CGACGACAGG 

4 51 CGGACGGCAC AAGCCCGCTT CCCGATTTAC GGTATTCCCG ACGATTTTAT 

501 CTCCGTCCCC CTGCCTGCCG GTTTGCGGAG CGGAAAAGCC CTTGTCCGCA 

551 TCAGGCAGAC GGGAAAAAAC AGCGGCACAA TCGACAATAC CGGCGGCACA 

601 CATACCGCCG ACCTCTCCCA ATTCCCCATC ACTGCGCGCA CAACGGCAAT 

651 CAAAGGCAGG TTTGAAGGAA GCCGCTTCCT CCCCTACCAC ACGCGCAACC 

7 01 AAATCAACGG CGGCGCGCTT GACGGCAAAG CCCCGATACT CGGTTACGCC 

7 51 GAAGACCCCG TCGAACTTTT TTTTATGCAC ATCCAAGGCT CGGGCCGTCT 

801 GAAAACCCCG TCCGGCAAAT ACATCCGCAT CGGCTATGCC GACAAAAACG 

851 AACATCCCTA CGTTTCCATC GGACGCTATA TGGCGGACAA AGGCTACCTC 

901 AAGCTCGGGC AGACCTCGAT GCAGGGCATC AAAGCCTATA TGCAGCAAAA 

951 CCCGCAACGC CTCGCCGAAG TTTTGGGGCA AAACCCCAGC TATATCTTTT 

1001 TCCGAGAGCT TACCGGAAGC AGCAATGACG GCCCTGTCGG CGCACTGGGC 

1051 ACGCGGCTGA TGGGCGAGTA CGCCGGCGCA GTCGACCGGC ACTACATTAC 

1101 CTTGGGCGCG CCCTTATTTG TCGCCACCGC CCATCCGGTT ACCCGCAAAG 

1151 CCCTCAACCG CCTGATTATG GCGCAGGATA CCGGCAGCGC GATTAAAGGC 

12 01 GCGGTGCGCG TGGATTATTT TTGGGGATAC GGCGACGAAG CCGGCGAACT 

1251 TGCCGGCAAA CAGAAAACCA CGGGATATGT CTGGCAGCTT CTGCCCAACG 

1301 GTATGAAGCC CGAATACCGC CCGTAA 

This corresponds to the amino acid sequence <SEQ ID 3074; ORF 919.a>: 

a 91 9 .pep 

1 MKKYLFRAAL CGIAAAILAA CQSKSIQTFP QPDTSVINGP DRPVGIPDPA 

51 GTTVGGGGAV YTWPHLSLP HWAAQDFAKS LQSFRLGCAN LKNRQGWQDV 

101 CAQAFQT PVH SVQAKQFFER YFTPWQVAGN GSLAGTVTGY YEPVLKGDDR 

151 RTAQARFPIY GIPDDFISVP LPAGLRSGKA LVRIRQTGKN SGTIDNTGGT 

201 HTADLSQFPI TARTTAIKGR FEGSRFLPYH TRNQINGGAL DGKAPILGYA 

251 EDPVELFFMH IQGSGRLKTP SGKYIRIGYA DKNEHPYVSI GRYMADKGYL 

301 KLGQTSMQGI KAYMQQNPQR LAEVLGQNPS YIFFRELTGS SNDGPVGALG 

351 TPLMGEYAGA VDRHYITLGA PLFVATAHPV TRKALNRLIM AQDTGSAIKG 

401 AVRVDYFWGY GDEAGELAGK QKTTGYVWQL LPNGMKPEYR P* 

m919/a919 ORFs 919 and 919. a showed a 98.6% identity in 441 aa overlap 

10 20 30 40 50 60 

m919.pep MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
I I I I I t I I I i I M I I 1 I I I I M I I I I I I It M I I I II I I I I I I I ! t I 1 I I I I I I I M II 
a919 MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 

10 20 30 40 50 60 

70 80 90 100 110 120 

m919 . pep YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQT PVHSFQAKQFFER 

I I I I I I I I I I i I I I I I I II I M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a919 YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQT PVHSVQAKQFFER 

70 80 90 100 110 120 
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130 140 150 160 170 180 

m919 .pep YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

I I I I I I I I I I I I I I I I I II I It I I I I M I M I I I I I I M I I I I I M M I I II III I I i I I 
a919 YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 

130 140 150 160 170 180 

190 200 210 220 230 240 

m919 .pep LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

II I I I I I I I I I I I I I I I I I I M I I I I : I I M I I I I I I If I I I I I I I I I I I I I I I M ! I M 
a 91 9 LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

190 200 210 220 230 240 

250 260 270 280 290 300 

m919 .pep DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
I I I I I I I I I I I M I I I I I I It I I I M 1 I I I I I I I I I I I M I I I I I II I 1 I I I I! I I I I M 
a 919 DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 

250 260 270 280 290 300 

310 320 330 340 350 360 

m919 .pep KLGQTSMQGIKSYMRQN PQRLAEVLGQNPSYI FFRELAGSSNDGPVGALGTPLMGEYAGA 

I I t I I I I I I I I : I I : I I I I I I I I I I I II I I I I I I I I I : I I I I I I I I I M I I t I I t I I I I 1 
a919 KLGQTSMQGIKAYMQQNPQRLAEVLGQNPSYIFFRELTGSSNDGPVGALGTPLMGEYAGA 

310 320 330 340 350 360 

370 380 390 400 410 420 

m919 . pep VDRHYITLGAPLFVAT7VHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
I I I I I I I I I I I I I I I I I I I I I j I I I I I i I I I t I II I I I I I I 1 I I I I I It I I I I I I I I I I I 
a919 VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 

370 380 390 400 410 420 

430 440 
m9 1 9 . pep QKTTGYVWQLLPNGMKPEYRPX 
I I I I I I 'I I I i I II I I I I M I I I 
a 91 9 QKTTGYVWQLLPNGMKPEYRPX 
430 440 



121 and 121-1 



The following partial DNA sequence was 

ml 21 . seq 

1 ATGGAAACAC AGCTTTACAT 

51 GGCGGATGCC G TACT GAT AC 

101 AAGGGCACGC CTTTACCCCC 

151 GATTTGCAGG ACACAGGCGC 

201 GCAAGAACTC AGCCGCCTAT 

251 GTCAAAACCT CGCACCGTCC 

301 ACCGTCCGAC ACGCGCCGGA 

351 GCCGCTGCTG GCGxxxxxxx 

401 xxxxxxxxxx xxxxxxxxxx 

4 51 xxxxxxxxxx xxxxxxxxxx 

501 xxxxxxxxxx xxxxxxxxxx 

551 xxxxxxxxxx xxxxxxxxxx 

601 xxxxxxCAGC TTCCTTACGA 

651 CATATTGCCG CAACTGCTCG 

701 AACGCCACCC TAAAAGCACG 

7 51 GAAACCTACC TTGACGGCGG 

801 TTCCCGTTTT ACCGCGCAAA 

851 CAGATGCCCG TCAAATGTAC 

901 TTAATGGCGG ATTTGGCAGA 

951 CACCGCCGAC CTGAACCTCG 

1001 CGTGGTTGGC GGCGTGTTGG 



identified in N. meningitidis <SEQ ID 3075>: 



CGGCATCATG 
GGATGGACGG 
TACCCCGGCA 
AGACGAACTG 
ATGCGCAAAC 
GACATTACCG 
ACACGGTTAC 
xxxxxxxxxx 
xxxxxxxxxx 
xxxxxxxxxx 
xxxxxxxxxx 
xxxxxxxxxx 
CAAAAACGGT 
ACAGGCTGCT 
GGGCGCGAAC 
CGAAAACCGA 
CCGTTTGCGA 
ATTTGCGACG 
ATGTTTCGGC 
ATCCGCAATG 
ATTAATCGCA 



TCGGGAACCA 
CGGCAAATGG 
GGTTACGCCG 
CACCGCAGCA 
CGCCGCCGAA 
CCCTCGGCTG 
AGCATACAGC 
xxxxxxxxxx 
xxxxxxxxxx 
xxxxxxxxxx 
xxxxxxxxxx 
xxxxxxxxxx 
GCAAAGTCGG 
CGCCCACCCG 
TGTTTGCCAT 
TACGACGTAT 
CGCCGTCTCA 
GCGGCATCCG 
ACACGCGTTT 
GGTGGAAGCC 
TTCCCGGTAG 



GCATGGACGG 
CTGGGCGCGG 
CCAATTGCTG 
GGATTTTGTC 
CTGCTGTGCA 
CCACGGGCAA 
TTGCCGATTT 
xxxxxxxxxx 
xxxxxxxxxx 
xxxxxxxxxx 
xxxxxxxxxx 
xxxxxxxxxx 
CACAAGGCAA 
TATTTCGCAC 
AAATTGGCTC 
TGCGGACGCT 
CACGCAGCGG 
CAATCCTGTT 
CCCTGCACAG 
GCCGnATTTG 
TCCGCACAAA 
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1051 GCAACCGGCG CATCCAAACC GTGTATTCTG AnCGCGGGAT ATTATTATTG ' 
1101 A 

This corresponds to the amino acid sequence <SEQ ID 3076; ORF 121>: 

ml 21 . pep 

1 METQLYIGIM SGTSMDGADA VLIRMDGGKW LGAEGHAFTP YPGRLRRQLL 

51 DLQDTGADEL HRSRILSQEL SRLYAQTAAE LLC SQN LAPS DITALGCHGQ 

101 TVRHAPEHGY SIQLADLPLL Axxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

151 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

201 xxQLPYDKNG AKSAQGNILP QLLDRLLAHP YFAQRHPKST GRELFAINWL 

251 ETYLDGGENR YDVLRTLSRF TAQTVCDAVS HAAADARQMY ICDGGIRNPV 

301 LMADLAECFG TRVSLHSTAD LNLDPQWVEA AX F AW LAAC W INRIPGSPHK 

351 ATGASKPCIL XAGYYY* 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 3077>: 

gl21 . seq 

1 ATGGAAACAC AGCTTTACAT CGGCATTATG TCGGGAACCA GTATGGACGG 

51 GGCGGATGCC GTGCTGGTAC GGATGGACGG CGGCAAATGG CTGGGCGCGG 

101 AAGGGCACGC CTTTACCCCC TACCCTGACC GGTTGCGCCG CAAATTGCTG 

151 GATTTGCAGG ACACAGGCAC AGACGAACTG CACCGCAGCA GGATGTTGTC 

201 GCAAGAACTC AGCCGCCTGT ACGCGCAAAC CGCCGCCGAA CTGCTGTGCA 

251 GTCAAAACCT CGCTCCGTGC GACATTACCG CCCTCGGCTG CCACGGGCAA 

301 ACCGTCCGAC ACGCGCCGGA ACACGGTtac AGCATACAGC TTGCCGATTT 

351 GCCGCTGCTG GCGGAACTGa cgcggatttT TACCGTCggc gacttcCGCA 

4 01 GCCGCGACCT TGCTGCCGGC GGacaAGGTG CGCCGCTCGT CCCCGCCTTT 

4 51 CACGAAGCCC TGTTCCGCGA TGACAGGGAA ACACGCGTGG TACTGAACAT 

501 CGGCGGGATT GCCAACATCA GCGTACTCCC CCCCGGCGCA CCCGCCTTCG 

551 GCTTCGACAC AGGGCCGGGC AATATGCTGA TGGAcgcgtg gacgcaggca 

601 cacTGGcagc TGCCTTACGA CAAAAacggt gcAAAGgcgg cacAAGGCAA 

651 catatTGCcg cAACTGCTCG gcaggctGCT CGCCcaccCG TATTTCTCAC 

701 AACCCcaccc a a AAAG C AC G GGgcGCGaac TgtttgcccT AAattggctc 

751 gaaacctAcc ttgacggcgg cgaaaaccga tacgacgtat tgcggacgct 

801 ttcccgattc accgcgcaaA ccgTttggga cgccgtctca CACGCAGCGG 

851 CAGATGCCCG TCAAATGTAC ATTTGCGGCG GCGGCATCCG CAATCCTGTT 

901 TTAATGGCGG ATTTGGCAGA ATGTTTCGGC ACACGCGTTT CCCTGCACAG 

951 CACCGCCGAA CTGAACCTCG ATCCTCAATG GGTGGAGGCG gccgCATTtg 

1001 cgtggttggC GGCGTGTTGG ATTAACCGCA TTCCCGGTAG TCCGCACAAA 

1051 GCGACCGGCG CATCCAAACC GTGTATTCTG GGCGCGGGAT ATTATTATTG 

1101 A 

This corresponds to the amino acid sequence <SEQ ID 3078; ORF 121.ng>: 

gl21.pep 

1 METQLYIGIM SGTSMDGADA VLVRMDGGKW LGAEGHAFTP YPDRLRRKLL 

51 DLQDTGTDEL HRSRMLSQEL SRLYAQTAAE LLCSQNLAPC DITALGCHGQ 

101 TVRHAPEHGY SIQLADLPLL AELTRIFTVG DFRSRDLAAG GQGAPLVPAF 

151 HEALFRDDRE TRWLNIGGI ANISVLPPGA PAFGFDTGPG NMLMDAWTQA 

201 HWQLPYDKNG AKAAQGNILP QLLGRLLAHP YFSQPHPKST GRELFALNWL 

251 ETYLDGGENR YDVLRTLSRF TAQTVWDAVS HAAADARQMY ICGGGIRNPV 

301 LMADLAECFG TRVSLHSTAE LNLDPQWVEA AAFAWLAACW INRIPGSPHK 

351 ATGASKPCIL GAGYYY* 

ORF 121 shows 73.5% identity over a 366 aa overlap with a predicted ORF (ORF121.ng) 
from N. gonorrhoeae: 

ml21/gl21 

10 20 30 40 50 60 

ml21 . pep METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRQLLDLQDTGADEL 
! I I I I I I I I I It I I I I I I I I 1 I : I I I M I M I M I i M I I M M I I : I I I I I I i I : I i I 
gl21 METQLYIGIMSGTSMDGADAVLVRMDGGKWLGAEGHAFTPYPDRLRRKLLDLQDTGTDEL 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml21 .pep HRSRILSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHGYSIQLADLPLL 
It I I : I I II I I I I t I I I I I I I I I I I I I I I i I I I i I I I I I I I I I I I I I I I I I I I E I I I M 
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gl21 HRSRMLSQELSRLYAQTAAELLCSQNLAPCDITALGCHGQTVRHAPEHGYSIQLADLPLL 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 2 1 . pep AXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 

I : : : 
g!21 AELTRIFTVGDFRSRDLAAGGQGAPLV PAFHEALFRDDRETRWLNIGGIANISVLPPGA 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml 21 . pep XXXXXXXXXXXXXXXXXXXXXXQLPYDKNGAKSAQGNILPQLLDRLLAHPYFAQRHPKST 
: : I I I I I I M I i : I I M I I 1 I I I I I I I I t I I : I Mill 

gl21 PAFGFDTGPGNMLMDAWTQAHWQLPYDKNGAKAAQGNILPQLLGRLLAHPYFSQPHPKST 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml21 .pep GRELFAINWLETYLDGGENRYDVLRTLSRFTAQTVCDAVSHAAADARQMYICDGGIRNPV 
I I I I M : II I I I I I I I I i I I I I M I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 1 I 
gl21 GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVWDAVSHAAADARQMYICGGGIRNPV 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml21 .pep LMADLAECFGTRVSLHSTADLNLDPQWVEAAXFAWLAACWINRIPGSPHKATGASKPCIL 
I I I I I I I I II I I I I I I M I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I 
gl21 LMADLAECFGTRVSLHSTAELNLDPQWVEAAAFAWLAACWINRIPGSPHKATGASKPCIL 

310 320 330 340 350 360 

ml 21. pep XAGYYYX 
I I I I I I 

gl21 GAGYYYX 



The following partial DNA sequence was identified in N, meningitidis <SEQ ID 3079>: 

al21 . seq 

1 ATGGAAACAC AGCTTTACAT CGGCATCATG TCGGGAACCA GCATGGACGG 

51 GGCGGATGCC GTACTGATAC GGATGGACGG CGGCAAATGG CTGGGCGCGG 

101 AAGGGCACGC CTTTACCCCC TACCCCGGCA GGTTACGCCG CAAATTGCTG 

151 GATTTGCAGG ACACAGGCGC GGACGAACTG CACCGCAGCA GGATGTTGTC 

201 GCAAGAACTC AGCCGCCTGT ACGCGCAAAC CGCCGCCGAA CTGCTGTGCA 

2 51 GTCAAAACCT CGCGCCGTCC GACATTACCG CCCTCGGCTG CCACGGGCAA 

301 ACCGTCAGAC ACGCGCCGGA ACACAGTTAC AGCGTACAGC TTGCCGATTT 

351 GCCGCTGCTG GCGGAACGGA CTCAGATTTT TACCGTCGGC GACTTCCGCA 

4 01 GCCGCGACCT TGCGGCCGGC GGACAAGGCG CGCCGCTCGT CCCCGCCTTT 

4 51 CACGAAGCCC TGTTCCGCGA CGACAGGGAA ACACGCGCGG TACTGAACAT 

501 CGGCGGGATT GCCAACATCA GCGTACTCCC CCCCGACGCA CCCGCCTTCG 

551 GCTTCGACAC AGGACCGGGC AATATGCTGA TGGACGCGTG GATGCAGGCA 

601 CACTGGCAGC TTCCTTACGA CAAAAACGGT GCAAAGGCGG CACAAGGCAA 

651 CATATTGCCG CAACTGCTCG ACAGGCTGCT CGCCCACCCG TATTTCGCAC 

701 AACCCCACCC TAAAAGCACG GGGCGCGAAC TGTTTGCCCT AAATTGGCTC 

751 GAAACCTACC TTGACGGCGG CGAAAACCGA TACGACGTAT TGCGGACGCT 

801 TTCCCGATTC ACCGCGCAAA CCGTTTTCGA CGCCGTCTCA CACGCAGCGG 

851 CAGATGCCCG TCAAATGTAC ATTTGCGGCG GCGGCATCCG CAATCCTGTT 

901 TTAATGGCGG ATTTGGCAGA ATGTTTCGGC ACACGCGTTT CCCTGCACAG 

951 CACCGCCGAA CTGAACCTCG ATCCGCAATG GGTAGAAGCC GCCGCGTTCG 

1001 CATGGATGGC GGCGTGTTGG GTCAACCGCA TTCCCGGTAG TCCGCACAAA 

1051 GCAACCGGCG CATCCAAACC GTGTATTCTG GGCGCGGGAT ATTATTATTG 

1101 A 



This corresponds to the amino acid sequence <SEQ ID 3080; ORF 121. a>: 

al21 . pep 

1 METQLYIGIM SGTSMDGADA VLIRMDGGKW LGAEGHAFTP YPGRLRRKLL 

51 DLQDTGADEL HRSRMLSQEL SRLYAQTAAE LLCSQNLAPS DITALGCHGQ 

101 TVRHAPEHSY SVQLADLPLL AERTQIFTVG DFRSRDLAAG GQGAPLVPAF 

151 HEALFRDDRE TRAVLNIGGI ANISVLPPDA PAFGFDTGPG NMLMDAWMQA 

201 HWQLPYDKNG AKAAQGNILP QLLDRLLAHP YFAQPHPKST GRELFALNWL 

251 ETYLDGGENR YDVLRTLSRF TAQTVFDAVS HAAADARQMY ICGGGIRNPV 
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301 LMADLAECFG TRVSLHSTAE LNLDPQWVEA AAFAWMAACW VNRIPGSPHK 
351 ATGASKPCIL GAGYYY+ 

ml21/al21 ORFs 121 and 121. a showed a 74.0% identity in 366 aa overlap 



10 20 30 40 50 60 

ml 21 .pep METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRQLLDLQDTGADEL 
I I M I I I I I I I I I I I I M II I I I I I I I 1 I M II I I i II I I I I I I! M : I E I M I I I M II 
a!21 METQLYIG1MSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRKLLDLQDTGADEL 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml21 .pep HRSRILSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHGYSIQLADLPLL 
I M I : I II I I I I I I M II I I I I II I I M I I I I I I I I II I I I I I I I I I I : II : M I I I M I 
al21 HRSRMLSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHSYSVQLADLPLL 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 2 1 . pep AXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 

al21 AERTQIFTVGDFRSRDLAAGGQGAPLVPAFHEALFRDDRETRAVLNIGGIANISVLPPDA 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml21 .pep XXXXXXXXXXXXXXXXXXXXXXQLPYDKNGAKSAQGNILPQLLDRLLAHPYFAQRHPKST 
: M I I I I I I I I : t I I I M I 11 I II I I I II M I 1 I I I I I 

a 121 PAFGFDTGPGNMLMDAWMQAHWQLPYDKNGAKAAQGNILPQLLDRLLAHPYFAQPHPKST 

190 200 210 220 230 240 



250 260 270 280 290 300 

ml2 1 . pep GRELFAINWLETYLDGGENRYDVLRTLSRFTAQTVCDAVSHAAADARQMYICDGGIRNPV 
I I I M I : I I I I I I I I I I I I I I M I I I I I I I 11 I I I I I I I I I I I I I I II 11 I I I I M I I 
al21 GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVFDAVSHAAADARQMYICGGGIRNPV 

250 260 270 280 290 300 



310 320 330 340 350 360 

ml21 .pep LMADLAECFGTRVSLHSTADLNLDPQWVEAAXFAWLAACWINRIPGSPHKATGASKPCIL 
I I I I I I I I I I I I I I I I I I I : I I 1 I I I I I I I I E II : I I I I : I I I I I II I I I I I I I I I I I I 
a l21 LMADLAECFGTRVSLHSTAELNLDPQWVEAAAFAWMAACWVNRI PGSPHKATGASKPCIL 

310 320 330 340 350 360 

ml 21. pep XAGYYYX 
I I I I I I 

a 121 GAGYYYX 



Further work revealed the DNA sequence identified in N. meningitidis <SEQ ID 308 1>: 

ml21-l . seq 

1 ATGGAAACAC AGCTTTACAT CGGCATCATG TCGGGAACCA GCATGGACGG 

51 GGCGGATGCC GTACTGATAC GGATGGACGG CGGCAAATGG CTGGGCGCGG 

101 AAGGGCACGC CTTTACCCCC TACCCCGGCA GGTTACGCCG CCAATTGCTG 

151 GATTTGCAGG ACACAGGCGC AGACGAACTG CACCGCAGCA GGATTTTGTC 

201 GCAAGAACTC AGCCGCCTAT ATGCGCAAAC CGCCGCCGAA CTGCTGTGCA 

251 GTCAAAACCT CGCACCGTCC GACATTACCG CCCTCGGCTG CCACGGGCAA 

301 ACCGTCCGAC ACGCGCCGGA ACACGGTTAC AGCATACAGC TTGCCGATTT 

3 51 GCCGCTGCTG GCGGAACGGA CGCGGATTTT TACCGTCGGC GACTTCCGCA 

4 01 GCCGCGACCT TGCGGCCGGC GGACAAGGCG CGCCACTCGT CCCCGCCTTT 
4 51 CACGAAGCCC TGTTCCGCGA CAACAGGGAA ACACGCGCGG TACTGAACAT 
501 CGGCGGGATT GCCAACATCA GCGTACTCCC CCCCGACGCA CCCGCCTTCG 
551 GCTTCGACAC AGGGCCGGGC AATATGCTGA TGGACGCGTG GACGCAGGCA 
601 CACTGGCAGC TTCCTTACGA CAAAAACGGT GCAAAGGCGG CACAAGGCAA 
651 CATATTGCCG CAACTGCTCG ACAGGCTGCT CGCCCACCCG TATTTCGCAC 
7 01 AACCCCACCC TAAAAGCACG GGGCGCGAAC TGTTTGCCCT AAATTGGCTC 
7 51 GAAACCTACC TTGACGGCGG CGAAAACCGA TACGACGTAT TGCGGACGCT 
801 TTCCCGTTTT ACCGCGCAAA CCGTTTGCGA CGCCGTCTCA CACGCAGCGG 



WO 99/057280 



PCT/US99/09346 



91 



851 CAGATGCCCG TCAAATGTAC ATTTGCGGCG GCGGCATCCG CAATCCTGTT 

901 TTAATGGCGG ATTTGGCAGA ATGTTTCGGC ACACGCGTTT CCCTGCACAG 

9 51 CACCGCCGAC CTGAACCTCG ATCCGCAATG GGTGGAAGCC GCCGNATTTG 

1001 CGTGGTTGGC GGCGTGTTGG ATTAATCGCA TTCCCGGTAG TCCGCACAAA 

1051 GCAACCGGCG CATCCAAACC GTGTATTCTG ANCGCGGGAT ATTATTATTG 

1101 A 

This corresponds to the amino acid sequence <SEQ ID 3082; ORF 121 -1>: 

ml21-l . pep 

1 METQLYIGIM SGTSMDGADA VLIRMDGGKW LGAEGHAFTP YPGRLRRQLL 

51 DLQDTGADEL HRSRILSQEL SRLYAQTAAE LLCSQNLAPS DITALGCHGQ 

101 TVRHAPEHGY SIQLADLPLL AERTRI FTVG DFRSRDLAAG GQGAPLVPAF 

151 HEALFRDNRE TRAVLNIGGI ANISVLPPDA PAFGFDTGPG NMLMDAWTQA 

201 HWQLPYDKNG AKAAQGNILP QLLDRLLAHP YFAQPHPKST GRELFALNWL 

251 ETYLDGGENR YDVLRTLSRF TAQTVCDAVS HAAADARQMY ICGGGIRNPV 

301 LMADLAECFG TRVSLHSTAD LNLDPQWVEA AXFAWLAACW INRIPGSPHK 

3 51 ATGASKPCIL XAGYYY* 

ml21-l/gl21 ORFs 121-1 and 121. ng showed a 95.6% identity in 366 aa overlap 

10 20 30 40 50 60 

METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRQLLDLQDTGADEL 

I | | I I I I I I M I M M I I I M I : I I It I II I I M I I I I M I I i ! 11 : I I I I I I I I : I I! 
METQLYIGIMSGTSMDGADAVLVRMDGGKWLGAEGHAFTPYPDRLRRKLLDLQDTGTDEL 
10 20 30 40 50 60 



ml21-l . pep 
gl21 



70 80 90 100 110 120 

ml21-l -pep HRSRILSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHGYSIQLADLPLL 

I I I I : I I I M I I I I I I I I M I I I I I I I M I I II I M II I I M I I I I M II I I I I I I I I I 
gl21 HRSRMLSQELSRLYAQTAAELLCSQNLAPCDITALGCHGQTVRHAPEHGYSIQLADLPLL 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml21-l .pep AERTRI FT VGDFRSRDIiAAGGQGAPLV PAFHEALFRDNRETRAVLNIGGIANISVLPPDA 

II I M I E I I I I I I 11 I I I I I I I I M M I I I I i II I I : I I M : M I I I I I I i I I I I II I 
gl21 AELTRIFTVGDFRSRDLAAGGQGAPLVPAFHEALFRDDRETRWLNIGGIANISVLPPGA 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml21-l . pep PAFGFDTGPGNMLMDAWTQAHWQLPYDKNGAKAAQGNILPQLLDRLLAHPYFAQPHPKST 
I I I I j I I I I M I I I I I M I I I I I I M I M I I I I I I I I I M I I I I II I I II I : I I I M I I 
g 121 PAFGFDTGPGNMLMDAWTQAHWQLPYDKNGAKAAQGNILPQLLGRLLAHPYFSQPHPKST 

190 200 210 220 230 240 

250 260 270 280 290 300 

IU121-1 -pep GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVCDAVSHAAADARQMYICGGGIRNPV 
I | j I M I I M I I I I I I I I I I I I I I I I I I 1 I I I I M I I I I I I I I I I I M I I I I I I I I I I I 
g l21 GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVWDAVS HAAADARQMY I CGGG I RN PV 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml21-l -pep LMADLAEC FGTRVSLHSTADLNLDPQWVE AAXFAWLAACW INRI PGS PHKATGASKPC I L 
M I I I I I I I 1 I I I I I I I M : I I I I I I I I M I M I I I I I I I M I I II I I M I I I I I I I I I 
g 121 LMADLAECFGTRVSLHSTAELNLDPQWVEAAAFAWLAACW INRI PGS PHKATGASKPC I L 

310 320 330 340 350 360 



ml 2 1-1 .pep XAGYYYX 
I I I I 11 

gl21 GAGYYYX 

The following DNA sequence was identified inN. meningitidis <SEQ ID 3083>: 

al21-l . seq 

1 ATGGAAACAC AGCTTTACAT CGGCATCATG TCGGGAACCA GCATGGACGG 
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51 GGCGGATGCC GTACTGATAC GGATGGACGG CGGCAAATGG CTGGGCGCGG 

101 AAGGGCACGC CTTTACCCCC TACCCCGGCA GGTTACGCCG CAAATTGCTG 

151 GATTTGCAGG ACACAGGCGC GGACGAACTG CACCGCAGCA GGATGTTGTC 

2 01 GCAAGAACTC AGCCGCCTGT ACGCGCAAAC CGCCGCCGAA CTGCTGTGCA 

2 51 GTCAAAACCT CGCGCCGTCC GACATTACCG CCCTCGGCTG CCACGGGCAA 

301 ACCGTCAGAC ACGCGCCGGA ACACAGTTAC AGCGTACAGC TTGCCGATTT 

351 GCCGCTGCTG GCGGAACGGA CTCAGATTTT TACCGTCGGC GACTTCCGCA 

4 01 GCCGCGACCT TGCGGCCGGC GGACAAGGCG CGCCGCTCGT CCCCGCCTTT 

4 51 CACGAAGCCC TGTTCCGCGA CGACAGGGAA ACACGCGCGG TACTGAACAT 

501 CGGCGGGATT GCCAACATCA GCGTACTCCC CCCCGACGCA CCCGCCTTCG 

551 GCTTCGACAC AGGACCGGGC AATATGCTGA TGGACGCGTG GATGCAGGCA 

601 CACTGGCAGC TTCCTTACGA CAAAAACGGT GCAAAGGCGG CACAAGGCAA 

651 CATATTGCCG CAACTGCTCG ACAGGCTGCT CGCCCACCCG TATTTCGCAC 

7 01 AACCCCACCC TAAAAGCACG GGGCGCGAAC TGTTTGCCCT AAATTGGCTC 

7 51 GAAACCTACC TTGACGGCGG CGAAAACCGA TACGACGTAT TGCGGACGCT 
801 TTCCCGATTC ACCGCGCAAA CCGTTTTCGA CGCCGTCTCA CACGCAGCGG 

8 51 CAGATGCCCG TCAAATGTAC ATTTGCGGCG GCGGCATCCG CAATCCTGTT 
901 TTAATGGCGG ATTTGGCAGA ATGTTTCGGC ACACGCGTTT CCCTGCACAG 
951 CACCGCCGAA CTGAACCTCG ATCCGCAATG GGTAGAAGCC GCCGCGTTCG 

1001 CATGGATGGC GGCGTGTTGG GTCAACCGCA TTCCCGGTAG TCCGCACAAA 

1051 GC AACCGGCG CATCCAAACC GTGTATTCTG GGCGCGGGAT ATTATTATTG 

1101 A 

This corresponds to the amino acid sequence <SEQ ID 3084; ORF 121-1. a>: 

al21-l .pep 

1 METQLYIGIM SGTSMDGADA VLIRMDGGKW LGAEGHAFTP YPGRLRRKLL 

51 DLQDTGADEL HRSRMLSQEL SRLYAQTAAE LLCSQNLAPS DITALGCHGQ 

101 TVRHAPEHSY SVQLADLPLL AERTQIFTVG DFRSRDLAAG GQGAPLVPAF 

151 HEALFRDDRE TRAVLNIGGI ANISVLPPDA PAFGFDTGPG NMLMDAWMQA 

2 01 HWQLPYDKNG AKAAQGNILP QLLDRLLAHP YFAQPHPKST GRELFALNWL 

2 51 ETYLDGGENR YDVLRTLSRF TAQTVFDAVS HAAADARQMY ICGGGIRNPV 

301 LMADLAECFG TRVSLHSTAE LNLDPQWVEA AAFAWMAACW VNRIPGSPHK 

351 ATGASKPCIL GAGYYY* 

ml21-l/al21-l ORFs 121-1 and 121-1. a showed a 96.4% identity in 366 aa overlap 

10 20 30 40 50 60 

METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRQLLDLQDTGADEL 
I I I ! I I I I I 1 I I I I M 1 I I I I I It I I I I M I I II M I i I I I ! I I I I I : I I 1! 1 I I I I i I I 
METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRKLLDLQDTGADEL 
10 20 30 40 50 60 

70 80 90 100 110 120 

HRSRILSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHGYSIQLADLPLL 
I I I I : I I I I ! I I I I I I 11 I I I I I I I I I I II I I I I I I I I I II I I I I I I I : I I : ! I I I I I I I 
HRSRMLSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHSYSVQLADLPLL 
70 80 90 100 110 120 

130 140 150 160 170 180 

AERTRI FT VGDFRSRDLAAGGQGAPLVPAFHEALFRDNRETRAVLNIGG I ANISVLPPDA 
I I I I : I I I I I II 1 M I II I I I I M M I I I I I I I I II I : I I I I I I I I II I I I I I I I I I I I I 
AERTQI FTVGDFRSRDLAAGGQGAPLVPAFHEALFRDDRETRAVLNIGGIANISVLPPDA 
130 140 150 160 170 180 

190 200 210 220 230 240 

PAFGFDTGPGNMLMDAWTQAHWQLPYDKNGAKAAQGNILPQLLDRLLAHPYFAQPHPKST 
I I I I I I M I I M I I I I I I I I I M I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 
PAFG FDTGPGNMLM DAWMQAHWQLPY DKNGAKAAQGN I LPQLLDRLLAH PY FAQPH PKST 
190 200 210 220 230 240 

250 260 270 280 290 300 

GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVCDAVSHAAADARQMY ICGGGIRNPV 
I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I M I I II I I I I I I M I I I I 
GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVFDAVSHAAADARQMYICGGGIRNPV 
250 260 270 280 290 300 



ml21-l . pep 
al21-l 

ml21-l .pep 
al21-l 

ml21-l . pep 
a!21-l 

m!21-l .pep 
al21-l 

ml21-l . pep 
al21-l 
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310 320 330 340 350 360 

ml21-l . pep LMADLAECFGTRVSLHSTADLNLDPQWVEAAXFAWLAACWINRIPGSPHKATGASKPCIL 
M I I M M M I I M I It I I : I M I I I It II I It I : I I I I : I II M I I 1 i I I I I i M t I I 
a 1 2 1 LMADLAEC FGTRVS LH STAELN LD PQW VE AAAFAWMAACWVNR IPGS PHKATGAS K PC I L 

310 320 330 340 350 360 



ml21-l.pep XAGYYYX 
MINI 

al21 GAGYYYX 



128 and 128-1 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3085>: 

ACGCACTGCT CCATTTGGGC GAAGAACCCC GTTTTGATCA 
GAAGACATCA AACCCGCCCT GCAAACCGCC ATCGCCGAAG 
AATCGCCGCC ATCAAAGCCC AAACGCACAC CGGCTGGGCA 
AACCCCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 
TCGCACCTCA ACTGCGTCGC CGACACGCCC GAACTGCGCG 
CGAACTGATG CCCGAAATCA CCGTCTTCTT CACCGAAATC 
TCGAGCTGTA CAACCGCTTC AAAAC CAT C A AAAATTC CCC 
ACCCTCTCCC CCGCACAAAA AACCAAACTC AACCAC 
AAAAACTGCG CGAAGCCAAA TACGCGTTCA GCGAAACCGA 
TAyTTCCCyG TCGGCAAwGT ATTAAACGGA CTGTTCGCCC 
ACTmTACGGC ATCGGATTTA CCGAAAAAAC yGTCCCCGTC 
ACGTGCGCTA TTkTGAATTG CAACAAAACG G CGAAmC CAT 
TATATGGATT TGTACGCACG CGAAGG C AAA CGCGGCGGCG 
CGACTACAAA GGCCGCCGCC GTTTTTCAGA CGGCACGCTG 
CCGCCTACCT CGTCTGCAAC TTCGCCCCAC CCGTCGGCGG 
CGCyTGAGCC ACGACGAAAT CCTCATCCTC TTCCACGAAA 
GCTGCACCAC CTGCTTACCC AAGTGGACGA ACTGGGCGTA 
ACGGCGTAkA ATGGGACGCG GTCGAACTGC CCAGCCAGTT 
TTCGTTTGGG AATACAATGT CTTGGCACAA mTGTCAGCCC 
CGGcgTTCCC yTGCCGAAAG AACTCTT sGA CAAAwTGCTC 
ACTTCCAAsG CGGCATGTTC yTsGTCCGGC AAwTGGAGTT 
GATATGATGA TTTACAGCGA AGACGACGAA GGCCGTCTGA 
ACAGGTTTTA GACAGCGTGC GCAAAAAAGT CGCCGTCATC 
AATACAACCG CTTCGCCTTG AGCTTCGGCC ACATCTTCGC 
TCCGCAGCTn ATTACAGCTA CGCGTGGGCG GAAGT ATT G A 
ATACGCCGCC TTTGAAGAAA GCGACGATGT CGCCGCCACA 
TTTGGCAGGA AATCCTCGCC GTCGGGGnAT CGCGCAGCGG 
TTCAAAGCCT TCCGCGGCCG CGAACCGAGC AT AG ACG C AC 
CAGCGGTTTC GACAACGCGG TCTGA 

This corresponds to the amino acid sequence <SEQ ID 3086; ORF 128>: 

ml28 . pep (partial ) 

1 MTDNALLHLG EEPRFDQIKT EDIKPALQTA IAEAREQIAA I KAQTHTGWA 

51 NTVEPLTGIT ERVGRIWGW SHLNCVADTP ELRAVYNELM PEITVFFTEI 

101 GQDIELYNRF KTIKNSPEFD TLSPAQKTKL NH 



ml28 . seq 


(partial ) 


1 


ATGACTGACA 


51 


AATCAAAACC 


101 


CGCGCGAACA 


151 


AACACTGTCG 


201 


GGGCGTGGTG 


251 


CCGTCTATAA 


301 


GGACAAGACA 


351 


CGAATT CGAC 


1 


TACGCCAGCG 


51 


wGTCAAAAAA 


101 


AAmTCAAAAA 


151 


TGGCACAAAG 


201 


AGGCGGCGTT 


251 


CGTGGATGAA 


301 


CAAyTGCCCA 


351 


CAGGGAAGCC 


401 


CCGGACACGG 


451 


TCCGGCATCA 


501 


TATGGAAAAT 


551 


ACGAAGAAAC 


601 


GCCGCCAAAA 


651 


CGCCCTCTTT 


701 


AAAACTGG C A 


751 


CAGCCGCCCG 


801 


AGGCGGCTAT 


851 


GCGCGGACGC 


901 


GGCAAACGCT 


951 


nGCAGAATCC 


1001 


TCTTGCGCCA 



1 Y AS E KL RE A K YAFSETXVKK YFPVGXVLNG LFAQXKKLYG IGFTEKTVPV 

51 WHKDVRYXEL QQNGEXIGGV YMDLYAREGK RGGAWMNDYK GRRUFSDGTL 

101 QLPTAYLVCN FAPPVGGREA RLSHDEILIL FHETGHGLHH LLTQVDELGV 

151 SGINGVXWDA VELPSQFMEN FVWEYNVLAQ XSAHEETGVP LPKELXDKXL 

201 AAKNFQXGMF XVRQXEFALF DMMIYSEDDE GRLKNWQQVL DSVRKKVAVI 

251 QPPEYNRFAL SFGHIFAGGY SAAXYSYAWA EVLSADAYAA FEESDDVAAT 

301 GKRFWQEILA VGXSRSGAES FKAFRGREPS I DALLRHSGF DNAV* 
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The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 3087>: 

gl28.seq 

1 atgattgaca acgCActgct ccacttgggc gaagaaccCC GTTTTaatca 

51 aatccaaacc gaagACAtca AACCCGCCGT CCAAACCGCC ATCGCCGAAG 

101 CG CGCGG AC A AATCGCCGCC GTCAAAGCGC AAACGCACAC CGGCTGGGCG 

151 AACACCGTCG AGCGTCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 

201 GGGCGTCGTG TCCCATCTCA ACTCCGTCGT CGACACGCCC GAACTGCGCG 

2 51 CCGTCTATAA CGAACTGATG CCTGAAATCA CCGTCTTCTT CACCGAAATC 

301 GGACAAGACA TCGAACTGTA CAACCGCTTC AAAACC AT C A AAAATTCCCC 

351 CGAATTTGCA ACGCTTTCCC CCGCACAAAA AACCAAGCTC GATCACGACC 

401 TGCGCGATTT CGTATTGAGC GGCGCGGAAC TGCCGCCCGA ACGGCAGGCA 

451 GAACTGGCAA AACTGCAAAC CGAAGGCGCG CAACTTTCCG CCAAATTCTC 

501 C CAAAACGT C CTAGACGCGA CCGACGCGTT CGGCATTTAC TTTGACGATG 

551 CCGCACCGCT TGCCGGCATT CCCGAAGACG CGCTCGCCAT GTTTGCCGCC 

601 GCCGCGCAAA GCGAAGGCAA AACAGGTTAC AAAATCGGCT TGCAGATTCC 

651 GCACTACCTT GCCGTTATCC AATACGCCGG CAACCGCGAA CTGCGCGAAC 

701 AAATCT AC CG CGCCTACGTT ACCCGTGCCA GCGAACTTTC AAACGACGGC 

751 AAATT CGACA ACACCGCCAA CATCGACCGC ACGCTCGAAA ACGCATTGAA 

801 AACCGccaaa cTGCTCGGCT TTAAAAATTA CGCCGAATTG TCGCTGGCAA 

851 CCAAAATGGC GGACACGCCC GAACAGGTTT TAAACTTCCT GCACGACCTC 

901 GCCCGCCGCG CCAAACCCTA CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 

9 51 CTTCGCCCGC GAACACCTCG GTCTCGCCGA CCCGCAGCCG TGGGACTTGA 

10 01 GCTACGCCGG CGAAAAACTG CGCGAAGCCA AATACGCATT CAGCGAAACC 

1051 GAAGTCAAAA AATACTTCCC CGTCGGCAAA GTTCTGGCAG GCCTGTTCGC 

1101 CCAAATCAAA AAACTCTACG GCATCGGATT CGCCGAAAAA ACCGTTCCCG 

1151 TCTGGCACAA AGACGTGCGC TATTTTGAAT TGCAACAAAA CGGCAAAACC 

12 01 ATCGGCGGCG TTTATATGGA TTTGTACGCA CGCGAAGGCA AACGCGGCGG 
1251 CG CGTGG AT G AACGACtaca AAGGCCGCCG CCGCTTTGCC GACGgcacGC 

13 01 TGCAACTGC.C CACCGCCTAC CTCGTCTGCA ACTTCGCCCC GCCCGTCGGC 

13 51 GGCAAAGAAG CGCGTTTAAG CCACGACGAA ATCCTCACCC TCTTCCACGA 

14 01 AacCGGCCAC GGACTGCACC ACCTGCTTAC CCAAGTGGAC GAACTGGGCG 
14 51 TGTCCGGCAT CAAcggcgtA GAATGGGACG CGGTCGAACT GCCCAGCCAG 
1501 TTTATGGAAA ACTTCGTTTG GGAATACAAT GTATTGGCAC AAATGTCCGC 
1551 CCACGAAGAA AccgGCGAGC CCCTGCCGAA AGAACTCTTC GACAAAATGC 
1601 TcgcCGCCAA AAACTTCCAG CGCGGTATGT TCCTCGTCCG GCAAATGGAG 
1651 TTCGCCCTCT TCGATATGAT GATTTACAGT GAAAGCGACG AATGCCGTCT 
1701 GAAAAACTGG CAGCAGGTTT TAGACAGCGT GCGCAAAGAA GTcGCCGTCA 
1751 TCCAACCGCC CGAATACAAC CGCTTCGCCA ACAGCTTCGG CCacatCtTC 
1801 GCcggcGGCT ATTCCGCAGG CTATTACAGC TACGCATGGG CCGAAGTCCt 
1851 CAGCACCGAT GCCTACGCCG CCTTTGAAGA AAGcGACGac gtcGCCGCCA 
1901 CAGGCAAACG CTTCTGGCAA GAAAtccttg ccgtcggcgg ctCCCGCAGC 
1951 gcgGCGGAAT CCTTCAAAGC CTTCCGCGGA CGCGAACCGA GCATAGACGC 
2001 ACTGCTGCGC CAaagcggtT TCGACAACGC gGCttgA 

This corresponds to the amino acid sequence <SEQ ID 3088; ORF 128.ng>: 

gl28 .pep 

1 MIDNALLHLG EEPRFNQIQT EDIKPAVQTA IAEARGQIAA VKAQTHTGWA 

51 NTVERLTGIT ERVGRIWGW SHLNSWDTP ELRAVYNELM PEITVFFTEI 

101 GQDIELYNRF KTIKNSPEFA TLSPAQKTKL DHDLRDFVLS GAELPPERQA 

151 ELAKLQTEGA QLSAKFSQNV LDATDAFGIY FDD AAP LAG I PEDALAMFAA 

2 01 AAQSEGKTGY KIGLQIPHYL AVIQYAGNRE LREQIYRAYV TRASELSNDG 
251 KFDNTANIDR TLENALKTAK LLGFKNYAEL SLATKMADTP EQVLNFLHDL 

3 01 ARRAKPYAEK DLAEVKAFAR EHLGLADPQP WDLSYAGEKL REAKYAFSET 
351 EVKKYFPVGK VLAGLFAQIK KLYGIGFAEK TVPVWHKDVR YFELQQNGKT 

4 01 I GGVYMDLY A REGKRGGAWM NDYKGRRRFA DGTLQLPTAY LVCNFAPPVG 
4 51 GKEARLSHDE I LTLFHETGH GLHHLLTQVD ELGVSGINGV EWDAVELPSQ 
501 FMENFVWEYN VLAQMSAHEE TGEPLPKELF DKMLAAKNFQ RGMFLVRQME 
551 FALFDMMI YS ESDECRLKNW QQVLDSVRKE VAVIQPPEYN RFANSFGHIF 
601 AGGYSAGYYS YAWAEVLSTD AYAAFEESDD VAATGKRFWQ EILAVGGSRS 
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651 AAESFKAFRG REPSIDALLR QSGFDNAA* 



ORF 128 shows 91 .7% identity over a 475 aa overlap with a predicted ORF (ORF 128.ng) 
from N. gonorrhoeae: 

ml28/gl28 



10 20 30 40 50 60 

gl2 8 .pep M I DNALLHLGEE PRFNQ I QTEDI KPAVQTAI AEARGQ I AAVKAQTHTGW ANTVERLTG I T 

I | I M! I M I , M ' -I U I I I! I ':'M I | j | ; | |] [ : | | || I I I I I I 1 I I MM! 
ml2 8 MTDNALLHLGEEPRFDQIKTEDIKPALQTAIAEAREQIAAIKAQTHTGWANTVEPLTGIT 

10 20 30 40 50 60 



70 80 90 100 110 120 

gl2 8 .pep ERVGRI WGVVSHLiNS VVDTPELRAVYNELMPE I TVFFTEIGQDI ELYNRFKTI KNS PEFA 

MMMMMMM I = M M M M M M M M M M M I M M M M M M M M M I 

ml2 8 ERVGRI WGVVSHLNCVADTPELRAVYNELMPEITVFFTE I GQDI ELYNRFKTI KNS PEFD 

70 80 90 100 110 120 



130 140 150 160 170 180 

gl2 8 . pep TLSPAQKTKLDHDLRDFVLSGAELPPERQAELAKLrQTEGAQLSAKFSQNVLDATDAFGIY 

IMMMMhl 

ml 2 8 TLSPAQKTKLNH 

130 

// 

340 350 360 

g!2 8 .pep Y AGE KLREAKY AF SET E V KKY F P VG KVLAG 

I ' : M I I I i I I I I I I I MMMM I I I 
ml 2 8 YAS E KLREAKYAF S E TX VKKY FPVGXVLNG 

10 20 30 



370 380 390 400 410 420 

gl28 . pep L FAQ I K KL YG I G FAE KTV P VWH KD VR Y FE LQQNG KT I GGVYMD L Y ARE G KRGG AWMND Y K 

MM I I I M I I I H I M 1 1 1 1 1 I 1 1 I IIMM.-I MMM I Ml MMliM M Ml 

ml28 L» F AQX KKL YG I G FTE KTV P VWH KDVR YX E DQQNG EX I GGVYMD LYARE G KRGG AWMND Y K 

40 50 60 70 80 90 



430 440 450 460 470 480 

gl28 . pep GRRRFADGTLQLPTAYLVCNFAPPVGGKEARLSHDE I LTLFHETGHGLHHLLTQVDELGV 

MMMMMMMMMMMMMMMMMMM MMMMMMMMMM! 

ml2 8 GRRRFSDGTLQLPTAYLVCNFAPPVGGREARLSHDEILILFHETGHGLHHLLTQVDEIiGV 

100 110 120 130 140 150 



490 500 510 520 530 540 

gl28 . pep SGINGVEWDAVELPSQFMENFVWEYNVLAQMSAHEETGEPLPKELFDKMLAAKNFQRGMF 

MMM MMMMMMMMMMMI MMMI MMM I I I MINI Ml 

ml 2 8 SGINGVXWDAVELPSQFMENFVWEYNVLAQXSAHEETGVPLPKELXDKXLAAKNFQXGMF 

160 170 180 190 200 210 



550 560 570 580 590 600 

gl28 . pep LVRQMEFALFDMMIYSESDECRLKNWQQVLDSVRKEVAVIQPPEYNRFANSFGHIFAGGY 

Ml M I M M M M I = M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 • I 1 1 1 1 1 1 I! 1 1 1 I M M M M M 

ml2 8 XVRQXEFALFDMMIYSEDDEGRLKNWQQVLDSVRKKVAVIQPPEYNRFALSFGHIFAGGY 

220 230 240 250 260 270 



610 620 630 640 650 660 

gl28 . pep SAGYYSYAWAEVLSTDAYAAFEESDDVAATGKRFWQEILAVGGSRSAAESFKAFRGREPS 

Ih MMMMMMMMMMMMMMMMMMM MIMMMMMMM 

ml28 SAAXYSYAWAEVLSADAYAAFEESDDVAATGKRFWQEILAVGXSRSGAESFKAFRGREPS 

280 290 300 310 320 330 
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670 679 
gl2 8 .pep IDALLRQSGFDNAAX 

IMMhlllllh 
ml2 8 IDALLRHSGFDNAVX 

340 



The following partial DNA sequence was 

a!28 . seq 

1 ATGACTGACA ACGCACTGCT 

51 AATCAAAACC GAAGACATCA 

101 CGCGCGAACA AATCGCCGCC 

151 AACACTGTCG AACCCCTGAC 

201 GGGCGTGGTG TCGCACCTCA 

251 CCGCCTACAA TGAATTAATG 

301 GGACAAGACA TCGAGCTGTA 

351 CGAGTTCGAC ACCCTCTCCC 

4 01 TGCGCGATTT CGTCCTCAGC 

4 51 GAATTGGCAA AACTGCAAAC 

501 CCAAAACGTC CTAGACGCGA 

551 CCGCACCGCT TGCCGGCATT 

601 GCCGCGCAAA GCGAAGGCAA 

651 GCACTACCTC GCCGTCATCC 

7 01 AAATCTACCG CGCCTACGTT 

7 51 AAATTCGACA ACACCGCCAA 

801 AACCGCCAAA CTGCTCGGCT 

851 CCAAAATGGC GGACACCCCC 

901 GCCCGCCGCG CCAAACCCTA 

951 CTTCGCCCGC GAAAGCCTCG 

1001 GCTACGCCGG CGAAAAACTG 

1051 GAAGTCAAAA AATACTTCCC 

1101 CCAAATCAAA AAACTCTACG 

1151 TCTGGCACAA AGACGTGCGC 

1201 ATAGGCGGCG TTTATATGGA 

1251 CGCGTGGATG AACGACTACA 

1301 TGCAACTGCC CACCGCCTAC 

1351 GGCAAAGAAG CCCGCTTGAG 

14 01 AACCGGACAC GGCCTGCACC 

14 51 TATCCGGCAT CAACGGCGTA 

1501 TTTATGGAAA ATTTCGTTTG 

1551 CCACGAAGAA ACCGGCGTTC 

1601 TCGCCGCCAA AAACTTCCAA 

1651 TTCGCCCTCT TTGATATGAT 

1701 GAAAAACTGG CAACAGGTTT 

17 51 TCCGACCGCC CGAATACAAC 

1801 GCAGGCGGCT ATTCCGCAGG 

1851 GAGCGCGGAC GCATACGCCG 

1901 CAGGCAAACG CTTTTGGCAG 

1951 GCGGCAGAAT CCTTCAAAGC 

2001 ACTCTTGCGC CACAGCGGCT 



identified in N. meningitidis <SEQ ID 3089>: 



CCATTTGGGC 
AACCCGCCCT 
ATCAAAGCCC 
CGGCATCACC 
ACTCCGTCAC 
CCCGAAATTA 
CAACCGCTTC 
ACGCGCAAAA 
GGCGCGGAAC 
CGAAGGCGCG 
CCGACGCGTT 
CCCGAAGACG 
AACAGGCTAC 
AATACGCCGA 
ACCCGCGCCA 
CATCGACCGC 
TCAAAAACTA 
GAACAAGTTT 
CGCCGAAAAA 
GCCTCGCCGA 
CGCGAAGCCA 
CGTCGGCAAA 
GCATCGGATT 
TATTTTGAAT 
TTTGTACGCA 
AAGGCCGCCG 
CTCGTCTGCA 
C CAT G AC G AA 
ACCTGCTTAC 
GAATGGGACG 
GGAATACAAT 
CCCTGCCGAA 
CGCGGAATGT 
GATTTACAGC 
TAGACAGCGT 
CGCTTCGCCA 
CTATTACAGC 
CCTTTGAAGA 
GAAATCCTCG 
CTTCCGCGGA 
TCGACAACGC 



GAAGAACCCC 
GCAAACCGCC 
AAACGCACAC 
GAACGCGTCG 
CGACACGCCC 
CCGTCTTCTT 
AAAACCAT C A 
AACCAAACTC 
TGCCGCCCGA 
CAACTTTCCG 
CGGCATTTAC 
CGCTCGCCAT 
AAAATCGGTT 
CAACCGCAAA 
GCGAGCTTTC 
ACGCTCGAAA 
CGCCGAATTG 
TAAACTTCCT 
GACCTCGCCG 
TTTGCAACCG 
AATACGCATT 
GTATTAAACG 
TACCGAAAAA 
TGCAACAAAA 
CGCGAAGGCA 
CCGTTTTTCA 
ACTTCACCCC 
ATCCTCACCC 
CCAAGTCGAC 
CAGTCGAACT 
GTCTTGGCGC 
AGAACTCTTC 
TCCTCGTCCG 
GAAGACGACG 
GCGCAAAGAA 
ACAGCTTCGG 
TACGCGTGGG 
AAGCGACGAT 
CCGTCGGCGG 
CGCGAACCGA 
GGCTTGA 



GTTTTGATCA 
ATTGCCGAAG 
CGGCTGGGCA 
GCAGGATTTG 
GAACTGCGCG 
CACCGAAATC 
AAAACTCCCC 
AACCACGATC 
ACAGCAGGCA 
CCAAATTCTC 
TTTGACGATG 
GTTTGCCGCT 
TGCAGATTCC 
CTGCGCGAAC 
AGACGACGGC 
ACGCCCTGCA 
TCGCTGGCAA 
GCACGACCTC 
AAGTCAAAGC 
TGGGACTTGG 
CAGCGAAACC 
GACTGTTCGC 
ACCGTCCCCG 
CGGCGAAACC 
AACGCGGCGG 
GACGGCACGC 
GCCCGTCGGC 
TCTTCCACGA 
GAACTGGGCG 
GCCCAGTCAG 
AAATGTCCGC 
GACAAAATGC 
CCAAATGGAG 
AAGGCCGTCT 
GTCGCCGTCG 
CCACATCTTC 
CGGAAGTATT 
GTCGCCGCCA 
ATCGCGCAGC 
GCATAGACGC 



This corresponds to the amino acid sequence <SEQ ID 3090; ORF 128.a>: 

al28.pep 

1 MTDNALLHLG EEPRFDQIKT EDIKPALQTA IAEAREQIAA IKAQTHTGWA 

51 NTVEPLTGIT ERVGRIWGVV SHLNSVTDTP ELRAAYNELM PEITVFFTEI 

101 GQDTELYNRF KTIKNSPEFD TLSHAQKTKL NHDLRDFVLS GAELPPEQQA 

151 ELAKLQTEGA QLSAKFSQNV LDATDAFGIY FDDAAPLAGI PEDALAMFAA 

201 AAQSEGKTGY KIGLQIPHYL AVIQYADNRK LREQIYRAYV TRASELSDDG 

251 KFDNTANIDR TLENALQTAK LLG FKNYAEL SLATKMADTP EQVLNFLHDL 

301 ARRAKPYAEK DLAEVKAFAR ESLGLADLQP WDLGYAGEKL REAKYAFSET 

351 EVKKYFPVGK VLNGLFAQIK KLYGIGFTEK TVPVWHKDVR YFELQQNGET 

4 01 IGGVYMDLYA REGKRGGAWM NDYKGRRRFS DGTLQLPTAY LVCNFTPPVG 

451 GKEARLSHDE ILTLFHETGH GLHHLLTQVD ELGVSGINGV EWDAVELPSQ 

501 FMENFVWEYN VLAQMSAHEE TGVPLPKELF DKMLAAKNFQ RGMFLVRQME 
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551 FALFDMMIYS EDDEGRLKNW QQVLDSVRKE VAWRPPEYN RFANSFGHIF 
601 AGGYSAGYYS YAWAEVLSAD AYAAFEESDD VAATGKRFWQ EILAVGGSRS 
651 AAESFKAFRG REPSIDALLR HSGFDNAA* 

ml28/al28 ORFs 128 and 128.a showed a 66.0% identity in 677 aa overlap 

10 20 30 40 50 60 

ml2 8 .pep MTDNALLHLGEEPRFDQIKTEDIKPALQTAIAEAREQIAAIKAQTHTGWANTVEPLTGIT 
I I I I I I I I I I I I M I I II I I M M I I I M I I I I I i I 1 I I I ! I ! M I I I I M I II I I M I 1 
a l28 MTDNALLHLGEEPRFDQIKTEDIKPALQTAIAEAREQIAAIKAQTHTGWANTVEPLTGIT 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml28 .pep ERVGRIWGWSHLNCVADTPELRAVYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD 
I I I I I I I I I I I I I I I : I 1 M I I I : I I I II I I I I M I I I I I I I I I I I I I I I M M I I I ! I 
a 12 8 ERVGRIWGWSHLNSVTDTPELRAAYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD 

70 80 90 100 110 120 

130 

ml 2 8 . pep TLS PAQKTKLNH 

III I I I I I I M 

a l2 8 TLSHAQKTKLNHDLRDFVLSGAELPPEQQAELAKLQTEGAQLSAKFSQNVLDATDAFGIY 

130 140 150 160 170 180 



ml AM. pep 

al28 FDDAAPLAGIPEDALAMFAAAAQSEGKTGYKIGLQIPHYLAVIQYADNRKLREQIYRAYV 

190 200 210 220 230 240 



ml28.pep 

a l28 TRASELSDDGKFDNTANIDRTLENALQTAKLLGFKNYAELSLATKMADTPEQVLNFLHDL 

250 260 270 280 290 300 

140 150 

ml28 . pep YASEKLREAKYAFSETXVKKY FPVGX 

I I : I I I I I I It I I I I I M I I I M I 
a 128 ARRAKPYAEKDLAEVKAFARESLGLADLQPWDLGYAGEKLREAKYAFSETEVKKYFPVGK 

310 320 330 340 350 360 

160 170 180 190 200 210 

ml28 . pep VLNGLFAQXKKLYGIGFTEKTVPVWHKDVRYXELQQNGEXIGGVYMDLYAREGKRGGAWM 
I | | | | I I I I I I I I I I I I I M I II I I I I I I I I I I I I I I : I M I I I I I I II I 1 I I I I I II 
a 128 VLNGLFAQIKKLYGIGFTEKTV PVWHKDVRY FELQQNGETIGGVYMDLYAREGKRGGAWM 

370 380 390 400 410 420 

220 230 240 250 260 270 

ml28 . pep NDYKGRRRFSDGTLQLPTAYLVCNFAPPVGGREARLSHDEILILFHETGHGLHHLLTQVD 
I I I I I I I I I I II I I I M I I I I I I I I : I I I I I : I I I I I M I I I I I I I II II I M M I I I I 
a 128 NDYKGRRRFSDGTLQLPTAYLVCNFTPPVGGKEARLSHDEILTLFHETGHGLHHLLTQVD 

430 440 450 460 470 480 

280 290 300 310 320 330 

ml28 . pep ELGVSGINGVXWDAVELPSQFMENFVWEYNVLAQXSAHEETGVPLPKELXDKXLAAKNFQ 
I I i I M I I I I II M I I I II It I M I I I I I I I I I I I I I II I II I M I I II I I I M I I 
a 128 ELGVSGINGVEWDAVELPSQFMENFVWEYNVLAQMSAHEETGVPLPKELFDKMLAAKNFQ 

490 500 510 520 530 540 

340 350 360 370 380 390 

ml28 . pep XGMFXVRQXEFALFDMMI YSEDDEGRLKNWQQVLDSVRKKVAVIQPPEYNRFALSFGHIF 

Ml III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I :: I I I I I I I I I I I I I I 
al28 RGMFLVRQME FALFDMMIYSEDDEGRLKNWQQVLDSVRKEVAWRPPEYNRFANSFGHIF 

550 560 570 580 590 600 
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400 410 420 430 440 450 

ml28 . pep AGGYSAAXYSYAWAEVLSADAYAAFEESDDVAATGKRFWQEILAVGXSRSGAESFKAFRG 
MINI: I I 1 It I M I M I I II I I I M ! It I M I II I I I I I I M 1 II I : I I I I I I I I I 
al2 8 AGGYSAGYYSYAWAEVLSADAYAAFEESDDVAATGKRFWQEILAVGGSRSAAESFKAFRG 

610 620 630 640 650 660 

460 470 
ml2 8 .pep REPSIDALLRHSGFDNAVX 

I I I I I I II I I I I I I I I I : 
al28 REPSI D ALLRH SG FDNAAX 

670 



Further work revealed the DNA sequence identified in TV. meningitidis <SEQ ID 309 1>: 

ml28-l . seq 

1 AT G ACT G AC A ACGCACTGCT CCATTTGGGC GAAGAACCCC GTTTTGATCA 

51 AATCAAAACC GAAGACATCA AACCCGCCCT GCAAACCGCC ATCGCCGAAG 

101 CGCGCGAACA AATCGCCGCC ATCAAAGCCC AAACGCACAC CGGCTGGGCA 

151 AACACTGTCG AACCCCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 

201 GGGCGTGGTG TCGCACCTCA ACTCCGTCGC CGACACGCCC GAACTGCGCG 

251 CCGTCTATAA CGAACTGATG CCCGAAATCA CCGTCTTCTT CACCGAAATC 

301 GGACAAGACA TCGAGCTGTA CAACCGCTTC AAAAC CAT C A AAAATTCCCC 

351 CGAATTCGAC ACCCTCTCCC CCGCACAAAA AACCAAACTC AACCACGATC 

401 TGCGCGATTT CGTCCTCAGC GGCGCGGAAC TGCCGCCCGA ACAGCAGGCA 

4 51 GAACTGGCAA AACTGCAAAC CGAAGGCGCG CAACTTTCCG CCAAATTCTC 

501 ■ CCAAAACGTC CTAGACGCGA CCGACGCGTT CGGCATTTAC TTTGACGATG 

551 CCGCACCGCT TGCCGGCATT CCCGAAGACG CGCTCGCCAT GTTTGCCGCC 

601 GCCGCGCAAA GCGAAAGCAA AACAGGCTAC AAAATCGGCT TGCAGATTCC 

651 ACACTACCTC GCCGTCATCC AATACGCCGA CAACCGCGAA CTGCGCGAAC 

701 AAATCTACCG CGCCTACGTT ACCCGCGCCA GCGAACTTTC AG AC G AC G G C 

7 51 AAATTCGACA ACACCGCCAA CATCGACCGC ACGCTCGCAA ACGCCCTGCA 

801 AACCGCCAAA CTGCTCGGCT T C AAAAACT A CGCCGAATTG TCGCTGGCAA 

851 CCAAAATGGC GGACACGCCC GAACAAGTTT TAAACTTCCT GCACGACCTC 

901 GCCCGCCGCG CCAAACCCTA CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 

951 CTTCGCCCGC GAAAGCCTGA ACCTCGCCGA TTTGCAACCG TGGGACTTGG 

1001 GCTACGCCAG CGAAAAACTG CGCGAAGCCA AATACGCGTT CAGCGAAACC 

1051 GAAGTCAAAA AATACTTCCC CGTCGGCAAA GTATTAAACG GACTGTTCGC 

1101 CCAAATCAAA AAACTCTACG GCATCGGATT TACCGAAAAA ACCGTCCCCG 

1151 TCTGGCACAA AGACGTGCGC TATTTTGAAT TGCAACAAAA CGGCGAAACC 

1201 ATAGGCGGCG TTTATATGGA TTTGTACGCA CGCGAAGGCA AACGCGGCGG 

1251 CGCGTGGATG AACGACTACA AAGGCCGCCG CCGTTTTTCA GACGGCACGC 

1301 TGCAACTGCC CACCGCCTAC CTCGTCTGCA ACTTCGCCCC ACCCGTCGGC 

1351 GGCAGGGAAG CCCGCCTGAG CCACGACGAA ATCCTCATCC TCTTCCACGA 

14 01 AACCGGACAC GGGCTGCACC ACCTG CTT AC CCAAGTGGAC GAACTGGGCG 

14 51 TATCCGGCAT CAACGGCGTA GAATGGGACG CGGTCGAACT GCCCAGCCAG 

1501 TTTATGGAAA ATTTCGTTTG GGAATACAAT GTCTTGGCAC AAATGTCAGC 

1551 CCACGAAGAA ACCGGCGTTC CCCTGCCGAA AGAACTCTTC GACAAAATGC 

1601 TCGCCGCCAA AAACTTCCAA CGCGGCATGT TCCTCGTCCG GCAAATGGAG 

1651 TTCGCCCTCT TT GAT AT GAT GATTTACAGC GAAGACGACG AAGGCCGTCT 

1701 GAAAAACTGG CAACAGGTTT TAGACAGCGT GCGCAAAAAA GTCGCCGTCA 

17 51 TCCAGCCGCC CGAATACAAC CGCTTCGCCT TGAGCTTCGG CCACATCTTC 

1801 GCAGGCGGCT ATTCCGCAGG CTATTACAGC TACGCGTGGG CGGAAGTATT 

1851 GAGCGCGGAC GCATACGCCG CCTTTGAAGA AAGCGACGAT GTCGCCGCCA 

1901 CAGGCAAACG CTTTTGGCAG GAAATCCTCG CCGTCGGCGG ATCGCGCAGC 

1951 GCGGCAGAAT CCTTCAAAGC CTTCCGCGGC CGCGAACCGA GCATAGACGC 

2001 ACTCTTGCGC CACAGCGGTT TCGACAACGC GGTCTGA 

This corresponds to the amino acid sequence <SEQ ID 3092; ORF 128-1>: 

ml28-l .pep. 

1 MTDNALLHLG EEPRFDQIKT EDIKPALQTA IAEAREQIAA IKAQTHTGWA 

51 NTVEPLTGIT ERVGRIWGW SHLNSVADTP ELRAVYNELM PEITVFFTEI 
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101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 



GQDIELYNRF 
ELAKLQTEGA 
AAQSESKTGY 
KFDNTANIDR 
ARRAKPYAEK 
EVKKYFPVGK 
IGGVYMDLYA 
GREARLSHDE 
FMENFVWEYN 
FALFDMMIYS 
AGGYSAGYYS 
AAESFKAFRG 



KTIKNSPEFD 
QLSAKFSQNV 
KIGLQIPHYL 
TLANALQTAK 
DLAEVKAFAR 
VLNGLFAQIK 
REGKRGGAWM 
ILILFHETGH 
VLAQMSAHEE 
EDDEGRLKNW 
YAWAEVLSAD 
REPSIDALLR 



TLSPAQKTKL 
LDATDAFGIY 
AVIQYADNRE 
LLGFKNYAEL 
ESLNLADLQP 
KLYGIGFTEK 
NDYKGRRRFS 
GLHHLLTQVD 
TGVPLPKELF 
QQVLDSVRKK 
AYAAFEESDD 
HSGFDNAV* 



NHDLRDFVLS 
FDDAAPLAGI 
LREQIYRAYV 
SLATKMADTP 
WDLGYASEKL 
TVPVWHKDVR 
DGTLQLPTAY 
ELGVSGINGV 
DKMLAAKNFQ 
VAVIQPPEYN 
VAATGKRFWQ 



GAELPPEQQA 
PEDALAMFAA 
TRASELSDDG 
EQVLNFLHDL 
REAKYAFSET 
YFELQQNGET 
LVCNFAPPVG 
EWDAVELPSQ 
RGMFLVRQME 
RFALSFGHIF 
EILAVGGSRS 



The following DNA sequence was i 



gl28-l.seq (partial) 




1 


ATGATTGACA 


ACGCACTGCT 


51 


AATCAAAACC 


G AAG AC AT C A 


101 


CGCGCGGACA 


AATCGCCGCC 


151 


AACACCGTCG 


AGCGTCTGAC 


201 


GGGCGTCGTG 


TCCCATCTCA 


251 


CCGTCTATAA 


CGAACTGATG 


301 


GGACAAGACA 


TCGAACTGTA 


351 


CGAATTTGCA 


ACGCTTTCCC 


401 


TGCGCGATTT 


CGTATTGAGC 


451 


GAACTGGCAA 


AACTGCAAAC 


501 


CCAAAACGTC 


CTAGACGCGA 


551 


CCGCACCGCT 


TGCCGGCATT 


601 


GCCGCGCAAA 


GCGAAGGCAA 


651 


GCACTACCTT 


GCCGTTATCC 


701 


AAATCTACCG 


CGCCTACGTT 


751 


AAATTCGACA 


ACACCGCCAA 


801 


AACCGCCAAA 


CTGCTCGGCT 


851 


CCAAAATGGC 


GGACACGCCC 


901 


GCCCGCCGCG 


CCAAACCCTA 


951 


CTTCGCCCGC 


GAACACCTCG 


1001 


GCTACGCCGG 


CGAAAAACTG 


1051 


GAAGTCAAAA 


AATACTTCCC 


1101 


CCAAATCAAA 


AAACTCTACG 


1151 


TCTGGCACAA 


AGACGTGCGC 


1201 


ATCGGCGGCG 


TTTATATGGA 


1251 


CGCGTGGATG 


AACGACTACA 


1301 . 


TGCAACTGCC 


CACCGCCTAC 


1351 


GGCAAAGAAG 


CGCGTTTAAG 


1401 


AACCGGCCAC 


GGACTGCACC 


1451 


TGTCCGGCAT 


CAACGGCGTA 



ied in N. gonorrhoeae <SEQ ID 3093>: 

CCACTTGGGC GAAGAACCCC GTTTTAATCA 
AACCCGCCGT CCAAACCGCC ATCGCCGAAG 
GTCAAAGCGC AAACGCACAC CGGCTGGGCG 
CGGCATCACC GAACGCGTCG GCAGGATTTG 
ACTCCGTCGT CGACACGCCC GAACTGCGCG 
CCTGAAATCA CCGTCTTCTT CACCGAAATC 
CAACCGCTTC AAAACCATCA AAAATTCCCC 
CCGCACAAAA AACCAAGCTC GATCACGACC 
GGCGCGGAAC TGCCGCCCGA ACGGCAGGCA 
CGAAGGCGCG CAACTTTCCG CCAAATTCTC 
CCGACGCGTT CGGCATTTAC TTTGACGATG 
CCCGAAGACG CGCTCGCCAT GTTTGCCGCC 
AACAGGTTAC AAAATCGGCT TGCAGATTCC 
AATACGCCGG CAACCGCGAA CTGCGCGAAC 
ACCCGTGCCA GCGAACTTTC AAACGACGGC 
CATCGACCGC ACGCTCGAAA ACGCATTGAA 
TTAAAAATTA CGCCGAATTG TCGCTGGCAA 
GAACAGGTTT TAAACTTCCT GCACGACCTC 
CGCCGAAAAA GACCTCGCCG AAGT CAAAGC 
GTCTCGCCGA CCCGCAGCCG TGGGACTTGA 
CGCGAAGCCA AATACGCATT CAGCGAAACC 
CGTCGGCAAA GTTCTGGCAG GCCTGTTCGC 
GCATCGGATT CGCCGAAAAA ACCGTTCCCG 
TATTTTGAAT TGCAACAAAA CGGCAAAACC 
TTTGTACGCA CGCGAAGGCA AACGCGGCGG 
AAGGCCGCCG CCGCTTTGCC GACGGCACGC 
CTCGTCTGCA ACTTCGCCCC GCCCGTCGGC 
CCACGACGAA ATCCTCACCC TCTTCCACGA 
ACCTGCTTAC CCAAGTGGAC GAACTGGGCG 
AAA 



This corresponds to the amino acid sequence <SEQ ID 3094; ORF 128-l.ng>: 

gl28-l.pep (partial) 

1 MIDNALLHLG EEPRFNQIKT EDIKPAVQTA IAEARGQIAA VKAQTHTGWA 

51 NTVERLTGIT ERVGRIWGVV SHLNSWDTP ELRAVYNELM PEITVFFTEI 

101 GQDIELYNRF KTIKNSPEFA TLSPAQKTKL DHDLRDFVLS GAELPPERQA 

151 ELAKLQTEGA QLSAKFSQNV LDATDAFGIY FDDAAPLAGI PEDALAMFAA 

201 AAQSEGKTGY KIGLQIPHYL AVIQYAGNRE LREQIYRAYV TRASELSNDG 

251 KFDNTANIDR TLENALKTAK LLGFKNYAEL SLATKMADTP EQVLNFLHDL 

301 ARRAKPYAEK DLAEVKAFAR EHLGLADPQP WDLSYAGEKL REAKYAFSET 

351 EVKKYFPVGK VLAGLFAQIK KLYGIGFAEK TVPVWHKDVR YFELQQNGKT 

4 01 IGGVYMDLYA REGKRGGAWM NDYKGRRRFA DGTLQLPTAY LVCNFAPPVG 

4 51 GKEARLSHDE ILTLFHETGH GLHHLLTQVD ELGVSGINGV K 



ml28-l/gl28-l ORFs 128-1 and 128-1. ng showed a 94.5% identity in 491 aa 
overlap 

10 20 30 40 50 60 
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gl28-l .pep MIDNALLHLGEEPRFNQIKTEDIKPAVQTAIAEARGQIAAVKAQTHTGWANTVERLTGIT 
I I I I I I I 11 I I I M : I I I i I I I I i I : I I M I I M I I I I : II I M I I I I I M I I I I I I 
ml 2 8-1 MTDNALLHLGEEPRFDQIKTEDIKPALQTAIAEAREQIAAIKAQTHTGWANTVEPLTGIT 

10 20 30 40 50 60 

70 80 90 100 110 120 

gl2 8-l .pep ERVGRIWGVVSHLNSVVDTPELRAVYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFA 

M i i I M I i I M i I I I : M I I I i M i I I i I i I I I i I M M i I I I I II I I II I M I I i M 

ml28-l ERVGRIWGWSHLNSVADTPELRAVYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD 

70 80 90 100 110 120 

130 140 150 160 170 180 

g!2 8-l .pep TLSPAQKTKLDHDLRDFVLSGAELPPERQAELAKLQTEGAQLSAKFSQNVLDAT DAFGIY 
M I I II I I I I : I I I 9 I I I I I I I II I I I : I I I 1 I I I II ! I M I I I I I I I i I I I I I I I I I I I 
ml2 8-l TLSPAQKTKLNHDLRDFVLSGAELPPEQQAE LAKLQTEGAQLSAKFSQNVLDAT DAFGIY 

130 140 150 160 170 180 

190 200 210 220 230 240 

gl28-l .pep FDDAAPLAGIPEDALAMFAAAAQSEGKTGYKIGLQI PHYLAVIQYAGNRELREQI YRAYV 
M I I I I I M I I II I J II I II I I I I I : I II I I I M I I I I I I I 11 I I I I I I I I I I I I I I I I 
ml 2 8 - 1 FDDAAPLAGI PEDALAMFAAAAQSESKTGYKIGLQI PHYLAVIQYADNRELREQI YRAYV 

190 200 210 220 230 240 

250 260 270 280 290 300 

g 1 2 8 - 1 . pep TRASELSN DGKFDNTANI DRTLENALKTAKLLG FKN YAELS LATKMADT PEQVLN FLHDL 
I I M I I I : I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I t I I 
ml2 8-l TRASELSDDGKFDNTAN I DRTLANALQTAKLLGFKNYAELSLATKMADT PEQVLN FLHDL 

250 260 270 280 290 300 

310 320 330 340 350 360 

gl28-l . pep ARRAKPYAEKDLAEVKAFAREHLGLADPQPWDLSYAGEKLREAKYAFSETEVKKYFPVGK 

I I 11 I I II I I I I II I I I I M I 1:111 I I II I : I I : M I I I I I I I I I I I I I I I I I I M I 
ml28-l ARRAKPYAEKDLAEVKAFARESLNLADLQPWDLGYASEKLREAKYAFSETEVKKYFPVGK 

310 320 330 340 350 360 

370 380 390 400 410 420 

gl2 8-l.pep VLAGLFAQIKKLYGIGFAEKTVPVWHKDVRYFELQQNGKTIGGVYMDLYAREGKRGGAWM 

II I I II I I I I I I I I I I : I I I I I M I I I I I I I I I I I I I : I II I I I I M I I I I II I I M I I 
ml2 8-l VLNGLFAQIKKLYGIGFTEKTVPVWHKDVRYFELQQNGETIGGVYMDLYAREGKRGGAWM 

370 380 390 400 410 420 

430 440 450 460 470 480 

gl28-l .pep NDYKGRRRFADGTLQLPTAYLVCNFAPPVGGKEARLSHDEILTLFHETGHGLHHLLTQVD 
I I I I I II I I : I I I I I I I I I I I I I I M I I I I I : M I I I I I I I I II I I I I I I I I M I I I I I 
m!2 8-l NDYKGRRRFSDGTLQLPTAYLVCNFAPPVGGREARLSHDEILILFHETGHGLHHLLTQVD 

430 440 450 460 470 480 

490 

gl28-l .pep ELGVSGINGVK 
I I I I I I I I I I : 

ml2 8-l ELGVSGINGVEWDAVELPSQFMENFVWEYNVLAQMSAHEETGVPLPKELFDKMLAAKNFQ 

490 500 510 520 530 540 

The following DNA sequence was identified in N. meningitidis <SEQ ID 3095>: 

al28-l . seq 

1 AT G ACT G AC A ACGCACTGCT CCATTTGGGC GAAGAACCCC GTTTTGATCA 

51 AATCAAAACC GAAGACATCA AACCCGCCCT GCAAACCGCC ATTGCCGAAG 

101 CGCGCGAACA AATCGCCGCC ATCAAAGCCC AAACGCACAC CGGCTGGGCA 

151 AACACTGTCG AACCCCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 

201 GGGCGTGGTG TCGCACCTCA ACTCCGTCAC CGACACGCCC GAACTGCGCG 

251 CCGCCTACAA TGAATTAATG CCCGAAATTA CCGTCTTCTT CACCGAAATC 

301 GGACAAGACA TCGAGCTGTA CAACCGCTTC AAAAC CAT C A AAAACTCCCC 

351 CGAGTTCGAC ACCCTCTCCC ACGCGCAAAA AACCAAACTC AACCACGATC 

4 01 TGCGCGATTT CGTCCTCAGC GGCGCGGAAC TGCCGCCCGA ACAGCAGGCA 

4 51 GAATTGGCAA AACTGCAAAC CGAAGGCGCG CAACTTTCCG CCAAATTCTC 
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501 CCAAAACGTC CTAGACGCGA CCGACGCGTT CGGCATTTAC TTTGACGATG 

551 CCGCACCGCT TGCCGGCATT CCCGAAGACG CGCTCGCCAT GTTTGCCGCT 

601 GCCGCGCAAA GCGAAGGCAA AACAGGCTAC AAAATCGGTT TGCAGATTCC 

651 GCACTACCTC GCCGTCATCC AATACGCCGA CAACCGCAAA CTGCGCGAAC 

7 01 AAATCTACCG CGCCTACGTT ACCCGCGCCA GCGAGCTTTC AGACGACGGC 

7 51 AAATTCGACA ACACCGCCAA CATCGACCGC ACGCTCGAAA ACGCCCTGCA 

801 AACCGCCAAA CTGCTCGGCT TCAAAAACTA CGCCGAATTG TCGCTGGCAA 

851 CCAAAATGGC GGACACCCCC GAACAAGTTT TAAACTTCCT GCACGACCTC 

901 GCCCGCCGCG CCAAACCCTA CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 

951 CTTCGCCCGC GAAAGCCTCG GCCTCGCCGA TTTGCAACCG TGGGACTTGG 

1001 GCTACGCCGG CGAAAAACTG CGCGAAGCCA AATACGCATT CAGCGAAACC 

1051 GAAGTCAAAA AATACTTCCC CGTCGGCAAA GTATTAAACG GACTGTTCGC 

1101 CCAAATCAAA AAACTCTACG GCATCGGATT TACCGAAAAA ACCGTCCCCG 

1151 TCTGGCACAA AGACGTGCGC TATTTTGAAT TGCAACAAAA CGGCGAAACC 

1201 ATAGGCGGCG TTTATATGGA TTTGTACGCA CGCGAAGGCA AACGCGGCGG 

1251 CGCGTGGATG AACGACTACA AAGGCCGCCG CCGTTTTTCA GACGGCACGC 

1301 TGCAACTGCC CACCGCCTAC CTCGTCTGCA ACTTCACCCC GCCCGTCGGC 

1351 GGCAAAGAAG CCCGCTTGAG CCATGACGAA ATCCTCACCC TCTTCCACGA 

14 01 AACCGGACAC GGCCTGCACC ACCTGCTTAC CCAAGTCGAC GAACTGGGCG 

14 51 TATCCGGCAT CAACGGCGTA GAATGGGACG CAGTCGAACT GCCCAGTCAG 

1501 TTTATGGAAA ATTTCGTTTG GGAATACAAT GTCTTGGCGC AAATGTCCGC 

1551 CCACGAAGAA ACCGGCGTTC CCCTGCCGAA AGAACTCTTC GACAAAATGC 

1601 TCGCCGCCAA AAACTTCCAA CGCGGAATGT TCCTCGTCCG CCAAATGGAG 

1651 TTCGCCCTCT TTGATATGAT GATTTACAGC GAAGACGACG AAGGCCGTCT 

17 01 GAAAAACTGG CAACAGGTTT TAGACAGCGT GCGCAAAGAA GTCGCCGTCG 

17 51 TCCGACCGCC CGAATACAAC CGCTTCGCCA ACAGCTTCGG CCACATCTTC 

1801 GCAGGCGGCT ATTCCGCAGG CTATTACAGC TACGCGTGGG CGGAAGTATT 

1851 GAGCGCGGAC GCATACGCCG CCTTTGAAGA AAGCGACGAT GTCGCCGCCA 

1901 CAGGCAAACG CTTTTGGCAG GAAATCCTCG CCGTCGGCGG ATCGCGCAGC 

1951 GCGGCAGAAT CCTTCAAAGC CTTCCGCGGA CGCGAACCGA GCATAGACGC 

2001 ACTCTTGCGC CACAGCGGCT TCGACAACGC GGCTTGA 



This corresponds to the amino acid sequence <SEQ ID 3096; ORF 128-1. a>: 

al28-l .pep 

1 MTDNALLHLG EEPRFDQIKT EDIKPALQTA IAEAREQIAA IKAQTHTGWA 

51 NTVEPLTGIT ERVGRIWGW SHLNSVTDTP ELRAAYNELM PEITVFFTEI 

101 GQDIELYNRF KTIKNSPEFD TLSHAQKTKL NHDLRDFVLS GAELPPEQQA 

151 ELAKLQTEGA QLSAKFSQNV LDATDAFGIY FDDAAPLAGI PEDALAMFAA 

201 AAQSEGKTGY KIGLQIPHYL AVIQYADNRK LREQIYRAYV TRASELSDDG 

251 KFDNTANIDR TLENALQTAK LLGFKNYAEL SLATKMADTP EQVLNFLHDL 

301 ARRAKPYAEK DLAEVKAFAR ESLGLADLQP WDLGYAGEKL REAKYAFSET 

351 EVKKYFPVGK VLNGLFAQIK KLYGIGFTEK TVPVWHKDVR YFELQQNGET 

401 IGGVYMDLYA REGKRGGAWM NDYKGRRRFS DGTLQLPTAY LVCNFTPPVG 

4 51 GKEARLSHDE ILTLFHETGH GLHHLLTQVD ELGVSGINGV EWDAVELPSQ 

501 FMENFVWEYN VLAQMSAHEE TGVPLPKELF DKMLAAKNFQ RGMFLVRQME 

551 FALFDMMIYS EDDEGRLKNW QQVLDSVRKE VAWRPPEYN RFANSFGHIF 

601 AGGYSAGYYS YAWAEVLSAD AYAAFEESDD VAATGKRFWQ EILAVGGSRS 

651 AAESFKAFRG REPSIDALLR HSGFDNAA* 

ml28-l/al28-l ORFs 128-1 and 128-1. a showed a 97.8% identity in 677 aa overlap 



10 20 30 40 50 60 

al28-l .pep MT DNALLHLGEE PRFDQIKTE DI KPALQTAI AEAREQI AAI KAQTHTGWANTVE PLTG IT 
I I I M E I I I I I I I I I M I I I M I I t I I I I II I I M I I I I I I ! I I II I I I I I I ! 11 I I I ! I 
ml28-l MTDNALLHLGEEPRFDQIKTEDIKPALQTAIAEAREQIAAIKAQTHTGWANTVEPLTGIT 

10 20 30 40 50 60 



70 80 90 100 110 120 

al 28-1 .pep ERVGRIWGWSHLNSVTDTPELRAAYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD 
I I I I I I I II I I I I M 1 : I I I I II I : I I I I I I I I I M I I I I I I I I I I I M I I I I I I M M I 
ml2 8-l ERVGRIWGVVSHLNSVADTPELRAVYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD 

70 80 90 100 110 120 

130 140 150 160 170 180 
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al28-l . pep TLSHAQKTKLNHDLRDFVLSGAELPPEQQAELAKLQTEGAQLSAKFSQNVLDATDAFGI Y 
IN I I I I I I I I I I M I I I I II ! I I I I ! I II I I I I I I I I I I I I I M I I M I I I I t I I I I I 
ml28-l TLSPAQKTKLNHDLRDFVLSGAELPPEQQAELAKLQTEGAQLSAKFSQNVLDATDAFGIY 

130 140 150 160 170 180 



190 200 210 220 230 240 

a 1 2 8 - 1 . pep FDDAAPLAG I PEDALAMFAAAAQSEGKTGYKIGLQI PH Y LAVI QYADNRKLREQI YRAYV 
I I I I I II I I I I I I I I I I I I I II I! I : I I I I I I I II I I I M I M 1 I I I I I : I I II I I I I I I 
ml28-l FDDAAPLAG I PEDALAMFAAAAQSESKTGYKIGLQIPHYLAVIQYADNRELREQI YRAYV 

190 200 210 220 230 240 



250 260 270 280 290 300 

al28-l.pep T RASELS DDGKFDNT AN I DRTLEN ALQTAKLLG FKN YAELSLATKMADT PEQVLN FLHDL 
I I I I I I I I II I I I I I I I I I I I I I I M I I I I M I I I II I I I I i I I I I I I I I I I I 1 I I I I I 
ml28-l TRASELSDDGKFDNTANIDRTLANALQTAKLLGFKNYAELSLATKMADTPEQVLNFLHDL 

250 260 270 280 290 300 



310 320 330 340 350 360 

al28-l . pep ARRAKPYAEKDLAEVKAFARESTLGLADLQPWDLGYAGEKLREAKYAFSETEVKKYFPVGK 
I I M I I I I I I I I I I I I I I I I I I I : I I I I I I I I I II I : I I I I I M I I I I I I I I M I I I I I I 
ml28-l ARRAKPYAEKDLAEVKAFARESLNLADLQPWDLGYASEKLREAKYAFSETEVKKYFPVGK 

310 320 330 340 350 360 

370 380 390 400 410 420 

al2 8-l . pep VLNGLFAQIKKLYGIGFTEKTVPVWHKDVRYFELQQNGETIGGVYMDLYAREGKRGGAWM 
I I I I I II i I I I II I M I I! I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I M I II I M I 
ml28-l VLNGLFAQIKKLYGIGFTEKTVPVWHKDVRYFELQQNGETIGGVYMDLYAREGKRGGAWM 

370 380 390 400 410 420 



430 440 450 460 470 480 

al28-l.pep NDYKGRRRFSDGTLQLPTAYLVCNFTPPVGGKEARLSHDEILTLFHETGHGLHHLLTQVD 
M I I I I M I I I I I I I I I I I I I I I I I : I I M I : I I I I I I I I I I I I I I I I M I I I I 1 I I M 
ml2 8-l NDYKGRRRFSDGTLQLPTAYLVCNFAPPVGGREARLSHDEILILFHETGHGLHHLLTQVD 

430 440 450 460 470 480 

490 500 510 520 530 540 

al28-l . pep ELGVSGINGVEWDAVELPSQFMENFVWEYNVLAQMSAHEETGVPLPKELFDKMLAAKNFQ 
I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I M I I I I I I I I I I M M I I I I I I I I I I I 
ml28-l ELGVSGINGVEWDAVELPSQFMENFVWEYNVLAQMSAHEETGVPLPKELFDKMLAAKNFQ 

490 500 510 520 530 540 



550 560 570 580 590 600 

a!28-l . pep RGMFLVRQMEFALFDMMIYSEDDEGRLKNWQQVLDSVRKEVAVVRPPEYNRFANSFGHIF 
I I I I I II I I I I I I I I I t I I M I I M I I I I I I I 1 I I I I I I : II 1 :: i I I I I I I I I I II I I 
ml28-l RGMFLVRQMEFALFDMMIYSEDDEGRLKNWQQVLDSVRKKVAVIQPPEYNRFALSFGHIF 

550 560 570 580 590 600 



610 620 630 640 650 660 

al28-l .pep AGGYSAGYYSYAWAEVLSADAYAAFEESDDVAATGKRFWQEILAVGGSRSAAESFKAFRG 
I I I I I I I I I M I I I 1 II I M I I I I I I I I I I I I I I I I M I I II I I I I I I I I I I I I I I I M I 
ml 2 8 - 1 AGGYSAGYYS YAW AEVLS ADA YAA FEES DDVAATGKRFWQE I LAVGGSRS AAES FKAFRG 

610 620 630 640 650 660 



670 679 
a 1 2 8 - 1 . pep RE PS I DALLRHSGFDNAAX 

I I I I I I II I I I I I I I I I : 
ml 2 8 - 1 RE PS I DALLRHSGFDN AVX 

670 



206 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3097>: 

m206 . seq 
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1 ATGTTTCCCC CCGACAAAAC CCTTTTCCTC TGTCTCAGCG CACTGCTCCT 

51 CGCCTCATGC GGCACGACCT CCGGCAAACA CCGCCAACCG AAACCCAAAC 

101 AG AC AG T C C G GCAAATCCAA. GCCGTCCGCA TCAGCCACAT CGACCGCACA 

151 CAAGGCTCGC AGG AAC T CAT GCTCCACAGC CTCGGACTCA TCGGCACGCC 

201 C T AC AAATGG GGCGGCAGCA GCACCGCAAC CGG CTTCGAT TGCAGCGGCA 

251 TGATTCAATT CGTTTACAAr AACGCCCTCA ACGTCAAGCT GCCGCGCACC 

301 GCCCGCGACA TGGCGGCGGC AAGCCGsAAA ATCCCCGAcA GCCGCyTCAA 

351 GGCCGGCGAC CTCGTATTCT TCAACACCGG CGGCGCACAC CGCTACTCAC 

401 ACGT CGG ACT CTACATCGGC AACGGCGAAT TCATCCATGC CCCCAGCAGC 

4 51 GGCAAAACCA T CAAAAC CG A AAAACTCTCC ACACCGTTTT ACGCCAAAAA 

501 CTACCTCGGC GCACATACTT TTTTTACAGA ATGA 

This corresponds to the amino acid sequence <SEQ ID 3098; ORF 206>: 
m2 06 . pep . • 

1 MFPPDKTLFL CLSALLLASC GTTSG KHRQP KPKQTVRQIQ AVRISHIDRT 

51 QGSQELMLHS LGLIGTPYKW GGSSTATGFD CSGMIQFVYK NALNVKLPRT 

101 ARDMAAASRK IPDSRXKAGD LVFFNTGGAH RYSHVGLYIG NGEFIHAPSS 

151 GKTIKTEKLS TPFYAKNYLG AHTFFTE* 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 3099>: 

g206 . seq 

1 atgttttccc ccgacaaaac ccttttcctc tgtctcggcg cactgctcct 

51 cgcctcatgc ggcacgacct ccggcaaaca ccgccaaccg aaacccaaac 

101 agacagtccg gcaaatccaa gccgtccgca tcagccacat cggccgcaca 

151 caaggctcgc aggaactcat gctccacagc ctcggactca tcggcacgcc 

2 01 ctacaaatgg ggcggcagca gcaccgcaac cggcttcgac tgcagcggca 

251 tgattcaatt ggtttacaaa aacgccctca acgtcaagct gccgcgcacc 

301 gcccgcgaca tggcggcggc aagccgcaaa atccccgaca gccgcctcaa 

351 ggccggcgac atcgtattct tcaacaccgg cggcgcacac cgctactcac 

401 acgtcggact ctacatcggc aacggcgaat tcatccatgc ccccggcagc 

4 51 ggcaaaacca tcaaaaccga aaaactctcc acaccgtttt acgccaaaaa 

501 ctaccttgga gcgcatacgt tttttacaga atga 
This corresponds to the amino acid sequence <SEQ ID 3100; ORF 206.ng>: 

g206 . pep 

1 MFSPDKTLFL CLGALLLASC GTTSG KHRQP KPKQTVRQIQ AVRISHIGRT 

51 QGSQELMLHS LGLIGTPYKW GGSSTATGFD CSGMIQLVYK NALNVKLPRT 

101 ARDMAAASRK IPDSRLKAGD I V F FNTGGAH RYSHVGLYIG NGEFIHAPGS 

151 GKTIKTEKLS TPFYAKNYLG AHTFFTE* 



ORF 206 shows 96.0% identity over a 177 aa overlap with a predicted ORF (ORF 206.ng) 
from N. gonorrhoeae: 

m206/g206 

10 20 30 40 50 60 

m206 . pep MFPPDKTLFLCLSALLLASCGTTSGKHRQPKPKQTVRQIQAVRISHIDRTQGSQELMLHS 

II I I IN I I i hi I III MINIMI 1,1 I I: III i I li I II MM i I I ! 1 M 1 1 t I I 

g206 MFSPDKTLFLCLGALLLASCGTTSGKHRQPKPKQTVRQIQAVRISHIGRTQGSQELMLHS 

10 20 30 40 50 60 

70 80 90 100 110 120 

m206 . pep LGLIGTPYKWGGSSTATGFDCSGMIQFVYKNALNVKLPRTARDMAAASRKI PDSRXKAGD 

IMMIII Ml MM M I MM M! Ml I II III III ! I! ! I M I MM I M IM I Ml! 

g206 LG L I GTP YKWGGS S TATGFDCSGMI QLVYKNALNVKLPRTARDMAAASRKI PDS RLKAGD 

70 80 90 100 110 120 



130 140 150 160 170 

m206 . pep LV FFNTGGAHRYSHVGL Y I GNGE F I HAPS SG KT I KTE KLST PFY AKNYLGAHTFFTEX 

M II II MMM III IM MM MMMM I MIMI I II Ml I I II II I MM Ml 

g206 IVFFNTGGAHRYSHVGLYIGNGEFIHAPGSGKTIKTEKLSTPFYAKNYLGAHTFFTE 
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130 140 150 160 170 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3101>: 

a206 . seq 

1 ATGTTTCCCC CCGACAAAAC CCTTTTCCTC TGTCTCAGCG CACTGCTCCT 

51 CGCCTCATGC GGCACGACCT CCGGCAAACA CCGCCAACCG AAACCCAAAC 

101 AGACAGTCCG GCAAATCCAA GCCGTCCGCA TCAGCCACAT CGACCGCACA 

151 CAAGGCTCGC AGGAACTCAT GCTCCACAGC CTCGGACTCA TCGGCACGCC 

2 01 CTACAAATGG GGCGGCAGCA GCACCGCAAC CGGCTTCGAT TGCAGCGGCA 

251 TGATTCAATT CGTTTACAAA AACGCCCTCA ACGTCAAGCT GCCGCGCACC 

301 GCCCGCGACA TGGCGGCGGC AAGCCGCAAA ATCCCCGACA GCCGCCTTAA 

351 GGCCGGCGAC CTCGTATTCT TCAACACCGG CGGCGCACAC CGCTACTCAC 

4 01 ACGTCGGACT CTATATCGGC AACGGCGAAT TCATCCATGC CCCCAGCAGC 

4 51 GGCAAAACCA TCAAAACCGA AAAACTCTCC ACACCGTTTT ACGCCAAAAA 

501 CTACCTCGGC GCACATACTT TCTTTACAGA ATGA 

This corresponds to the amino acid sequence <SEQ ID 3102; ORF 206. a>: 

a20 6.pep 

1 MFPPDK TLFL CLSALLLASC GTT SGKHRQP KPKQTVRQIQ AVRISHIDRT 

51 QGS QELMLHS LGLIGTPYKW GGSSTATGFD CSGMIQFVYK NALNVKLPRT 

101 ARDMAAASRK IPDSRLKAGD LVFFNTGGAH RYSHVGLYIG NGEFIHAPSS 

151 GKTIKTEKLS TPFYAKNYLG AHTFFTE* 

m206/a206 ORFs 206 and 206.a showed a 99,4% identity in 177 aa overlap 

10 20 30 40 50 60 

m20 6.pep MFPPDKTLFLCLSALLLASCGTTSGKHRQPKPKQTVRQIQAVRISHI DRTQGSQELMLHS 

I i I I II I I I M I I I I I I I I I I I! M II I 1 I 1! I I ! I I I I M M M I ! I I I M I I I I M I I 
a2 0 6 MFPPDKTLFLCLSALLLASCGTTSGKHRQPKPKQTVRQIQAVRI SHI DRTQGSQELMLHS 

10 20 30 40 50 60 

70 80 90 100 110 120 

m206. pep LGLIGTPYKWGGSSTATGFDCSGMIQFVYKNALNVKLPRTARDbdAAASRKIPDSRXKAGD 
I I 1 I M I I I M I I I I I I I I I I I I M I I I I I I I I i 1 I I I I M M 11 M I I I I II I I I I I I 
a 20 6 LGLIGTPYKWGGSSTATGFDCSGMIQFVYKNALNVKLPRTARDMAAASRKIPDSRLKAGD 

70 80 90 100 110 120 

130 140 150 160 170 

m206 .pep LVFFNTGGAHRYSHVGLYIGNGEFIHAPSSGKTIKTEKLSTPFYAKNYLGAHTFFTEX 
I I I I I II I I I I ! I I M I I I I I I I I I I II I I ! I I ! I I I I I I I I I I I I I I I I I I I I I I I I 
a 20 6 LVFFNTGGAHRYSHVGLYIGNGEFIHAPSSGKTIKTEKLSTPFYAKNYLGAHTFFTEX 

130 140 150 160 170 



287 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3103>: 

m287 . seq 

1 ATGTTTAAAC GCAGCGTAAT CGCAATGGCT TGTATTTTTG CCCTTTCAGC 

51 CTGCGGGGGC GGCGGTGGCG GATCGCCCGA TGTCAAGTCG GCGGACACGC 

101 TGTCAAAACC TGCCGCCCCT GTTGTTTCTG AAAAAGAGAC AGAGGCAAAG 

151 GAAGATGCGC CACAGGCAGG TTCTCAAGGA CAGGGCGCGC CATCCGCACA 

2 01 AGGCAGTCAA GATATGGCGG CGGTTTCGGA AGAAAATACA GGCAATGGCG 

251 GTGCGGTAAC AGCGGATAAT CCCAAAAATG AAGACGAGGT GGCACAAAAT 

301 GATATGCCGC AAAATGCCGC CGGTACAGAT AGTTCGACAC CGAATCACAC 

351 CCCGGATCCG AATATGCTTG CCGGAAATAT GGAAAATCAA GCAACGGATG 

4 01 CCGGGGAATC GTCTCAGCCG GCAAACCAAC CGGATATGGC AAATGCGGCG 

4 51 GACGGAATGC AGGGGGACGA TCCGTCGGCA GGCGGGCAAA ATGCCGGCAA 

501 TACGGCTGCC CAAGGTGCAA ATCAAGCCGG AAACAATCAA GCCGCCGGTT 

551 CTTCAGATCC CATCCCCGCG TCAAACCCTG CACCTGCGAA TGGCGGTAGC 
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601 AATTTTGGAA GGGTTGATTT GGCTAATGGC GTTTTGATTG ACGGGCCGTC 

651 GCAAAATATA ACGTTGACCC ACTGTAAAGG CGATTCTTGT AGTGGCAATA 

7 01 ATTTCTTGGA TGAAGAAGTA CAGCTAAAAT CAGAATTTGA AAAATTAAGT 

7 51 GATGCAGACA AAATAAGTAA TTACAAGAAA GATGGGAAGA ATGATAAATT 

801 TGTCGGTTTG GTTGCCGATA GTGTGCAGAT GAAGGGAATC AATCAATATA 

851 TTATCTTTTA TAAACCTAAA CCCACTTCAT TTGCGCGATT TAGGCGTTCT 

901 GCACGGTCGA GGCGGTCGCT TCCGGCCGAG ATGCCGCTGA TTCCCGTCAA 

951 TCAGGCGGAT ACGCTGATTG TCGATGGGGA AGCGGTCAGC CTGACGGGGC 

1001 ATTCCGGCAA TATCTTCGCG CCCGAAGGGA ATTACCGGTA TCTGACTTAC 

1051 GGGGCGGAAA AATTGCCCGG CGGATCGTAT GCCCTTCGTG TTCAAGGCGA 

1101 ACCGGCAAAA GGCGAAATGC TTGCGGGCGC GGCCGTGTAC AACGGCGAAG 

1151 TACTGCATTT CC AT ACGGAA AACGGCCGTC CGTACCCGAC CAGGGGCAGG 

1201 TTTGCCGCAA AAGTCGATTT CGGCAGCAAA TCTGTGGACG GCATTATCGA 

12 51 CAGCGGCGAT GATTTGCATA TGGGTACGCA AAAATTCAAA GCCGCCATCG 

1301 ATGGAAACGG CTTTAAGGGG ACTTGGACGG AAAATGGCAG CGGGGATGTT 

1351 TCCGGAAAGT TTTACGGCCC GGCCGGCGAG GAAGTGGCGG GAAAATACAG 

1401 CTATCGCCCG ACAGATGCGG AAAAGGG CGG ATTCGGCGTG TTTGCCGGCA 

14 51 AAAAAG AG C A GGATTGA 



This corresponds to the amino acid sequence <SEQ ID 3104; ORF 287>: 

m287 .pep 

1 MFKRSVIAMA CIFALSA CGG GGGGSPDVKS ADTLSKPAAP WSEKETEAK 

51 EDAPQAGSQG QGAPSAQGSQ DMAAVSEENT GNGGAVTADN PKNEDEVAQN 

101 DMPQNAAGTD SSTPNHTPDP NMLAGNMENQ ATDAGESSQP ANQPDMANAA 

151 DGMQGDDPSA GGQNAGNTAA QGANQAGNNQ AAGSSDPIPA SNPAPANGGS 

201 NFGRVDLANG VLIDGPSQNI TLTHCKGDSC SGNNFLDEEV QLKSEFEKLS 

251 DADKISNYKK DGKNDKFVGL VADSVQMKGI NQYIIFYKPK PTSFARFRRS 

301 ARSRRSLPAE MPLIPVNQAD TLIVDGEAVS LTGHSGNIFA PEGNYRYLTY 

351 - GAEKLPGGSY ALRVQGEPAK GEMLAGAAVY NGEVLHFHTE NGRPYPTRGR 

401 FAAKVDFGSK SVDGIIDSGD DLHMGTQKFK AAIDGNGFKG TWTENGSGDV 

4 51 SGKFYGPAGE EVAGKYSYRP TDAEKGGFGV FAGKKEQD* 



The following partial DNA sequence was 

g287 . seq 

1 atgtttaaac gcagtgtgat 

51 ctgtgggggc ggcggtggcg 

101 cgtcaaaacc ggccgccccc 

151 ctgccgaaag aaaagaaaga 

201 cgatacgcag gacgcaaccg 

251 tttcggcaga aaatacaggc 

301 aaaaatgaag acgcgggggc 

351 atccgcaaat caaacaggga 

401 cccccgcgtc aaaccctgcc 

451 acgaacgtgg gcaattctgt 

501 gttgacccac tgtaaaggcg 

551 aagaagcacc gtcaaaatca 

601 attaagcgat ataaaaaaga 

651 tgctgacagg gtaaaaaagg 

701 cggacaaacc acctactcgt 

751 gagattccgc tgattcccgt 

801 ggaagcggtc agcctgacgg 

851 ggaattaccg gtatctgact 

901 tatgccctcc gtgtgcaagg 

951 cacggccgtg tacaacggcg 

1001 gtccgtaccc gtccggaggc 

1051 aaatctgtgg acggcattat 

1101 gcaaaaattc aaagccgcca 

1151 cggaaaatgg cggcggggat 

1201 gaggaagtgg cgggaaaata 

1251 cggattcggc gtgtttgccg 



identified in N. gonorrhoeae <SEQ ID 3105>: 



tgcaatggct 
gatcgcccga 
gttgttgctg 
tgaggaggca 
ccggagaagg 
aatggcggtg 
gcaaaatgat 
acaaccaacc 
cctgcgaatg 
tgtgattgac 
attcttgtaa 
gaatttgaaa 
cgagcaacgg 
atggaactaa 
tctgcacggt 
caatcaggcc 
ggcattccgg 
tacggggcgg 
cgaaccggca 
aagtgctgca 
aggtttgccg 
cgacagcggc 
tcgatggaaa 
gtttccggaa 
cagctatcgc 
gcaaaaaaga 



tgtatttttc 
tgtcaagtcg 
aaaatgccgg 
gcgggcggtg 
cagccaagat 
cggcaacaac 
atgccgcaaa 
cgccggttct 
gcggtagcga 
ggaccgtcgc 
tggtgataat 
aattaagtga 
gagaattttg 
caaatatatc 
cgaggaggtc 
gatacgctga 
caatatcttc 
aaaaattgcc 
aaaggcgaaa 
tttccatatg 
caaaagtcga 
gatgatttgc 
cggctttaag 
ggttttacgg 
ccgacagatg 
tcgggattga 



ccctttcagc 
gcggacacgc 
ggaaggggtg 
cgccgcaagc 
atggcggcag 
ggacaacccc 
atgccgccga 
tcagattccg 
ttttggaagg 
aaaatataac 
ttattggatg 
tgaagaaaaa 
tcggtttggt 
atcttctata 
gcttccggcc 
ttgtggatgg 
gcgcccgaag 
cggcggatcg 
tgcttgttgg 
gaaaacggcc 
tttcggcagc 
atatgggtac 
gggacttgga 
cccggccggc 
ctgaaaaggg 



This corresponds to the amino acid sequence <SEQ ID 3 106; ORF 287.ng>: 

g287 .pep 

1 MFKRSVIAMA CIFPLSA CGG GGGGSPDVKS ADTPSKPAAP WAENAGEGV 
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51 LPKEKKDEEA AGGAPQADTQ DATAGEGSQD MAAVSAENTG NGGAATTDNP 

101 KNEDAGAQND MPQNAAESAN QTGNNQPAGS SDSAPASNPA PANGGSDFGR 

151 TNVGNSWID GPSQNITLTH CKGDSCNGDN LLDEEAPSKS EFEKLSDEEK 

201 IKRYKKDEQR ENFVGLVADR VKKDGTNKYI IFYTDKPPTR SARSRRSLPA 

251 EIPLIPVNQA DTLIVDGEAV SLTGHSGNIF APEGNYRYLT YGAEKLPGGS 

301 YALRVQGEPA KGEMLVGTAV YNGEVLHFHM ENGRPYPSGG RFAAKVDFGS 

351 KSVDGIIDSG DDLHMGTQKF KAAIDGNGFK GTWTENGGGD VSGRFYGPAG 

401 EEVAGKYSYR PTDAEKGGFG VFAGKKDRD* 



m287/g287 ORFs 287 and 287.ng showed a 70.1% identity in 499 aa overlap 

10 20 30 40 49 

m287 . pep MFKRSVIAMACIFALSACGGGGGGSPDVKSADTLSKPAAPWSE KETEA 

MINIMUM I M M I M M M M I M M II I I I I I! : I I : II 

g 2 87 MFKRSVIAMACIFPLSACGGGGGGSPDVKSADTPSKPAAPVVAENAGEGVLPKEKKDEEA 

10 20 30 40 50 60 



50 60 70 80 90 100 109 

m2 87 . pep KEDAPQAGSQGQGAPSAQGSQDMAAVSEENTGNGGAVTADNPKNEDEVAQNDMPQNAAGT 

MM : I I : : : M I I I I I I I I I I M I M : f : I I I I I M t I I I I I II M 
g 287 AGGAPQADTQD — ATAGEGSQDMAAVSAENTGNGGAATTDNPKNEDAGAQNDMPQNAA — 

70 80 90 100 110 

110 120 130 140 150 160 169 

m287 . pep DSSTPNHTPDPNMLAGNMENQATDAGESSQPANQPDMANAADGMQGDDPSAGGQNAGNTA 



g287 



170 180 190 200 210 220 229 

m287 . pep AQGANQAGNNQAAGSSDPIPASNPAPANGGSNFGRVDLANGVLIDGPSQNITLTHCKGDS 
: : M I : I I M Mill I I I I I I I I I 1 I I : I I I : : : : I : I : I I II I I I M I I I I I I I I 
g 2 87 -ES AN QTGNNQPAGS SDSAPASNPAPANGGSDFGRTNVGNSWIDGPSQNITLTHCKGDS 

120 130 140 150 160 170 



230 24 0 250 2 60 270 280 28 9 

m287 .pep CSGNNFLDEEVQLKSEFEKLSDADKISNYKKDGKNDKFVGLVADSVQMKGINQYIIFYKP 

I : | : | : I I I I : I I I II II I I : I I : Mil : : : I I I I II I I : I (=11111 
g 287 CNGDNLLDEEAPSKSEFEKLSDEEKIKRYKKDEQRENFVGLVADRVKKDGTNKYIIFYTD 
180 190 200 210 220 230 

290 300 310 320 330 340 349 

m287 . pep KPTSFARFRRSARSRRSLPAEMPLIPVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRYLT 
I I : I M I II I II I I I : I I I I I I I M I I I I I I II I I II I I II II II I I I I II I I I 

g287 KPPT RSARSRRSLPAEIPLIPVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRYLT 

240 250 260 270 280 290 



350 360 370 380 390 400 409 

m28 7 .pep YGAEKLPGGSYALRVQGEPAKGEMLAGAAVYNGEVLHFHTENGRPYPTRGRFAAKVDFGS 
I | | | M II I I I I I I I I I I I I I M II : I : I I I I I I II I M I I I I I I I : I I I II I I II I I 
g287 YGAEKLPGGSYALRVQGEPAKGEMLVGTAVYNGEVLHFHMENGRPYPSGGRFAAKVDFGS 

300 310 320 330 340 350 



410 420 430 440 450 460 469 

m287 .pep KSVDGIIDSGDDLHMGTQKFKAAIDGNGFKGTWTENGSGDVSGKFYGPAGEEVAGKYSYR 
I II II I I I I I I I I I E I I I I I I I I I M I I I I I I M I I I : I M I I : I I I M I 11 I If II II I 
g 2 87 KSVDGIIDSGDDLHMGTQKFKAAIDGNGFKGTWTENGGGDVSGRFYGPAGEEVAGKYSYR 

360 370 380 390 400 410 



m2 87 .pep 



470 480 489 

PT DAEKGG FG V FAGKKEQDX 
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I I I I I I I II I M II I t : : I I 
g 2 8 7 PT DAE KGG FG V FAGKKDRDX 

420 430 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3107>: 

a287 .seq 

1 ATGTTTAAAC GCAGTGTGAT TGCAATGGCT TGTATTGTTG CCCTTTCAGC 

51 CTGTGGGGGC GGCGGTGGCG GATCGCCCGA TGTTAAGTCG GCGGACACGC 

101 TGTCAAAACC TGCCGCCCCT GTTGTTACTG AAGATGTCGG GGAAGAGGTG 

151 CTGCCGAAAG AAAAGAAAGA TGAGGAGGCG GTGAGTGGTG CGCCGCAAGC 

201 CGATACGCAG GACGCAACCG CCGGAAAAGG CGGTCAAGAT ATGGCGGCAG 

2 51 TTTCGGCAGA AAATACAGGC AATGGCGGTG CGGCAACAAC GGATAATCCC 
301 GAAAATAAAG ACGAGGGACC GCAAAATGAT ATGCCGCAAA ATGCCGCCGA 

3 51 TACAGATAGT TCGACACCGA ATCACACCCC TGCACCGAAT ATGCCAACCA 

4 01 GAGATATGGG AAACCAAGCA CCGGATGCCG GGGAATCGGC ACAACCGGCA 
4 51 AACCAACCGG ATATGGCAAA TGCGGCGGAC GGAATGCAGG GGGACGATCC 
501 GTCGGCAGGG GAAAATGCCG GCAATACGGC AGATCAAGCT GCAAATCAAG 
551 CTGAAAACAA TCAAGTCGGC GGCTCTCAAA ATCCTGCCTC TTCAACCAAT 
601 CCTAACGCCA CGAATGGCGG CAGCGATTTT GGAAGGATAA ATGTAGCTAA 
651 TGGCATCAAG CTTGACAGCG GTTCGGAAAA TGTAACGTTG ACACATTGTA 
7 01 AAGACAAAGT ATGCGATAGA GATTTCTTAG ATGAAGAAGC ACCACCAAAA 
7 51 TCAGAATTTG AAAAATTAAG TGATGAAGAA AAAATTAATA AATATAAAAA 
801 AG AC GAG C AA CGAGAGAATT TTGTCGGTTT GGTTGCTGAC AG G GT AG AAA 
851 AGAATGGAAC TAACAAATAT GTCATCATTT ATAAAGACAA GTCCGCTTCA 
901 TCTTCATCTG CGCGATTCAG GCGTTCTGCA CGGTCGAGGC GGTCGCTTCC 
951 GGCCGAGATG CCGCTGATTC CCGTCAATCA GGCGGATACG CTGATTGTCG 

1001 ATGGGGAAGC GGTCAGCCTG ACGGGGCATT CCGGCAATAT CTTCGCGCCC 

1051 GAAGGGAATT ACCGGTATCT GACTTACGGG GCGGAAAAAT TGTCCGGCGG 

1101 ATCGTATGCC CTCAGTGTGC AAGGCGAACC GGCAAAAGGC GAAATGCTTG 

1151 CGGGCACGGC CGTGTACAAC GGCGAAGTGC TGCATTTCCA TAT GG AAAAC 

1201 GGCCGTCCGT CCCCGTCCGG AGGCAGGTTT GCCGCAAAAG TCGATTTCGG 

1251 CAGCAAATCT GTGGACGGCA TTATCGACAG CGGCGATGAT TTGCATATGG 

1301 GTACGCAAAA ATTCAAAGCC GTTATCGATG GAAACGGCTT TAAGGGGACT 

1351 TGGACGGAAA ATGGCGGCGG GGATGTTTCC GGAAGGTTTT ACGGCCCGGC 

14 01 CGGCGAAGAA GTGGCGGGAA AATACAGCTA TCGCCCGACA GATGCGGAAA 

14 51 AGGGCGGATT CGGCGTGTTT GCCGGCAAAA AAGAGCAGGA TTGA 

This corresponds to the amino acid sequence <SEQ ED 3108; ORF 287. a>: 

a287 .pep 

1 MFKRSVIAMA CIVALSA CGG GGGGSPDVKS ADTLSKPAAP WTEDVGEEV 

51 LPKEKKDEEA VSGAPQADTQ DATAGKGGQD MAAVSAENTG NGGAATTDNP 

101 ENKDEGPQND MPQNAADTDS STPNHTPAPN MPTRDMGNQA PDAGESAQPA 

151 NQPDMANAAD GMQGDDPSAG ENAGNTADQA ANQAENNQVG GSQNPASSTN 

201 PNATNGGSDF GRINVANGIK LDSGSENVTL THCKDKVCDR DFLDEEAPPK 

251 SEFEKLSDEE KINKYKKDEQ RENFVGLVAD RVEKNGTNKY VIIYKDKSAS 

301 SSSARFRRSA RSRRSLPAEM PLIPVNQADT LIVDGEAVSL TGHSGNIFAP 

351 EGNYRYLTYG AEKLSGGSYA LSVQGE PAKG EMLAGTAVYN GEVLHFHMEN 

401 GRPSPSGGRF AAKVDFGSKS VDGIIDSGDD LHMGTQKFKA VIDGNGFKGT 

4 51 WTENGGGDVS GRFYGPAGEE VAGKYSYRPT DAEKGGFGVF AGKKEQD* 

m287/a287 ORFs 287 and 287. a showed a 77.2% identity in 501 aa overlap 

10 20 30 40 49 

m287 . pep MFKRS VI AMACI FALSACGGGGGGS PDVKS ADTLSKPAAP WSE KETEA 

MINIMUM I I I I I I I I I I I I II I ! I I I I I M i I I I I I : I I : II 

a 287 M FKRSVI AMACI VALSACGGGGGGS PDVKS ADTLSKPAAPWTEDVGEEVLPKEKKDEEA 

10 20 30 40 50 60 

50 60 70 80 90 100 109 

m287 . pep KEDAPQAGSQGQGAPSAQGSQDMAAVSEENTGNGGAVTADN PKNEDEVAQNDMPQNAAGT 

MM : I I : : : I : I I I I I I I I I I M I M : I : I I I : I : M M I II I II I I 
a287 VSGAPQADTQ — DAT AGKGGQDMAAV S AENTGNGG AATT DN PENKDEG PQN DM PQN AADT 

70 80 90 100 110 
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110 120 130 140 150 160 169 

m2 87 pep DSSTPNHTPDPNMLAGNMENQATDAGESSQPANQPDMANAADGMQGDDPSAGGQNAGNTA 
MINIMI ! I 1 -: : I Ml I II I M II M II M I M M II I I M I I I I :. I M I I I 
a 2 87 DSSTPNHTPAPNMPTRDMGNQAPDAGESAQPANQPDMANAADGMQGDDPSAG-ENAGNTA 
120 130 140 150 160 170 

170 180 190 200 210 220 229 

m287 .pep AQGANQAGNNQAAGSSDPI PASbJPAPANGGSNFGRVDLANGVLI DGPSQNITLTHCKGDS 

MINI I M : : M : : I : * I! : t II I : M i : : : 1 I I : : I : I : I : I I I I I I 
a 287 DQAANQAENNQVGGSQNPASSTNPNATNGGSDFGRINVANGIKLDSGSENVTLTHCKDKV 
180 190 200 210 220 230 

230 240 250 260 270 280 289 

m287 . pep CSGNNFLDEEVQLKSEFEKLSDADKISNYKKDGKNDKFVGLVADSVQMKGINQYIIFYKP 

I : MINI: II I I I I I I I : I I : : I I I I : : : I I I M I I I : : I I : I : I : I I 
a 287 CD-RDFLDEEAPPKSEFEKLSDEEKINKYKKDEQRENFVGLVADRVEKNGTNKYVIIYKD 
240 250 260 270 280 290 

290 300 310 320 330 340 

m287 .pep KP--TSFARFRRSARSRRSLPAEMPLIPVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRY 
| : | M I I I M M M I I I I I II I II II M I I I II I II I I M II I I I I I I I I I I I I I I 
a 2 87 KSASSSSARFRRSARSRRSLPAEMPLIPVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRY 
300 310 320 330 340 350. 

350 360 370 380 390 400 

m287 pep LTYGAEKLPGGSYALRVQGEPAKGEMLAGAAVYNGEVLHFHTENGRPYPTRGRFAAKVDF 

|| I | | | M MINI I I I I II I I I I I I I : I I I I I I I I I I I I I I I I I : M II I I I I I 
a 2 87 LTYGAEKLSGGSYALSVQGEPAKGEMLAGTAVYNGEVLHFHMENGRPSPSGGRFAAKVDF 
360 370 380 390 400 410 

410 420 430 440 450 460 

m28 7 pep GSKSVDGIIDSGDDLHMGTQKFKAAIDGNGFKGTWTENGSGDVSGKFYGPAGEEVAGKYS 

| | I I M' I I I M I I I M 1 M I I I I I : I I M I I M I I I 1 M : i I 1 I I : M I I I I I I I I I I I I 
a 2 87 GSKSVDGIIDSGDDLHMGTQKFKAVIDGNGFKGTWTENGGGDVSGRFYGPAGEEVAGKYS 
420 430' 440 4 50 4 60 470 

470 480 489 

m287 . pep YRPTDAEKGGFGVFAGKKEQDX 

I I I I I II I I I I I I I I I I I I I I I 
a 287 YRPTDAEKGGFGVFAGKKEQDX 
480 490 



406 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3109>: 

m406 . seq 

1 ATGCAAGCAC GGCTGCTGAT ACCTATTCTT TTTTCAGTTT TTATTTTATC 

51 CGCCTGCGGG ACACTGACAG GTATTCCATC GCATGGCGGA GGTAAACGCT 

101 TTGCGGTCGA ACAAGAACTT GTGGCCGCTT CTGCCAGAGC TGCCGTTAAA 

151 GACATGGATT TACAGGCATT ACACGGACGA AAAGTTGCAT TGTACATTGC 

. 201 CACTATGGGC GACCAAGGTT CAGGCAGTTT GACAGGGGGT CGCTACTCCA 

251 TTGATGCACT GATTCGTGGC GAATACATAA ACAGCCCTGC CGTCCGTACC 

3 01 GATTACACCT ATCCACGTTA CGAAACCACC GCTGAAACAA CATCAGGCGG 

3 51 TTTGACAGGT TTAACCACTT CTTTATCTAC ACTTAATGCC CCTGCACTCT 

4 01 CTCGCACCCA ATCAGACGGT AGCGGAAGTA AAAGCAGTCT GGGCTTAAAT 
4 51 ATTGGCGGGA TGGGGGATTA TCGAAATGAA ACCTTGACGA CTAACCCGCG 
501 CGACACTGCC TTTCTTTCCC ACTTGGTACA GACCGTATTT TTCCTGCGCG 
551 GCATAGACGT TGTTTCTCCT GCCAATGCCG ATACAGATGT GTTTATTAAC 
6 01 ATCGACGTAT TCGGAACGAT ACGCAACAGA AC CGAAATG C ACCTATACAA 
651 TGCCGAAACA CTGAAAGCCC AAACAAAACT GGAATATTTC G CAG TAG AC A 
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701 GAACCAATAA AAAATTGCTC AT C AAA CC AA AAACCAATGC GTTTGAAGCT 

7 51 GCCTATAAAG AAAATTACGC ATTGTGGATG GGGCCGTATA AAGTAAGCAA 

801 AGGAATTAAA CCGACGGAAG GATTAATGGT CGATTTCTCC GATATCCGAC 

851 CATACGGCAA TCATACGGGT AACTCCGCCC CATCCGTAGA GGCTGATAAC 

901 AGTCATGAGG GGTATGGATA CAGCGATGAA GTAGTGCGAC AACATAGACA 

951 AGGACAACCT TGA 



This corresponds to the amino acid sequence <SEQ ID 31 10; ORF 406>: 

xn4 0 6 . pep 

1 MQARLLIPIL FSVFILSA CG TLTGIPSHGG GKRFAVEQEL VAASARAAVK 

51 DMDLQALHGR KVALYIATMG DQGSGSLTGG RYSIDALIRG EYINSPAVRT 

101 DYTYPRYETT AETTSGGLTG LTTSLSTLNA PALSRTQSDG SGSKSSLGLN 

151 IGGMGDYRNE TLTTNPRDTA FLSHLVQTVF FLRGIDWSP ANADTDVFIN 

2 01 IDVFGTIRNR TEMHLYNAET LKAQTKLEYF AVDRTNKKLL IKPKTNAFEA 

2 51 A Y KENY ALWM GPYKVSKGIK PTEGLMVDFS DIRPYGNHTG NSAPSVEADN 

3 01 SHEGYGYSDE WRQHRQGQP * 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 31 1 1>: 

g406 . seg 

1 ATGCGGGCAC GGCTGCTGAT AC CT AT T CT T TTTTCAGTTT TTATTTTATC 

51 CGCCTGCGGG ACACTGACAG GTATTCCATC GCATGGCGGA GGCAAACGCT 

101 TCGCGGTCGA ACAAGAACTT GTGGCCGCTT CTGCCAGAGC TGCCGTTAAA 

151 GACATGGATT TACAGGCATT ACACGGACGA AAAGT TG CAT TGTACATTGC 

2 01 AACTATGGGC GACCAAGGTT CAGGCAGTTT GACAGGGGGT CGCTACTCCA 

2 51 TTGATGCACT GATTCGCGGC GAATACATAA ACAGCCCTGC CGTCCGCACC 
301 GATTACACCT ATCCGCGTTA CGAAACCACC GCTGAAACAA CATCAGGCGG 

3 51 TTTGACGGGT TTAACCACTT CTTTATCTAC ACTTAATGCC CCTGCACTCT 

4 01 CGCGCACCCA ATCAGACGGT AGCGGAAGTA GGAGCAGTCT GGGCTTAAAT 
4 51 ATTGGCGGGA TGGGGGATTA TCGAAATGAA ACCTTGACGA CCAACCCGCG 
501 CGACACTGCC TTTCTTTCCC ACTTGGTGCA GACCGTATTT TTCCTGCGCG 
551 G CAT AG ACGT TGTTTCTCCT GCCAATGCCG ATACAGATGT GTTTATTAAC 
601 ATCGACGTAT TCGGAACGAT ACGCAACAGA ACCGAAATGC ACCTATACAA 
6 51 TGCCGAAACA CTGAAAGCCC AAACAAAACT GGAATATTTC GCAGTAGACA 
701 GAACCAATAA AAAATTGCTC ATCAAACCCA AAACCAATGC GTTTGAAGCT 
751 GCCTATAAAG AAAATTACGC ATTGTGGATG GGGCCGTATA AAGTAAGCAA 
801 AGGAATCAAA CCGACGGAAG GATTGATGGT CGATTTCTCC GATATCCAAC 
851 CATACGGCAA TCATACGGGT AACTCCGCCC CATCCGTAGA GGCTGATAAC 
901 AGTCATGAGG GGTATGGATA CAGCGATGAA GCAGTGCGAC AACATAGACA 
951 AGGGCAACCT TGA 



This corresponds to the amino acid sequence <SEQ ID 31 12; ORF 406>: 

g406 .pep 

1 MRARLLIPIL FSVFILSA CG TLTGIPSHGG GKRFAVEQEL VAASARAAVK 

51 DMDLQALHGR KVALYIATMG DQGSGSLTGG RYSIDALIRG EYINSPAVRT 

101 DYTYPRYETT AETTSGGLTG LTTSLSTLNA PALSRTQSDG SGSRSSLGLN 

151 IGGMGDYRNE TLTTNPRDTA FLSHLVQTVF FLRGIDWSP ANADTDVFIN 

2 01 IDVFGTIRNR TEMHLYNAET LKAQTKLEYF AVDRTNKKLL IKPKTNAFEA 
251 AYKENYALWM GPYKVSKGIK PTEGLMVDFS DIQPYGNHTG NSAPSVEADN 

3 01 SHEGYGYSDE AVRQHRQGQP * 



ORF 406 shows 98.8% identity over a 320 aa overlap with a predicted ORF (ORF406.a) from 
N. gonorrhoeae: 

g406/m406 

10 20 30 40 50 60 

g4 06 . pep MRARLLI PI LFSVFI LSACGTLTGI PS HGGGKRFAVEQELVAASARAAVKDMDLQALHGR 

I Ml I M II I I I MIMMII III Ml II I I II I MIMI III III I II I II! I M MM 

m4 06 MQARLL I PI LFSVFI LSACGTLTGI PS HGGGKRFAVEQELVAASARAAVKDMDLQALHGR 
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10 20 30 40 50 60 

70 80 90 100 110 120 

g4 0 6 pep KVALY I ATMGDQGSGSLTGGRYS I DALI RGE Y INS PAVRTDYTYPRYETTAETTSGGLTG 

I I 1 I I 1 1 1 ! 1 1 I 1 1 i 1 1 1 1 1 I I J I I 1 1 I ! I 1 1 1 1 1 I i 1 1 1 1 1 1 1 1 1 1 ! I I M I I I E 1 1 1 1 

m4 06 KVALY I ATMGDQGSGSLTGGRYS I DAL I RGE Y INS PAVRTDYTYPRYETTAETTSGGLTG 

70 80 90 100 110 120 



130 140 150 160 170 180 

g4 06 .pep LTTSLSTLNAPALSRTQSDGSGSRSSLGLNIGGMGDYRNETLTTNPRDTAFLSHLVQTVF 

1 1 1 1 1 1 i 1 1 1 1 1 1 i 1 1 e 1 1 1 1 1 1 = i i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 i 1 1 f 1 1 i 1 1 i 1 1 1 

m4 06 LTTSLSTLNAPALSRTQSDGSGSKSSLGLNIGGMGDYRNETLTTNPRDTAFLSHLVQTVF 

130 140 150 160 170 180 



190 200 210 220 230 240 

g406 . pep FLRGIDWSPANADTDVFINIDVFGTIRNRTEMHLYNAETLKAQTKLEYFAVDRTNKKLL 

I I f I 1 1 1 1 I 1 1 I I I I I 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 I I t I ! I t 1 4 1 I f 1 ! 1 1 1 1 1 1 I t 1 1 1 1 1 1 1 

m4 06 FLRGIDWSPANADTDVFINIDVFGTIRNRTEMHLYNAETLKAQTKLEYFAVDRTNKKLL 

190 200 210 220 230 240 



250 260 270 280 290 300 

g406 . pep I KP KTN AFE AA Y KENY AL WMG P Y KV S KG I KP TEG LMVD FSD I Q P YGNH TGN SAP S V E ADN 

I I ] 1 1 1 1 E i i 1 ! i 1 1 1 1 1 1 1 1 1 1 1 ! ! I 1 1 1 1 ^ 1 1 1 1 1 1 1 1 1 1 = 1 i 1 1 I 1 1 1 1 E 1 1 1 1 1 1 1 

m4 06 IKPKTNAFEAAYKENYALWMGPYKVSKGIKPTEGLMVDFSDIRPYGNHTGNSAPSVEADN 

250 260 270 280 290 300 



310 320 
g4 0 6 . pep SHEGYGYSDE AVRQHRQGQPX 

I I I I IIIM hi Ml : Mill 

m4 0 6 SHEGYGYSDEWRQHRQGQPX 

310 320 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 31 13>: 

a40 6 . seq 

1 ATGCAAGCAC GGCTGCTGAT ACCTATTCTT TTTTCAGTTT TTATTTTATC 

51 CGCCTGCGGG ACACTGACAG GTATTCCATC GCATGGCGGA GGTAAACGCT 

101 TCGCGGTCGA ACAAGAACTT GTGGCCGCTT CTGCCAGAGC TGCCGTTAAA 

151 GACATGGATT TACAGGCATT ACACGGACGA AAAGTTGCAT TGTACATTGC 

201 AACTATGGGC GACCAAGGTT CAGGCAGTTT GACAGGGGGT CGCTACTCCA 

251 TTGATGCACT GATTCGTGGC GAATACATAA ACAGCCCTGC CGTCCGTACC 

301 GATTACACCT ATCCACGTTA CGAAACCACC GCTGAAACAA CATCAGGCGG 

351 TTTGACAGGT TTAACCACTT CTTTATCTAC ACTTAATGCC CCTGCACTCT 

4 01 CGCGCACCCA ATCAGACGGT AGCGGAAGTA AAAGCAGTCT GGGCTTAAAT 

4 51 ATTGGCGGGA TGGGGGATTA TCGAAATGAA ACCTTGACGA CTAACCCGCG 

501 CGACACTGCC TTTCTTTCCC ACTTGGTACA GACCGTATTT TTCCTGCGCG 

551 GCATAGACGT TGTTTCTCCT GCCAATGCCG ATACGGATGT GTTTATTAAC 

601 ATCGACGTAT TCGGAACGAT ACGCAACAGA ACCGAAATGC ACCTATACAA 

651 TGCCGAAACA CTGAAAGCCC AAACAAAACT GGAATATTTC GCAGTAGACA 

7 01 GAACCAATAA AAAATTGCTC ATCAAACCAA AAACCAATGC GTTTGAAGCT 

7 51 GCCTATAAAG AAAATTACGC ATTGTGGATG GGACCGTATA AAGTAAGCAA 

801 AGGAATTAAA CCGACAGAAG GATTAATGGT CGATTTCTCC GATATCCAAC 

851 CATACGGCAA TCATATGGGT AACTCTGCCC CATCCGTAGA GGCTGATAAC 

901 AGTCATGAGG GGTATGGATA CAGCGATGAA GCAGTGCGAC GACATAGACA 

951 AGGGCAACCT TGA 



This corresponds to the amino acid sequence <SEQ ID 31 14; ORF 406. a>: 

a4 06 . pep 

1 MQARLLIPIL FSVFILSA CG TLTGI PSHGG GKRFAVEQEL VAASARAAVK 

51 DMDLQALHGR KVALYIATMG DQGSGSLTGG RYSIDALIRG EYINSPAVRT 

101 DYTYPRYETT AETTSGGLTG LTTSLSTLNA PALSRTQSDG SGSKSSLGLN 

151 IGGMGDYRNE TLTTNPRDTA FLSHLVQTVF FLRGIDWSP ANADTDVFIN 

201 IDVFGTIRNR TEMHLYNAET LKAQTKLEYF AVDRTNKKLL IKPKTNAFEA 
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251 AYKENYALWM GPYKVSKGIK PTEGLMVDFS DIQPYGNHMG NSAPSVEADN 
301 SHEGYGYSDE AVRRHRQGQP * 

m406/a406 ORFs 406 and 406. a showed a 98.8% identity in 320 aa overlap 

10 20 30 40 50 60 

m406 . pep MQARLLIPILFSVFILSACGTLTGIPSHGGGKRFAVEQELVAASARAAVKDMDLQALHGR 
i M I I I I I I i II I 1 I H I I M I It I I t I I ! M I I I I M I I it I I I i M I I! II II I I I M 
a 4 0 6 MQARLLIPILFSVFILSACGTLTGIPSHGGGKRFAVEQELVAASARAAVKDMDLQALHGR 

10 20 30 40 50 60 

70 80 90 100 110 120 

m406.pep KVALYIATMGDQGSGSLTGGRYSIDALIRGEYINSPAVRTDYTYPRYETTAETTSGGLTG 
I | | | | | I I I I I II I I I t I I I II I M I i I I I I II M I I I I 1 I I I M I I 1 I I I I I I I I I I I I 
a4 0 6 KVALYIATMGDQGSGSLTGGRYSIDALIRGEYINSPAVRTDYTYPRYETTAETTSGGLTG 

70 80 90 100 110 120 

130 140 150 160 170 180 

m4 06.pep LTTSLSTLNAPALSRTQSDGSGSKSSLGLNIGGMGDYRNETLTTNPRDTAFLSHLVQTVF 
I M I I I 1 I I M I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a4 0 6 LTTSLSTLNAPALSRTQSDGSGSKSSLGLNIGGMGDYRNETLTTNPRDTAFLSHLVQTVF 

130 140 150 160 170 180 

190 200 210 220 230 240 

m4 06.pep FLRGIDWSPANADTDVFINIDVFGTIRNRTEMHLYNAETLKAQTKLEYFAVDRTNKKLL 

M I I I I I I I I I I I M I I I I I I I I I I I M I 1 I M I I I I M II I I I I I I I M I I I I I I t I I I 

a4 0 6 FLRGIDWSPANADTDVFINIDV FGTIRNRTEMHLYNAETLKAQTKLEYFAVDRTNKKLL 

190 200 210 220 230 240 

250 260 270 280 290 300 

m4 06.pep IKPKTNAFEAAYKENYALWMGPYKVSKGIKPTEGLMVDFSDIRPYGNHTGNSAPSVEADN 
I I M I I M I I I II f I I I I I I I M I I I I I I I i I I I II I I I I I I : M I I I I I 1 I ! I I M I I 
a 4 0 6 IKPKTNAFEAAYKENYALWMGPYKVSKGIKPTEGLMVDFSDIQPYGNHMGNSAPSVEADN 

250 260 270 280 290 .300 

310 320 
m4 06.pep SHEGYGYSDEWRQHRQGQPX 
I It I I I I 1 I I : I I : I I I I II I 
a 4 0 6 SHEGYGYSDEAVRRHRQGQPX 

310 320 



EXAMPLE 2 
Expression of ORF 919 
The primer described in Table 1 for ORF 919 was used to locate and clone ORF 919. 
The predicted gene 919 was cloned in pET vector and expressed in E. coli. The product of 
protein expression and purification was analyzed by SDS-PAGE. In panel A) is shown the 
analysis of 919-His fusion protein purification. Mice were immunized with the purified 919- 
His and sera were used for Western blot (panel B), FACS analysis (panel C), bactericidal 
assay (panel D), and ELISA assay (panel E). Symbols: Ml, molecular weight marker; PP, 
purified protein, TP, N. meningitidis total protein extract; OMV, N. meningitidis outer 
membrane vesicle preparation. Arrows indicate the position of the main recombinant protein 
product (A) and the N. meningitidis immunoreactive band (B). These experiments confirm 
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that 919 is a surface-exposed protein and that it is a useful immunogen. The hydrophilicity 
plots, antigenic index, and amphipatic regions of ORF 919 are provided in Figure 10. The 
AMPHI program is used to predict putative T-cell epitopes (Gao et al 1989, J. Immunol 
143:3007; Roberts et al. 1996, AIDS Res Human Retroviruses 12:593; Quakyi et al. 1992, 
Scand J Immunol Suppl 1 1 :9). The nucleic acid sequence of ORP 919 and the amino acid 
sequence encoded thereby is provided in Example 1. 

EXAMPLE 3 
Expression of ORF 279 
The primer described in Table 1 for ORF 279 was used to locate and clone ORF 279. 
The predicted gene 279 was cloned in pGex vector and expressed in E. coli. The product of 
protein expression and purification was analyzed by SDS-PAGE. In panel A) is shown the 
analysis of 279-GST purification. Mice were immunized with the purified 279-GST and sera 
were used for Western blot analysis (panel B), FACS analysis (panel C), bactericidal assay 
(panel D), and ELISA assay (panel E). Symbols: Ml, molecular weight marker; TP, N. 
meningitidis total protein extract; OMV, N. meningitidis outer membrane vescicle preparation. 
Arrows indicate the position of the main recombinant protein product (A) and the N. 
meningitidis immunoreactive band (B). These experiments confirm that 279 is a surface- 
exposed protein and that it is a useful immunogen. The hydrophilicity plots, antigenic index, 
and amphipatic regions of ORF 279 are provided in Figure 1 1 . The AMPHI program is used 
to predict putative T-cell epitopes (Gao et al 1989, J. Immunol 143:3007; Roberts et al. 1996, 
AIDS Res Human Retroviruses 12:593; Quakyi et al. 1992, Scand J Immunol Suppl 1 1:9). 
The nucleic acid sequence of ORF 279 and the amino acid sequence encoded thereby is 
provided in Example 1 . 

EXAMPLE 4 
Expression of ORF 576 and 576-1 
The primer described in Table 1 for ORF 576 was used to locate and clone ORF 576. 
The predicted gene 576 was cloned in pGex vector and expressed in E. coli. The product of 
protein purification was analyzed by SDS-PAGE. In panel A) is shown the analysis of 576- 
GST fusion protein purification. Mice were immunized with the purified 576-GST and sera 
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were used for Western blot (panel B), FACS analysis (panel C), bactericidal assay (panel D), 
and ELISA assay (panel E). Symbols: Ml, molecular weight marker; TP, N. meningitidis 
total protein extract; OMV, N. meningitidis outer membrane vescicle preparation. Arrows 
indicate the position of the main recombinant protein product (A) and the N. meningitidis 
immunoreactive band (B).. These experiments confirm that ORF 576 is a surface-exposed 
protein and that it is a useful immunogen. The hydrophilicity plots, antigenic index, and 
amphipatic regions of ORF 576 are provided in Figure 12. The AMPHI program is used to 
predict putative T-cell epitopes (Gao et al 1989, J. Immunol 143:3007; Roberts et al. 1996, 
AIDS Res Human Retroviruses 12:593; Quakyi et al. 1992, Scand J Immunol Suppl 1 1:9). 
The nucleic acid sequence of ORF 576 and the amino acid sequence encoded thereby is 
provided in Example 1 . 

EXAMPLE 5 
Expression of ORF 519 and 519-1 
The primer described in Table 1 for ORF 519 was used to locate and clone ORF 519. 
The predicted gene 519 was cloned in pET vector and expressed in E. coli. The product of 
protein purification was analyzed by SDS-PAGE. In panel A) is shown the analysis of 519- 
His fusion protein purification. Mice were immunized with the purified 519-His and sera were 
used for Western blot (panel B), FACS analysis (panel C), bactericidal assay (panel D), and 
ELISA assay (panel E). Symbols: Ml, molecular weight marker; TP, N. meningitidis total 
protein extract; OMV, N. meningitidis outer membrane vesicle preparation. Arrows indicate 
the position of the main recombinant protein product (A) and the N. meningitidis 
immunoreactive band (B). These experiments confirm that 519 is a surface-exposed protein 
and that it is a useful immunogen. The hydrophilicity plots, antigenic index, and amphipatic 
regions of ORF 519 are provided in Figure 13. The AMPHI program is used to predict 
putative T-cell epitopes (Gao et al 1989, J. Immunol 143:3007; Roberts et al. 1996, AIDS Res 
Human Retroviruses 12:593; Quakyi et al. 1992, Scand J Immunol Suppl 1 1 :9). The nucleic 
acid sequence of ORF 519 and the amino acid sequence encoded thereby is provided in 
Example 1. 
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EXAMPLE 6 
Expression ofORF 121 and 121-1 
The primer described in Table 1 for ORF 121 was used to locate and clone ORF 121. 
The predicted gene 121 was cloned in pET vector and expressed in E, coli. The product of 
protein purification was analyzed by SDS-PAGE. In panel A) is shown the analysis of 121- 
His fusion protein purification. Mice were immunized with the purified 121 -His and sera were 
used for Western blot analysis (panel B)> FACS analysis (panel C), bactericidal assay (panel 
D), and ELISA assay (panel E). Results show that 121 is a surface-exposed protein. Symbols: 
Ml, molecular weight marker; TP, N. meningitidis total protein extract; OMV, N. 
meningitidis outer membrane vescicle preparation. Arrows indicate the position of the main 
recombinant protein product (A) and the N. meningitidis immunoreactive band (B). These 
experiments confirm that 121 is a surface-exposed protein and that it is a useful immunogen. 
The hydrophilicity plots, antigenic index, and amphipatic regions of ORF 121 are provided in 
Figure 14. The AMPHI program is used to predict putative T-cell epitopes (Gao et al 1989, 
Immunol 143:3007; Roberts et al. 1996, AIDS Res Human Retroviruses 12:593; Quakyi et al. 
1992, Scand J Immunol Suppl 1 1 :9). The nucleic acid sequence of ORF 121 and the amino 
acid sequence encoded thereby is provided in Example 1. 

EXAMPLE 7 
Expression of ORF 128 and 128-1 
The primer described in Table 1 for ORF 128 was used to locate and clone ORF 128. 
The predicted gene 128 was cloned in pET vector and expressed in E. coli. The product of 
protein purification was analyzed by SDS-PAGE. In panel A) is shown the analysis of 128- 
His purification. Mice were immunized with the purified 128-His and sera were used for 
Western blot analysis (panel B), FACS analysis (panel C), bactericidal assay (panel D) and 
ELISA assay (panel E). Results show that 128 is a surface-exposed protein. Symbols: Ml, 
molecular weight marker; TP, N. meningitidis total protein extract; OMV, N. meningitidis 
outer membrane vesicle preparation. Arrows indicate the position of the main recombinant 
protein product (A) and the N. meningitidis immunoreactive band (B). These experiments 
confirm that 128 is a surface-exposed protein and that it is a useful immunogen. The 
hydrophilicity plots, antigenic index, and amphipatic regions of ORF 128 are provided in 
Figure 15. The AMPHI program is used to predict putative T-cell epitopes (Gao et al 1989, J. 
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Immunol 143:3007; Roberts et al. 1996, AIDS Res Human Retroviruses 12:593; Quakyi et al. 
1992, Scand J Immunol Suppl 1 1 :9). The nucleic acid sequence of ORF 128 and the amino 
acid sequence encoded thereby is provided in Example 1. 

EXAMPLE 8 
Expression of ORF 206 
The primer described in Table 1 for ORF 206 was used to locate and clone ORF 206. 
The predicted gene 206 was cloned in pET vector and expressed in E. coli. The product of 
protein purification was analyzed by SDS-PAGE. In panel A) is shown the analysis of 206- 
His purification. Mice were immunized with the purified 206-His and sera were used for 
Western blot analysis (panel B). It is worthnoting that the immunoreactive band in protein 
extracts from meningococcus is 38 kDa instead of 17 kDa (panel A). To gain information on 
the nature of this antibody staining we expressed ORF 206 in E. coli without the His-tag and 
including the predicted leader peptide. Western blot analysis on total protein extracts from E. 
coli expressing this native form of the 206 protein showed a recative band at a position of 38 
kDa, as observed in meningococcus. We conclude that the 38 kDa band in panel B) is specific 
and that anti-206 antibodies, likely recognize a multimeric protein complex. In panel C is 
shown the FACS analysis, in panel D the bactericidal assay, and in panel E) the ELISA assay. 
Results show that 206 is a surface-exposed protein. Symbols: Ml, molecular weight marker; 
TP, N. meningitidis total protein extract; OMV, N. meningitidis outer membrane vesicle 
preparation. Arrows indicate the position of the main recombinant protein product (A) and the 
N. meningitidis immunoreactive band (B). These experiments confirm that 206 is a surface- 
exposed protein and that it is a useful immunogen. The hydrophilicity plots, antigenic index, 
and amphipatic regions of ORF 519 are provided in Figure 16. The AMPHI program is used 
to predict putative T-cell epitopes (Gao et al 1989, J. Immunol 143:3007; Roberts et al. 1996, 
AIDS Res Human Retroviruses 12:593; Quakyi et al. 1992, Scand J Immunol Suppl 1 1:9). 
The nucleic acid sequence of ORF 206 and the amino acid sequence encoded thereby is 
provided in Example 1 . 
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EXAMPLE 9 
Expression of ORF 287 
The primer described in Table 1 for ORF 287 was used to locate and clone ORF 287. 
The predicted gene 287 was cloned in pGex vector and expressed in E. coli. The product of 
protein purification was analyzed by SDS-PAGE. In panel A) is shown the analysis of 287- 
GST fusion protein purification. Mice were immunized with the purified 287-GST and sera 
were used for FACS analysis (panel B), bactericidal assay (panel C), and ELISA assay (panel 
D). Results show that 287 is a surface-exposed protein. Symbols: Ml, molecular weight 
marker. Arrow indicates the position of the main recombinant protein product (A). These 
experiments confirm that 287 is a surface-exposed protein and that it is a useful immunogen. 
The hydrophilicity plots, antigenic index, and amphipatic regions of ORF 287 are provided in 
Figure 17. The AMPHI program is used to predict putative T-cell epitopes (Gao et al 1989, J. 
Immunol 143:3007; Roberts et al. 1996, AIDS Res Human Retroviruses 12:593; Quakyi et al. 
1 992, Scand J Immunol Suppl 1 1 :9). The nucleic acid sequence of ORF 287 and the amino 
acid sequence encoded thereby is provided in Example 1 . 

EXAMPLE 10 
Expression of ORF 406 
The primer described in Table 1 for ORF 406 was used to locate and clone ORF 406. 
The predicted gene 406 was cloned in pET vector and expressed in E. coli. The product of 
protein purification was analyzed by SDS-PAGE. In panel A) is shown the analysis of 406- 
His fusion protein purification. Mice were immunized with the purified 406-His and sera were 
used for Western blot analysis (panel B), FACS analysis (panel C), bactericidal assay (panel 
D), and ELISA assay (panel E). Results show that 406 is a surface-exposed protein. Symbols: 
Ml, molecular weight marker; TP, N. meningitidis total protein extract; OMV, M 
meningitidis outer membrane vescicle preparation. Arrows indicate the position of the main 
recombinant protein product (A) and the N. meningitidis immunoreactive band (B). These 
experiments confirm that 406 is a surface-exposed protein and that it is a useful immunogen. 
The hydrophilicity plots, antigenic index, and amphipatic regions of ORF 406 are provided in 
Figure 18. The AMPHI program is used to predict putative T-cell epitopes (Gao et al 1989, J. 
Immunol 143:3007; Roberts et al. 1996, AIDS Res Human Retroviruses 12:593; Quakyi et al. 
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1992, Scand J Immunol Suppl 1 1 :9). The nucleic acid sequence of ORF 406 and the amino 
acid sequence encoded thereby is provided in Example 1. 

EXAMPLE 1 1 

Table 2 lists several Neisseria strains which were used to assess the conservation of 
the sequence of ORF 225 among different strains. 



Table 2 



225 gene variability: List of used Neisseria strains 


Identification Strains 


Source / reference 


number 




Group B 




zo01_225 NG6/88 


R. Moxon / Seiler et al, 1996 


zo02 225 BZlys 


iv. ivioxon / oener et at., iyyo 


zo03 225 NG3/88 


R. Moxon / Seiler et al, 1996 


zo04 225 297-0 


R. Moxon / Seiler et al, 1996 


zo05 225 1000 


R. Moxon / Seiler et al, 1996 


zo06_225 BZ147 


R. Moxon / Seiler et al, 1996 


zo07 225 BZ169 


K. Moxon / cseiier et al., iyyo 


zo08 225 528 


R. Moxon / Seiler et al, 1996 


zo09 225 NGP165 


R. Moxon / Seiler et al, 1996 


zolO 225 BZ133 


R. Moxon / Seiler et al, 1996 


zoll 225 NGE31 


R. Moxon / Seiler et al, 1996 


zol2 225 NGF26 


R. Moxon / Seiler et al, 1996 


zol3 225 NGE28 


R. Moxon / Seiler et al, 1996 


zol4 225 NGH38 


R. Moxon / Seiler et al, 1996 


zol5 225 SWZ107 


R. Moxon / Seiler et al, 1996 


zol6 225 NGH15 


R. Moxon / Seiler et al, 1996 


zol7 225 NGH36 


R. Moxon / Seiler et al, 1996 


zol8 225 BZ232 


R. Moxon / Seiler et al, 1996 


zol9 225 BZ83 


R. Moxon / Seiler et al, 1996 


zo20 225 44/76 


R. Moxon / Seiler et al, 1996 


zo21 225 MC58 


R. Moxon 


zo96_225 2996 


Our collection 


Group A 




zo22 225 205900 


R. Moxon 


zo23 225 F6124 


R. Moxon 


z2491 Z2491 


R. Moxon / Maiden et al, 1998 


Group C 




zo24 225 90/18311 


R. Moxon 


zo25_225 93/4286 


R. Moxon 
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Others 

zo26_225 A22 (group W) R. Moxon / Maiden et aL, 1 998 
zo27_225 E26 (group X) R. Moxon / Maiden et aL, 1998 
zo28_225 860800 (group Y) R. Moxon / Maiden et aL, 1998 
zo29_225 E32 (group Z) R. Moxon / Maiden et aL, 1998 

Gonococcus 

zo32_225 Ng F62 R. Moxon / Maiden et aL, 1998 

zo33_225 Ng SN4 R. Moxon 

fal 090 FA1 090 R. Moxon 



References: 

Seiler A. et aL, Mol. Microbiol., 1996, 19(4):841-856. 
Maiden et aL, Proc. Natl. Acad. Sci. USA, 1998, 95:3140-3145. 



The amino acid sequences for each listed strain are as follows: 

>FA1090 <SEQ ID 3115> 

M D S F FK P AVW A V L W LM FA VR P ALA DE LTN L L S S RE Q I LRQ FAE DEQP V L P VN RA P ARRAG 
NADELIGSAMGLNEQPVLPVNRAPARRAGNADELIGSAMGLLGIAYRYGGTSVSTGFDCS 
GFMQHIFKRAMGINLPRTSAEQARMGAPVARSELQPGDMVFFRTLGGSRISHVGLYIGNN 
RFIHAPRTGKNIEITSLSHKYWSGKYAFARRVKKNDPSRFLN* 

Z2491 <SEQ ID 3116> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRVPARRAGNA 
DELIGKAMGLNEQPVLPVNRAPARRAGNADELIGNAMGLLGIAYRYGGTSISTGFDCSGF 
MQHIFKRAMGINLPRTSAEQARMGTPVARSELQPGDMVFFRTLGGSRISHVGLYIGNNRF 
I HAPRTGKN IE I T S LSHKYWSGKYAFARRVKKNDPSRFLN * 

2O01_225 <SEQ ID 3117> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSISTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO02_225 <SEQ ID 3118> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFI HAPRTGKN IE ITS LSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO03_225 <SEQ ID 3119> 

MDSFFKPAVWAVLWLMFAVRLALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFI HAPRTGKN IE IT SLSHKYWSGKYAFARR 
VKKNDPSRFLN* 
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ZO04_225 <SEQ ID 3120> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRV PARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO05_225 <SEQ ID 3121> 

MDSFFKPAVWAVLWLMFAVR PALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGSAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSISTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO06_225 <SEQ ID 3122> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO07_225 <SEQ ID 3123> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO08_225 <SEQ ID 3124> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGSAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSISTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN * 

ZO09_225 <SEQ ID 3125> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSISTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZOlO_225 <SEQ ID 3126> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZOH_225 <SEQ ID 3127> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLNEQPVLPVNRAPARRAGNADELIGNAMGLLGIAYRYGGTSVSTGFDCSGF 
MQHIFKRAMGINLPRTSAEQARMGTPVARSELQPGDMVFFRTLGGSRISHVGLYIGNNRF 
I H A PRT GKN I E I T S L S HK Y W S G K Y A FARRVKKN D P S R FLN * 

Z012_225 <SEQ ID 3128> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRV PARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSISTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO!3_225 <SEQ ID 3129> 
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MDS FFKPAVWAVLWLM FAVRPALADELTNLLS SREQ I LRQFAE DEQPVLPVNRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFIQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZOl4_225 <SEQ ID 3130> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z015_225 <SEQ ID 3131> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLLGIAYRYGGTSVSTGFDCS 
GFMQHIFKRAMGINLPRTSAEQARMGTPVARSELQPGDMVFFRTLGGSRISHVGLYIGNN 
RFIHAPRTGKN IE IT SLSHKYWSGKYAFARR VKKNDPSRFLN* 

Z016_225 <SEQ ID 3132> 

MDS FFKPAVWAVLWLM FAVRPALADELTNLLS SREQ I LRQFAE DEQPVLP INRAPARRAG 
NADELIGSAMGLNEQPVLPVNRV PARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO!7_225 <SEQ ID 3133> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLS SREQ I LRQFAE DEQPVLP INRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO!8_225 <SEQ ID 3134> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLS SREQ I LRQFAE DEQPVLP INRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z019_225 <SEQ ID 3135> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLS SREQ I LRQFAE DEQPVLP INRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGT PVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO20_225 <SEQ ID 3136> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLS SREQ I LRQFAE DEQPVLP INRAPARRAG 
NADELIGSAMGLNEQPVLPINRAPARRAGNADELIGSAMGLNEQPVLPVNRVPARRAGNA 
DELIGNAMGLNEQPVLPVNRAPARRAGNADELIGNAMGLLGIAYRYGGTSVSTGFDCSGF 
MQHIFKRAMGINLPRTSAEQARMGTPVARSELQPGDMVFFRT LGGSRISHVGLYIGNNRF 
IHAPRTGKNIEITSLSHKYWSGKYAFARRVKKNDPSRFLN* 

Z021_225 <SEQ ID 3137> 

MDS FFKPAVWAVLWLMFAVRPALADELTNLLS SREQI LRQ FAE DEQPVLP INRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z022_225 <SEQ ID 3138> 

MDS FFKPAVWAVLWLM FAVRPALADELTNLLS SREQI LRQ FAE DEQPVLP INRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSISTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 



WO 99/057280 



PCT/US99/09346 



121 



SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z023_225 <SEQ ID 3139> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSISTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z024_225 <SEQ ID 3140> 

MDSFFKPAVWAVLWLM FAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSISTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z025_225 <SEQ ID 3141> 

MDSFFKPAWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELI GNAMGLLGIAYRYGGTS I STGFDCSGFMQHI FKRAMG INLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z026_225 <SEQ ID 3142> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DEL I GNAMGLLGIAYRYGGTS I STGFDCSGFMQHI FKRAMG INLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z027_225 <SEQ ID 3143> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DEL IGNAMGLLG I AYRYGGTSVSTGFDCSGFMQH I FKRAMG INLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z028_225 <SEQ ID 3144> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAFARRAGNA 
DEL I GNAMGLLG I AYRYGGTSVSTGFDCSGFMQH I FKRAMG INLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z029_225 <SEQ ID 3145> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHI FKRAMG INLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z032_225 <SEQ ID 3146> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPVNRAPARRAG 
NADELIGSAMGLNEQPVLPVNRAPARRAGNADELIGSAMGLLGIAYRYGGTSVSTGFDCS 
GFMQH I FKRAMG INLPRTSAEQARMGAPVARSELQPGDMVFFRTLGGSRISHVGLYIGNN 
RFIHAPRTGKNIEITSLSHKYWSGKYAFARRVKKNDPSRFLN* 

Z033_225 <SEQ ID 3147> 

MDSFFKPAVWAVLWLMFAVRSALADELTNLLSSREQILRQFAEDEQPVLPVNRAPARRAG 
NADELIGSAMGLNEQPVLPVNRAPARRAGNADELIGSAMGLLGIAYRYGGTSVSTGFDCS 
GFMQH I FKRAMG INLPRTSAEQARMGAPVARSELQPGDMVFFRTLGGSRISHVGLYIGNN 
RFIHAPRTGKNIEITSLSHKYWSGKYAFARRIKKNDPSRFLN* 

Z096 225 <SEQ ID 3148> 
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MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAE DEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSISTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 



Figure 19 shows the results of aligning the sequences of each of these strains. Dark 
shading indicates regions of homology, and gray shading indicates the conservation of amino 
acids with similar characteristics. As is readily discernible, there is significant conservation 
among the various strains of ORF 225, further confirming its utility as an antigen for both 
vaccines and diagnostics. 



EXAMPLE 12 



Table 3 lists several Neisseria strains which were used to assess the conservation of 
the sequence of ORF 235 among different strains. 



Table 3 



235 gene variability: List of used Neisseria strains 


Identification Strains 


Reference 


number 






Group B 




gnmzqOl 


NG6/88 


Seiler et al., 1996 


gnmzq02 


BZ198 


Seiler al., 1996 


gnmzq03 


NG3/88 


Seiler et al., 1996 


gnmzq04 


1000 


Seiler et al., 1996 


gnmzq05 


1000 


Seiler et ah, 1996 


gnmzq07 


BZ169 


Seiler et al., 1996 


gnmzq08 


528 


Seiler et al., 1996 


gnmzq09 


NGP165 


Seiler et al., 1996 


gnmzqlO 


BZ133 


Seiler et al., 1996 


gnmzql 1 


NGE31 


Seiler et al., 1996 


gnmzql3 


NGE28 


Seiler et al., 1996 


gnmzql 4 


NGH38 


Seiler et al., 1996 


gnmzql 5 


SWZ107 


Seiler et al., 1996 


gnmzql 6 


NGH15 


Seiler et al., 1996 


gnmzql 7 


NGH36 


Seiler et al., 1996 


gnmzql 8 


BZ232 


Seiler et al, 1996 


gnmzql 9 


BZ83 


Seiler et al., 1996 


gnmzq21 


MC58 


Virjie* al., 1992 


Group A 




gnmzq22 


205900 


Our collection 
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gnmzq23 F6124 
z2491 Z2491 


Our collection 
Maiden etaL, 1998 




Group C 

gnmzq24 90/18311 
gnmzq25 93/4286 


Our collection 
Our collection 




Others 

gnmzq26 A22 (group W) Maiden et aL, 1998 
gnmzq27 E26 (group X) Maiden et al, 1998 
gnmzq28 860800 (group Y) Maiden et aL, 1998 
gnmzq29 E32 (group Z) Maiden et al, 1998 
gnmzq31 N. lactamica Our collection 




Gonococcus 

gnmzq32 Ng F62 
gnmzq33 Ng SN4 


Maiden et aL, 1998 
Our collection 




fal090 FA1090 


Dempsey et aL 1991 




References: 






Seiler A. etat., Mol. Microbiol., 1996, 19(4):841-856. 

Maiden R. et aL, Proc. Natl. Acad. Sci. USA, 1998, 95:3140-3145. 

Virji M. etaL, Mol. Microbiol., 1992, 6:1271-1279 

Dempsey J.F. et aL, J. Bacteriol., 1991, 173:5476-5486 





The amino acid sequences for each listed strain are as follows: 



FA1090 <SEQ ID 3149> 

MKPLILGLAAVLALSACQVRKAPDLDYTSFKESKPASILWPPLNESPDVNGTWGMLAST 
AAPISEAGYYVFPAAWEETFKENGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVGAVVNQIANSLT 
DRGYQVSKTAAYNLLSPYSRNGILKGPRFVEEQPK* 

GNMZQ01 <SEQ ID 3150> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKENGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQI LDS VTTVS AKARLV DSRNGKE LWSGS AS I REGSNN SN SGLLGALVS AWNQI ANNLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ02 <SEQ ID 3151> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK+ 

GNMZQ03 <SEQ ID 3152> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPAS I LWPPLNES PDVNGTWGVLAST 
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AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSAS IREGSNNSNSGLLGALVSAVVNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ04 <SEQ ID 3153> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSAS IREGSNNSNSGLLGALVSAVVNQIANSLT 
DRGYQVSKTAAYNLLS PYSHNGILKGPRFVEEQPK* 

GNMZQ05 <SEQ ID 3154> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILVVPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAVVEETFKENGLTNAADIHAVRPEKLHQIFGN DAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAVVNQIANNLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ07 <SEQ ID 3155> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILVVPPLNESPDVNGTWGVLAST 
AAPLSEAGY YVFPAAVVEETFKQNGLTNAAD I HAVRPEKLHQ I FGN DAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAVVNQIANSLT 
DRGYQVSKTAAYNLLS PYSHNGI LKGPRFVEEQPK* 

GNMZQ08 <SEQ ID 3156> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILVVPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKENGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAVVNQIANNLT 
DRGYQVSKTAAYNLLS PYSHNGI LKGPRFVEEQPK* 

GNMZQ09 <SEQ ID 3157> 

MKPLILGLAAALVLSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGMLAST 
AE PLS EAGY YVFPAAVVEETFKQNGLTNAAD I HAVQPEKLHQI FGN DAVLY IT I TE YGT S 
YQILDSVTTVSARARLVDSRNGKVLWSGSASIREGSNNSNSGLLGALVSAVVNQIANSLT 
DRGYQVSKTAAYNLLS PYSHNGI LKGPRFVEEQPK* 

GNMZQ10 <SEQ ID 3158> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLS PYSHNGILKGPRFVEEQPK* 

GNMZQ11 <SEQ ID 3159> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAVVEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLS PYSHNGILKGPRFVEEQPK* 

GNMZQ13 <SEQ ID 3160> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGN DAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLS PYSHNG I LKG PRFVEEQPK* 

GNMZQ14 <SEQ ID 3161> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADI HAVRPEKLHQ I FGN DAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVGAVVNQIANSLT 
DRGYQVSKTAAYNLLS PYSHNGILKGPRFVEEQPK* 

GNMZQ15 <SEQ ID 3162> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADI HAVRPEKLHQ I FGNDAVLY I TVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLS PYSHNGILKGPRFVEEQPK* 



GNMZQ16 <SEQ ID 3163> 
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MKPLI LGLAAVLALSACQVQKAPDFDYTS FKESKPAS I LVVPPLNES PDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQI FGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAVVNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ17 <SEQ ID 3164> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILVVPPLNES PDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEET FKQNGLTN AADI HAVRPEKLHQI FGN DAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKE LWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ18 <SEQ ID 3165> 

MKPLI LGLAAVLALSACQVQKAPDFDYTS FKESKPAS I LVVPPLNES PDVNGTWGVLAST 
AAPLSE AG YYVFPAAVVEETFKQNGLTNAAD I HAVRPEKLHQIFGN DAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVGAVVNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ19 <SEQ ID 3166> 

MKPLI LGLAAVLALS ACQVQKAPDFDYT S FKESKPAS I LVVPPLNES PDVNGTWGVLAST 
AAPLSE AG YYVFPAAVVEETFKQNGLTNAAD I HAVRPEKLHQIFGN DAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAVVNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ21 <SEQ ID 3166> 

MKPLI LGLAAVLALSACQVQKAPDFDYTS FKESKPAS I LVVPPLNES PDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ22 <SEQ ID 3167> 

MKPLI LGLAAVLALS ACQVQKAPDFDYT S FKE SKPAS I LVVPPLNE S PDVNGTWGVLAST 
AAPLS EAGY YVFPAAWEET FKQNGLTN AADI HAVRPEKLHQI FGN DAVLY IT VTEYGT S 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ23 <SEQ ID 3168> 

MKPLI LGLAAVLALSACQVQKAPDFDYTS FKESKPAS I LVVPPLNES PDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADI HAVRPEKLHQI FGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ24 <SEQ ID 3169> 

MKPLI LGLAAVLALS ACQVQKAPDFDYT S FKESKPAS I LWP PLNE S PDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEET FKQNGLTN AADI HAVRPEKLHQI FGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ25 <SEQ ID 3170> 

MKPLI LGLAAVLALSACQVQKAPDFDYTS FKESKPAS I LVVPPLNES PDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQI FGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAVVNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ26 <SEQ ID 3171> 

MKPLI LGLAAVLALSACQVQKAPDFDYTS FKESKPAS I LVVPPLNES PDVNGTWGMLAST 
AAPLSEAGYYVFPAAWEET FKQNGLTN AADI HAVRPEKLHQI FGN DAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVGAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKG PRFVEEQPK* 

GNMZQ27 <SEQ ID 3172> 

MKPLI LGLAAVLALSACQVQKAPDFDYTS FKESKPAS ILWPPLNES PDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQI FGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 
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Group B 



287_2 
287_9 
287_14 
287 21 



BZ198 
NGP165 
NGH38 
MC58 



Seiler e/ a/., 1996 
Seiler e/ a/., 1996 
Seiler et al. 9 1996 
Virji etal, 1992 



z2491 



Group A 

Z2491 



Maiden et aL, 1998 



fal090 



Gonococcus 

FA 1090 



Dempsey et al. 1991 



References: 

Seiler A. et al., Mol. Microbiol., 1996, 19(4):841-856. 

Maiden R. et al 9 Proc. Natl. Acad. Sci. USA, 1998, 95:3140-3145. 

Virji M. et a/., Mol. Microbiol., 1992, 6:1271-1279 

Dempsey J.F. et al, J. Bacteriol., 1991, 173:5476-5486 



The amino acid sequences for each listed strain are as follows: 



287_14 <SEQ ID 3179> 

M FKRS V I AMAC I FAL S ACGGGGGG S P DVK S ADT L S K P AA P W S E KE T E AKE DA PQAG S QG 
QGAPSAQGGQDMAAVSEENTGNGGAAATDKPKNEDEGAQNDMPQNAADTDSLT PNHTPAS 
NMPAGNMENQAPDAGESEQPANQPDMANTADGMQGDDPSAGGENAGNTAAQGTNQAENNQ 
TAGSQNPASSTNPSATNSGGDFGRTNVGNSWIDGPSQNITLTHCKGDSCSGNNFLDEEV 
QLKSEFEKLSDADKISNYKKDGKNDGKNDKFVGLVADSVQMKGINQYIIFYKPKPTSFAR 
FRRSARSRRSLPAEMPLIPVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRYLTYGAEKLP 
GGSYALRVQGEPSKGEMLAGTAVYNGEVLHFHTENGRPSPSRGRFAAKVDFGSKSVDGII 
DSGDGLHMGTQKFKAAIDGNGFKGTWTENGGGDVSGKFYGPAGEEVAGKYSYRPTDAEKG 
GFGVFAGKKEQD* 

287_2 <SEQ ID 3180> 

MFKRS VI AMAC I FAL S ACGGGGGG SPDVKS ADT LSKPAAPVVSEKETE AKE DAPQAGSQG 
QGAPSAQGGQDMAAVSEENTGNGGAAATDKPKNEDEGAQN DMPQNAADTDSLTPNHT PAS 
NMPAGNMENQAPDAGESEQPANQPDMANTADGMQGDDPSAGGENAGNTAAQGTNQAENNQ 
TAGSQNPASSTNPSATNSGGDFGRTNVGNSWIDGPSQNITLTHCKGDSCSGNNFLDEEV 
QLKSEFEKLSDADKISNYKKDGKNDGKNDKFVGLVADSVQMKGINQYIIFYKPKPTSFAR 
FRRSARSRRSLPAEMPLI PVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRYLTYGAEKLP 
GGSYALRVQGEPSKGEMLAGTAVYNGEVLHFHTENGRPSPSRGRFAAKVDFGSKSVDGII 
DSGDGLHMGTQKFKAAIDGNGFKGTWTENGGGDVSGKFYGPAGEEVAGKYSYRPTDAEKG 
GFGVFAGKKEQD* 

287_21. <SEQ ID 3181> 

MFKRSVIAMACI FALSACGGGGGGSPDVKSADTLSKPAAPVVSEKETEAKEDAPQAGSQG 
QGAPSAQGSQDMAAVSEENTGNGGAVTADNPKNEDEVAQN DMPQNAAGTDSSTPNHTPDP 
N M LAG NM E N Q AT DAG E S S Q PAN Q P DMAN AA DGMQGDDPS AG GQN AG N T AAQG AN Q AGNN Q 
AAGSSDPIPASNPAPANGGSNFGRVDLANGVLIDGPSQNITLTHCKGDSCSGNNFLDEEV 
QLKSEFEKLSDADKISNYKKDGKNDKFVGLVADSVQMKGINQYIIFYKPKPTSFARFRRS 
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ARSRRSLPAEMPLIPVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRYLTYGAEKLPGGSY 
ALRVQGEPAKGEMLAGAAVYNGEVLHFHTENGRPYPTRGRFAAKVDFGSKSVDGI IDSGD 
DLHMGTQKFKAAIDGNGFKGTWTENGSGDVSGKFYGPAGEEVAGKYSYRPTDAEKGGFGV 
FAGKKEQD* 

287_9 <SEQ ID 3182> 

MFKRSVIAMACIVALSACGGGGGGSPDVKSADTLSKPAAPWTEDVGEEVLPKEKKDEEA 
VSGAPQADTQDATAGKGGQDMAAVSAENTGNGGAATTDNPENKDEGPQNDMPQNAADTDS 
STPNHTPAPNMPTRDMGNQAPDAGESAQPANQPDMANAADGMQGDDPSAGENAGNTADQA 
ANQAENNQVGGSQN PASSTN PNATNGGSDFGRINVANGIKLDSGSENVTLTHCKDKVCDR 
DFLDEEAPPKSEFEKLSDEEKINKYKKDEQRENFVGLVADRVEKNGTNKYVI IYKDKSAS 
SSSARFRRSARSRRSLPAEMPLI PVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRYLTYG 
AEKLSGGSYALSVQGEPAKGEMLAGTAVYNGEVLH FHMENGRPSPSGGRFAAKVDFGSKS 
VDGIIDSGDDLKMGTQKFKAVIDGNGFKGTWTENGGGDVSGRFYGPAGEEVAGKYSYRPT 
DAEKGGFGVFAGKKEQD* 

FA1090 <SEQ ID 3183> 

MFKRSVIAMACIFPLSACGGGGGGSPDVKSADTPSKPAAPWAENAGEGVLPKEKKDEEA 
AGGAPQADTQDATAGEGSQDMAAVSAENTGNGGAATTDNPKNEDAGAQNDMPQNAAESAN 
QTGNNQPAGSSDSAPASNPAPANGGSDFGRTNVGNSWIDGPSQNITLTHCKGDSCNGDN 
LLDEEAPSKSEFEKLSDEEKIKRYKKDEQRENFVGLVADRVKKDGTNKYIIFYTDKPPTR 
SARSRRSLPAEIPLIPVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRYLTYGAEKLPGGS 
YALRVQGEPAKGEMLVGTAVYNGEVLHFHMENGRPYPSGGRFAAKVDFGSKSVDGI IDSG 
DDLHMGTQKFKAAIDGNGFKGTWTENGGGDVSGRFYGPAGEEVAGKYSYRPT DAEKGGFG 
VFAGKKDRD* 

Z2491 <SEQ ID 3184> 

MFKRSVIAMACIFALSACGGGGGGSPDVKSADTLSKPAAPWSEKETEAKEDAPQAGSQG 
QGAPSAQGSQDMAAVSEENTGNGGAVTADN PKNEDEVAQNDMPQNAAGTDSSTPNHTPDP 
NMLAGNMENQATDAGESSQPANQPDMANAADGMQGDDPSAGGQNAGNTAAQGANQAGNNQ 
AAGSSDPIPASNPAPANGGSNFGRVDLANGVLIDGPSQNITLTHCKGDSCSGNNFLDEEV 
QLKSEFEKLSDADKISNYKKDGKNDKFVGLVADSVQMKGINQYIIFYKPKPTSFARFRRS 
ARSRRSLPAEMPLIPVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRYLTYGAEKLPGGSY 
ALRVQGEPAKGEMLAGAAVYNGEVLHFHTENGRPYPTRGRFAAKVDFGSKSVDGI IDSGD 
DLHMGTQKFKAAIDGNGFKGTWTENGSGDVSGKFYGPAGEEVAGKYSYRPTDAEKGGFGV 
FAGKKEQD* 



Figure 21 shows the results of aligning the sequences of each of these strains. Dark 
shading indicates regions of homology, and gray shading indicates the conservation of amino 
acids with similar characteristics. As is readily discernible, there is significant conservation 
among the various strains of ORF 287, further confirming its utility as an antigen for both 
vaccines and diagnostics. 



EXAMPLE 14 



Table 5 lists several Neisseria strains which were used to assess the conservation of 
the sequence of ORF 519 among different strains. 
Table 5 



519 gene variability: List of used Neisseria strains 
Identification Strains Source / reference 
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number 








Group B 




zv01_ 


519 


NG6/88 


R. Moxon / Seiler et al., 1996 


zv02_ 


.519 


BZ198 


R. Moxon / Seiler et al, 1996 


zv03_ 


519ass 


NG3/88 


R. Moxon / Seiler et al., 1996 


zv04_ 


519 


297-0 


R. Moxon / Seiler et al., 1996 


zv05~ 


519 


1000 


R. Moxon / Seiler et al., 1996 


zv06 


519ass 


BZ147 


R. Moxon / Seiler et al., 1996 


zv07~ 


"519 


BZ169 


R. Moxon / Seiler et al., 1996 


zvlf 


519 


NGE31 


R. Moxon / Seiler et al., 1996 


zvl2 


519 


NGF26 


R. Moxon / Seiler et al., 1996 


zvl8_ 


"519 


BZ232 


R. Moxon / Seiler et al., 1996 


zvl9_ 


.519 


BZ83 


R. Moxon / Seiler et al., 1996 


zv20 


519ass 


44/76 


R. Moxon / Seiler et al., 1996 


zv21~ 


~519ass 


MC58 


R. Moxon 


zv96_ 


"519 


2996 


Our collection 



Group A 

zv22_5 1 9ass 205900 R. Moxon 

z2491 519 Z2491 R. Moxon / Maiden et al, 1998 



Others 

zv26_5 1 9 A22 (group W) R. Moxon / Maiden et al , 1 998 
zv27_519 E26 (group X) R. Moxon / Maiden et al, 1998 
zv28_519 860800 (group Y) R. Moxon / Maiden et al, 1998 
zv29_519ass E32 (group Z) R. Moxon / Maiden et al, 1998 

Gonococcus 

zv32_5 1 9 Ng F62 R. Moxon / Maiden et al, 1 998 

fal090 519 FA1090 R. Moxon 



References: 

Seiler A. et al, Mol. Microbiol., 1996, 19(4):841-856. 
Maiden et al., Proc. Natl. Acad. Sci. USA, 1998, 95:3140-3145. 



The amino acid sequences for each listed strain are as follows: 



FA1090_519 <SEQ ID 3185> 

MEFFIILLAAVAVFGFKSFWIPQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTWSALDEAAGAWGVKVLRYEIKDLVPPQEILRAMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
L VAE AN AEA I RQ I AAALQTQGG A DAVN LK I AEQ YV AA FNN LAKE S NT L I M P AN VA D I G S L 
ISAGMKIIDSSKTAK* 
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Z2491_519 <SEQ ID 3186> 

MEFFI ILLAAWVFGFKSFWI PQQEVHWERLGRFHRALTAGLNILI PFIDRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKT FEERDEINSTWSALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LV AE AN AE A I RQ I AAALQTQGG ADAVN LK I AEQYVAA FNN LAKE SNT L I M P AN V AD I G S L 
I SAGMKI I DSSKTAK* 

ZV01_519 <SEQ ID 3187> 

MEFFI ILLVAVAVFGFKSFVVI PQQEVHWERLGRFHRALTAGLNILI PFIDRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKT FEERDEINSTWAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAE AN AEAIRQI AAALQTQGG ADA VNLKI AEQYVAA FNN LAKE SNTLIM PAN V AD I GSL 
I SAGMKI I DSSKTAK* 

ZV02_519 <SEQ ID 3188> 

MEFFI ILLVAVAVFGFKSFVVI PQQEVHWERLGRFHRALTAGLNILI PFIDRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKT FEERDEINSTWSALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEAN AEAIRQI AAALQTQGGADA VNLKI AEQYVAA FNN LAKE SNTLIMPANVADIGSL 
I SAGMKI I DSSKTAK* 

ZV03_519 <SEQ ID 3189> 

MEFFI ILLVAVAVFGFKSFWIPQQEVHWERLGRFHRALTAGLNILI PFIDRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTWSALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAE AN AE AIRQI AAALQTQGG ADAVN LKI AEQYVAA FNN LAKE SNTLIMPANVADIGSL 
I SAGMK 1 1 D S S KT AK * 

ZV04_519 <SEQ ID 3190> 

MEFFI ILLVAVAVFGFKSFWIPQQEVHWERLGRFHRALTAGLNILI PFIDRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKT FEERDEINSTWSALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAE AN AEAIRQI AAALQTQGG ADA VNLKI AEQYVAA FNN LAKE SNTLIMPANVADIGSL 
I SAGMKI I DSSKTAK* 

ZV05_519 <SEQ ID 3191> 

MEFFI I LLVAVAVFGFKSEW I PQQEVHWERLGRFHRALTAGLNILI PFIDRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKT FEERDEINSTWSALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNN LAKE SNTLIMPANVADIGSL 
I SAGMK IIDSS KT AK * 

ZV06_519ASS <SEQ ID 3192> 

MEFFI ILLVAVAVFGFKSFVVI PQQEVHWERLGRFHRALTAGLNILI PFIDRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTVFSALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERK 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAE AN AEAIRQI AAALQTQGGADA VNLKI AEQYVAA FNN LAKE SNTLIMPANVADIGSL 
I SAGMKI I DSSKTAK* 

ZV07_519 <SEQ ID 3193> 

MEFFI I LLVAVAV FGFKS FW I PQQE VH WERLGR FHRALT AGLN ILIPFI DRVAYRHS L 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTWAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKE SNTLIMPANVADIGSL 
ISAGMKII DSSKTAK* 
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ZV11_519 <SEQ ID 3194> 

ME FFI I LLAAVAVFGFKSFVV I PQQEVHWERLGRFHRALTAGLN I LI PFI DRVAYRHSL 
KEI PLDVPSQVCITRDNTQLTVDGI IYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKT FEERDEINSTVVAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
I SAGMKI I DSSKTAK* 

ZV12_519 <SEQ ID 3195> 

MEFFI I LLVAVAVFGFKS FVVI PQQEVHWERLGRFHRALTAGLN I LI PFI DRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTVVAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
I SAGMKI I DSSKTAK* 

ZV18_519 <SEQ ID 3196> 

ME FFI I LLVAVAVFGFKSFWIPQQEVHWERLGRFHRALTAGLNILI PFI DRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKT FEERDEINSTVVAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAE RE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
I SAGMKI I DSSKTAK* 

ZV19_519 <SEQ ID 3197> 

MEFFIILLVAVAVFGFKSFWI PQQEVHWERLGRFHRALTAGLN I LI PFI DRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTWAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
I SAGMKI I DSSKTAK* 

ZV20_519ASS <SEQ ID 3198> 

MEFFIILLVAVAVFGFKSFWI PQQEVHWERLGRFHRALTAGLN I LI PFI DRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKT FEERDE INST WAALDEAAGAWGVKVLRYE I KDLV PPQE I LRSMQAQIT AERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSM 
I SAGMKI I DSSKTAK* 

ZV21_519ASS <SEQ ID 3199> 

MEFFIILLVAVAVFGFKSFWI PQQEVHWERLGRFHRALTAGLNI LI PFI DRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTWAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
ISAGMKIIDSSKTAK* 

ZV22_519ASS <SEQ ID 3200> 

ME FFI I LLAAWVFGFKSFWI PQQEVHWERLGRFHRALTAGLN I LI PFI DRVAYRHSL 
KEI PLDVPSQVCITRDNTQLTVDGI I YFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKT FEERDE INSTVVSALDEAAGAWGVKVLRYE I KDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAKIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
I SAGMKI I DSSKTAK* 

ZV26_519 <SEQ ID 3201> 

MEFFIILLAAVWFGFKS FWI PQQEVHWERLGRFHRALTAGLN I LI PFI DRVAYRHSL 
KEI PLDVPSQVCITRDNTQLTVDGI I YFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKT FEERDEINSTWAALDEAAGAWGVKVLRYEIKDLVPPQE I LRSMQAQITAE RE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
ISAGMKIIDSSKTAK* 

ZV27 519 <SEQ ID 3202> 
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MEFFIILLVAVAVFGFKSFVVIPQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSL 
KEI PLDVPSQVCITRDNTQLTVDGII YFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTWAALDEAAGAWGVKVLRYEIKDLVPPQE ILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
ISAGMKIIDSSKTAK* 

ZV28_519 <SEQ ID 3203> 

MEFFIILLAAVAVFGFKSFWIPQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGII YFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKT FEERDE IN STWAALDEAAGAWGVKVLRYE I KDLVPPQE ILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
L V AE AN AE A I RQ I AAALQTQGG A D AVN LK I AE Q YV AA FNN LAKE S NT L I M P AN V A D I G S L 
ISAGMKIIDSSKTAK* 

ZV29_519ASS <SEQ ID 3204> 

MEFFIILLAAVAVFG FKSFWIPQQEVHWERLGRFHRALTAGLNILI PFIDRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGII YFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKT FEERDEINS I VVSALDEAAGAWGVKVLRYEIKDLVPPQE ILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREPEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
ISAGMKIIDSNKTAK* 

ZV32_519 <SEQ ID 3205> 

MEFFIILLAAVAVFG FKSFWIPQQEVHWERLGRFHRALTAGLNI LI PFIDRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTWSALDEAAGAWGVKVLRYE I KDLVPPQE ILRAMQAQITAE RE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
ISAGMKIIDSSKTAK* 

ZV96_519 <SEQ ID 3206> 

ME FFI I LLAAVAVFG FKS FW I PQQEVHVVERLGR FHRALTAGLN I L I PFI DRVA YRHS L 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKT FEERDE IN STWAALDEAAGAWGVKVLRYE I KDLVPPQE ILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASN7VEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
I SAGMK I IDSS KT AK * 

Figure 22 shows the results of aligning the sequences of each of these strains. Dark 
shading indicates regions of homology, and gray shading indicates the conservation of amino 
acids with similar characteristics. As is readily discernible, there is significant conservation 
among the various strains of ORF 519, further confirming its utility as an antigen for both 
vaccines and diagnostics. 



EXAMPLE 15 



Table 6 lists several Neisseria strains which were used to assess the conservation of 
the sequence of ORF 919 among different strains. 
Table 6 



919 gene variability: List of used Neisseria strains 
Identification Strains Source / reference 
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number 



Group B 



zmOl 


NG6/88 


R. Moxon / Seil 


er et aL, 


1996 


zm02 


BZ198 


R. Moxon / Seil 


er et aL, 


1996 


zm03 


NG3/88 


R. Moxon / Sei 


er et aL, 


1996 


zm04 


297-0 


R. Moxon / Sei 


er et al., 


1996 


zm05 


1000 


R. Moxon / Seil 


[er et aL, 


1996 


zm06 


BZ147 


R. Moxon / Sei 


er et al., 


1996 


zm07 


BZ169 


R. Moxon / Sei 


er et al., 


1996 


zm08n 528 


R. Moxon / Sei 


er et al., 


1996 


zm09 


NGP165 


R. Moxon / Sei 


er et al., 


1996 


zmlO 


BZ133 


R. Moxon / Sei 


er et al., 


1996 


zmllasbc NGE31 


R. Moxon / Sei 


er et al., 


1996 


zml2 


NGF26 


R. Moxon / Seil 


er et ah, 


1996 


zml3 


NGE28 


K. Moxon / beiJ 


er et al., 


1 (\f\/Z 

lyyt) 


zml4 


NGH38 


R. Moxon / Seil 


er et aL, 


1996 


?:ml5 


SWZ107 


R. Moxon / Sei] 


er et aL, 


1996 


im\6 


NGH15 


R. Moxon / Seil 


ier et aL, 


1996 


zm\7 


NGH36 


R. Moxon / Sei 


ler et aL, 


1996 


zml8 


BZ232 


R. Moxon / Sei 


[er et aL, 


1996 


zml9 


BZ83 


R. Moxon / Sei 


ler et aL, 


1996 


zm20 


44/76 • 


R. Moxon / Sei 


[er a/., 


1996 


zm21 


MC58 


R. Moxon 






zm96 


2996 


Our collection 








Group A 








zm22 


205900 


R. Moxon 






zm23asbc F6124 


R. Moxon 






z2491 


Z2491 


R. Moxon / Maiden et aL, 




Group C 








zm24 


90/18311 


R. Moxon 






zm25 


93/4286 


R. Moxon 







Others 

zm26 A22 (group W) R. Moxon / Maiden et aL, 1998 
zm27bc E26 (group X) R. Moxon / Maiden et aL, 1998 

zm28 860800 (group Y) R. Moxon / Maiden et aL, 1998 
zm29asbc E32 (group Z) R. Moxon / Maiden et aL, 1998 
zm31asbc N. lactamica R. Moxon 

Gonococcus 

zm32asbc Ng F62 R. Moxon / Maiden et aL, 1998 

zm33asbc Ng SN4 R. Moxon 

fal090 FA1090 R. Moxon 
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References: 

Seiler A. et aL, Mol. Microbiol, 1996, 19(4):841-856. 
Maiden et al„ Proc. Natl. Acad. Sci. USA, 1998, 95:3140-3145. 



The amino acid sequences for each listed strain are as follows: 



FA1090 <SEQ ID 3207> 

MKKHLLRSALYGIAAAILAACQSRSIQTFPQPDTSVINGPDRPAGIPDPAGTTVAGGGAV 
YTWPHLSMPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKRFFER 
YFTPWQVAGNGSLAGTVTGYYE PVLKGDGRRTERARFPI YGIPDDFISVPLPAGLRGGKN 
LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKAYMRQNPQRLAEVLGQNPSYIFFRELAGSGNEGPVGALGTPLMGEYAGA 
IDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

Z2491 <SEQ ID 3208> 

MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTVVPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSVQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYE PVLKGDDRRTAQARFPI YGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKAYMQQNPQRLAEVLGQNPSYIFFRELTGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM01 <SEQ ID 3209> 

MKKYLFRAALYG I AAAI LAACQSKS I QT FPQPDT SVINGPDRPVGI PDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYE PVLKGDDRRTAQARFPI YGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLL PNGMKPE YR P * 

ZM02 <SEQ ID 3210> 

MKKYLFRAALYG IAAAI LAACQSKS IQT FPQPDT SVINGPDRPVGI PDPAGTTVGGGGAV 
YTVVPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYE PVLKGDDRRTAQARFPI YGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTI DNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAH PVTRKALNRLIMAQDTGSAIKGAVRVDY FWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM03 <SEQ ID 3211> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGI PDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYE PVLKGDDRRTAQARFPI YGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKSYMRQN PQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 



ZM04 <SEQ ID 3212> 
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MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRFVGIPAPAGTTVAGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYG I PDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKAYMQQNPQRLAEVLGQNPSYIFFRELTGSSNDG PVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM05 <SEQ ID 3213> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLSCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGI PDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQG I KAYMRQNPQRLAEVLGQN PS Y I FFRE LAGS SNDG PVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM06 <SEQ ID 3214> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTVVPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQT PVHSFQAKQFFER 
YFT PWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGI PDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGKYMADKGYL 
KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYI FFRE LAGS SNDG PVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM07 <SEQ ID 3215> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFT PWQVAGNGSLAGTVTGY YE PVLKGDDRRTAQARFPIYG I PDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKSYMRQNPQRLAEVLGQN PS Y I FFRE LAGS SNDG PVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM08N <SEQ ID 3216> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YT W PHL S L PHWAAQDFAKS LQS FRLGC AN LKNRQGWQDVC AQAFQT PVH S FQAKQ FFER 
YFT PWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGI PDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQG I KAYMRQNPQRLAEVLGQN PS Y I FFRELAGS SNDG PVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM09 <SEQ ID 3217> 

MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPAPAGTTVAGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQT PVHSFQAKQFFER 
Y FT PWQVAGNG S LAGT VTG Y YE PVLKG DDRRT AQAR FP I YG I P D D FI S V PL PAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGKYMADKGYL 
KLGQTSMQG I KSYMRQNPQRLAEVLGQN PS Y I FFRELTGSGNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM10 <SEQ ID 3218> 

MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGI PAPAGTTVAGGGAV 
YT W PHLS L PHWAAQD FAKS LQS FRLGCAN LKNRQGWQDVCAQAFQT PVHS FQAKQ FFER 
YFT PWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGI PDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGKYMADKGYL 
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KLG QTSMQGIKS YMRQN PQRLAEVLGQN PS Y I FFRELTGSGNDGPVGALGT PLMGE YAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

* 

ZM11ASBC <SEQ ID 3219> 

MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPAPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQT PVHSVQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGKYMADKGYL 
KLGQT SMQG I KS YMRQN PQRLAEVLGQN PS Y I FFRELTGS SN DG PVG ALGT PLMGE YAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM12 <SEQ ID 3220> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQT PVHSFQAKQFFER 
Y FT PWQVAGNG S LAGT VTG Y YE P VLKG DDRRT AQAR FP I YG I P D D FI S VP L P AGLR S GKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQT SMQG I KS YMRQN PQRLAEVLGQN PS Y I FFRELAGS SN DG PVGALGT PLMGE YAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM13 <SEQ ID 3221> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAEQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
K LGQT SMQG I KAYMRQN PQRLAE V LGQN P S Y I FFRE LAG S SN DG PVGALGT PLMGE YAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM14 <SEQ ID 3222> 

MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDT SVINGPDRPVGIPAPAGTTVAGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQT PVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGKYMADKGYL 
KLGQT SMQG I KS YMRQN PQRLAE V LGQN PS Y I FFRE LTGSRN DG PVGALGT PLMGE YAGA 
VDRHYITLGAPLFVATAH PVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM15 <SEQ ID 3223> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDLAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNHQGWQDVCAQAFQT PVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGKYMADKGYL 
KLGQT SMQG IKS YMRQN PQRLAEVLGQN PS YIFFRELTGSGNDG PVGALGT PLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

* ZM16 <SEQ ID 3224> 

MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPGRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQT PVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGKYMADKGYL 
KLGQT SMQG IKS YMRQN PQRLAEVLGQN PS Y I FFRELTGSSNDG PVGALGT PLMGEYAGA 
VDRHYITLGAPLFVATAH PVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 



ZM17 <SEQ ID 3225> 
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MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGKYMADKGYL 
KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELTGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLL PNGMKPE YRP * 

ZM18 <SEQ ID 3226> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQT PVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
K LGQT SMQG IKS YMRQN PQRLAE V LGQN P S Y I F FRE LAG S SN DG P VG ALGT P LMGE Y AG A 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM19 <SEQ ID 3227> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQT PVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM20 <SEQ ID 3228> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTVVPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQG IKS YMRQN PQRLAEVLGQN PS YIFFRELAGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLL PN GMK PE YR P * 

ZM21 <SEQ ID 3229> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQT PVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQG IKS YMRQN PQRLAEVLGQN PS YIFFRELAGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM22 <SEQ ID 3230> 

MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTVVPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSVQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQG I KAYMQQNPQRLAEVLGQN PS YIFFRELTGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM23ASBC <SEQ ID 3231> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQT PVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGKYMADKGYL 
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KLGQTSMQGIKSYMRQN PQRLAEVLGQNPSYI FFRELAGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTSKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGETAGK 
MKE PG YVWQLLPNGMKPE YRP * 

ZM24 <SEQ ID 3232> 

MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPAPAGTTVAGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQT PVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPI YGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGKYMADKGYL 
KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYI FFRELTGSGN DG PVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTG YVWQLLPNGMKPE YRP* 

ZM25 <SEQ ID 3233> 

MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPAPAGTTVAGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQT PVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGKYMADKGYL 
KLGQTSMQGIKSYMRQN PQRLAEVLGQNPSYI FFRELTGSGNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAH PVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM26 <SEQ ID 3234> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQT PVHSVQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQG I KAYMQQNPQRLAEVLGQN PS Y I FFRELTGSSNDG PVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM27BC <SEQ ID 3235> 

MKKYLFRAALYGI S AAI LAACQSKS IQT FPQPDT S VING PDRPAG I PDPAGTTVAGGGAV 
YTVVPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQT PVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPI YGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKSYMRQN PQRLAEVLGQNPSYI FFRELTGSSNDG PVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGETAGK 
MKE PG YVWQLLPNGMKPE YRP* 

ZM28 <SEQ ID 3236> 

MKKYLFRAALCGIAAAI LAACQSKS IQT FPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQT PVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPI YGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQG I KAYMRQN PQRLAEVLGQNPSYI FFRELAGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM29ASBC <SEQ ID 3237> 

MKKYLFRAALCGIAAAI LAACQSKS IQT FPQPDTS VING PDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQT PVHSFQAKQFFER 
YFT PWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFP I YGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKSYMRQN PQRLAEVLGQNPSYI FFRELTGSGNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATTHPITRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 



ZM31ASBC <SEQ ID 3238> 
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MKKHLFRAALYGIAAAILAACQSKSIQTFPQPDTSIIKGPDRPAGIPDPAGTTVGGGGAV 
YTVVPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKAYMRQNPQRLAEVLGQNPSYVFFRE LAGSGNDG PVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP * 

ZM32ASBC <SEQ ID 3239> 

MKKHLLRSALYGIAAAILAACQSRSIQTFPQPDTSVINGPDRPAGIPDPAGTTVAGGGAV 
YTWPHLSMPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKRFFER 
YFT PWQVAGNGSLAGTVTGYYEPVLKGDGRRTERARFPIYGIPDDFISVPLPAGLRGGKA 
LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKAYMRQNPQRLAEVLGQNPSYIFFRELAGSGGDGPVGALGTPLMGGYAGA 
IDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM33ASBC <SEQ ID 3240> 

MKKHLLRSALYGIAAAILAACQSRSIQTFPQPDTSVINGPDRPAGIPDPAGTTVAGGGAV 
YTWPHLSMPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPIHSFQAKRFFER 
YFT PWQVAGNGSLAGTVTGYYEPVLKGDGRRTERARFPIYGIPDDFISVPLPAGLRGGKN 
LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKSYMRQNPHKLAEVLGQNPSYIFFRELAGSGNEGPVGALGTPLMGEYAGA 
IDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM96 <SEQ ID 3241> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQG I KAYMRQNPQRLAEVLGQN PS Y I FFRELAGSSNDG PVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 



Figure 23 shows the results of aligning the sequences of each of these strains. Dark 
shading indicates regions of homology, and gray shading indicates the conservation of amino 
acids with similar characteristics. As is readily discernible, there is significant conservation 
among the various strains of ORF 919, further confirming its utility as an antigen for both 
vaccines and diagnostics. 



EXAMPLE 16 

Using the above-described procedures, the following oligonucleotide primers were 
employed in the polymerase chain reaction (PCR) assay in order to clone the ORFs as 
indicated: 



Table 7: Oligonucleotides used for PCR to amplify complete or partial ORFs 
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ORF 


primer Sequence 


Restriction 
sites 


001 


Forward CGCGGATCCCATATG-TGGATGGTGCTGGTCAT 


BamHl- 


Ndel 




Reverse CCCGCTCGAG-TGCCGTCTTGTCCCAC 


Xhol 


003 


Forward CGCGGATCCCATATG-GTCGTATTCGTGGC 


BamHI- 




Ndel 




Reverse CCCGCTCGAG-AAAATCATGAACACGCGC 


Xhol 


005 


Forward CGCGGATCCCATATG-GACAATATTGACATGT 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-CATCACATCCGCCCG 


Xhol 


006 


Forward CGCGGATCCCATATG-CTGCTGGTGCTGG 


BamHI- 




Ndel 




Reverse CCCGCTCGAG-AGTTCCGGCTTTGATGT 


Xhol 


007 


Forward CGCGGATCCCATATG-GCCGACAACAGCATCAT 


BamHI- 




Ndel 




Reverse CCCGCTCGAG-AAGGCGTTCATGATATAAG 


Xhol 


008 


Forward CGCGGATCCCATATG-AACAACAGACATTTTG 


BamHI- 




Ndel 




Reverse CCCGCTCGAG-CCTGTCCGGTAAAAGAC 


Xhol 


009 


Forward CGCGGATCCCATATG-CCCCGCGCTGCT 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-TGGCTTTTGCCACGTTTT 


Xhol 


Oil 


Forward CGCGGATCCCATATG-AAGACACACCGCAAG 


BamHI- 




Ndel 




Reverse CCCGCTCGAG-GGCGGTCAGTACGGT 


Xhol 


012 


Forward CGCGGATCCCATATG-CTCGCCCGTTGCC 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-AGCGGGGAAGAGGCAC 


Xhol 


013 


Forward CGCGGATCCCATATG-CCTTTGACCATGCT 


BamHI- 






IN del 




Reverse CCCGCTCGAG-CTGATTCGGCAAAAAAATCT 


AnOl 


018 


Forward CGCGGATCCCATATG-CAGCAGAGGCAGTT 


BamHI- 






iNaei 




Reverse CCCGCTCGAG-GACGAGGCGAACGCC 


Xhol 


019 


Forward AAAGAATTC-CTGCCAGCCGGCAAGACCCCGGC 


EcoRI 




Reverse AAACTGCAG-TCAGCGGGCGGGGACAATGCCCAT 


Pst I 


023 


Forward AAAGAATTC-AAAGAATATTCGGCATGGCAGGC 


EcoRI 




Reverse AAACTGCAG-TTACCGCCAAATCACTTTAACTGA 


Pst I 


025 


Forward AAAGAATTC-TGCGCCACCCAACAGCCTGCTCC 


EcoRI 




Reverse AAACTGCAG-TCAGAACGCGATATAGCTGTTCGG 


Pst I 


031 


Forward CGCGGATCCCATATG-GTCTCCCTTCGCTT 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-ATGTAAGACGGGGACAAC 


Xhol 


032 


Forward CGCGGATCCCATATG-CGGCGAAACGTGC 


BamHI- 
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Ndel 




Reverse 


CCCGCTCG AG-CTGGT1 "1 TTTG AT ATTTGTG 


Xhol 




i \J\ WdJu 


CdCCiGATCCC AT ATn.nCGCrCGCiC AG AC A 










11UC1 




Reverse 


CCCGCTCGAG-ATTTGCCGCATCCCGAT 


Xhol 




rui vvdju 


rnrrTpt a Trrr ata Tr.nr m a a a a c a nn a rnn 


Ram HI 












Reverse 


CCCGCTCGAG-TTTGACGATTTGGTTCAATT 


/VI IvJl 


UjO 


UI WdlU 


rriprTpi a Trrr ata Tn_r th a a nr rrrmr n 


Darnrii- 








1NOCJ 




R pver^p 

V vl )v 


CCCGCTCGAG-CCGGACTGCGTATCGG 


Xhol 

/VJ1VJ1 


UJo 


rorwaro 


rnrnn a Trrr ata tg a rrn a TTTrrnrr a 


Danirii- 








iNaci 




Reverse 


CCCGCTCGAG-TTCTACGCCGTACTGCC 


Xhol 


039 


Forward 


CGCGGATCCCATATG-CCGTCCGAACCGC 










Ndel 




Reverse 


CCCGCTCGAG-TAGGATGACGAGGTAGG 


Xhol 


041 


Forward 


CGCGGATCCCATATG-TTCGTGCGCGAACCGC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-GCCCAAAAACTCTTTCAAA 


Xhol 


042 


Forward 


CGCGGATCCCATATG-ACGATGATTTGCTTGC 


BarnHI- 








Ndel 




Reverse 


CCCGCTCGAG-TTTGCAGCCTGCATTTGAC 


Xhol 


043 


Forward 


AAAAAAGGTACC-ATGGTTGTTTCAAATCAAAATATC 


Kpnl 




Reverse 


AAACTGCAG-TTATTGCGCTTCACCTTCCGCCGC 


Pst I 


043 a 


Forward 


AAAAAAGGTACC-GCAAAAGTGCATGGCGGCTTGGACGGTGC 


Kpn 1 




Reverse 


AAAAAACTGCAG- 


Pst I 






TTAATCCTGCAACACGAATTCGCCCGTCCG 




044 


Forward 


CGCGGATCCCATATG-CCGTCCGACTAGAG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-ATGCGCTACGGTAGCCA 


Xhol 


046 


Forward 


AAAGAATTC-ATGTCGGCAATGCTCCCGACAAG 


EcoRI 




Reverse 


AAACTGCAG-TCACTCGGCGACCCACACCGTGAA 


Pst I 


047 


Forward 


CGCGGATCCCATATG-GTCATCATACAGGCG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TCCGAAAAAGCCCAITriG 


Xhol 


048 


Forward 


AAAGAATTC-ATGCTCAACAAAGGCGAAGAATTGCC 


EcoRI 




Reverse 


AAACTGCAG-TCAAGATTCGACGGGGATGATGCC 


Pst I 


049 


Forward 


AAAGAATTC-ATGCGGGCGCAGGCGTTTGATCAGCC 


Eco RJ 




Reverse 


AAACTGC AG - AAGGCGTATCTG AAAAAATGGC AG 


Pst I 


050 


Forward 


CGCGGATCCCATATG-GGCGCGGGCTGG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-AATCGGGCCATCTTCGA 


Xhol 


052 


Forward 


AAAAAAGAATTC-ATGGCTTTGGTGGCGGAGGAAAC 


EcoRI 




Reverse 


AAAAAAGTCGAC-TCAGGCGGCGTTTTTCACCTTCCT 


Sail 


052a 


Forward 


AAAAAAGAATTC-GTGGCGGAGGAAACGGAAATATCCGC 


EcoRI 
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Reverse AAAAAA CTGCAG -TTAGCTGTTTTTGGAAACGCCGTCCAACCC Pst 1 



073 Forward CGC GGATCCCATATG -TGTATGCCATATAAGAT BamHI- 

Ndel 

Reverse CCCG CTCGAG -CACCGGATTGTCCGAC Xhol 

075 Forward CGC GGATCCCATATG -CCGTCTTACTTCATC BamHI- 

Ndel 

Reverse CCCG CTCGAG -ATCACCAATGCCGATTATTT Xhol 

077a Forward AAAAAA GAATTC -GGCGGCATTTTCATCGACACCTTCCT Eco RI 

Reverse AAAAAA CTGCAG -TCAGACGAACATCTGCACAAACGCAAT Pst I 

080 Forward AAAGAATTC-GCGTCCGGGCTGGTTTGGTTTTACAATTC Eco RI 
Reverse AAA CTGCAG -CTATTCTTCGGATTCTTTTTCGGG Pst I 

081 Forward AAA GAATTC -ATGAAACCACTGGACCTAAATTTCATCTG Eco RI 
Reverse AAA CTGCAG -TCACTTATCCTCCAATGCCTC Pst I 

082 Forward AAA GAATTC -ATGTGGTTGTTGAAGTTGCCTGC Eco RI 
Reverse AAA CTGCAG -TTACGCGGATTCGGCAGTTGG Pst I 

084 Forward AAA GAATTC -TATCACCCAGAATATGAATACGGCTACCG Eco RI 
Reverse AAA CTGCAG -TTATACTTGGGCGCAACATGA Pst I 

085 Forward CGC GGATCCCATATG -GGTAAAGGGCAGGACT BamHI- 

Ndel 

Reverse CCCG CTCGAG -CAAAGCCTTAAACGCTTCG Xhol 

086 Forward AAAAAA GGTACC -TATTTGGCATCAAAAGAAGGCGG Kpn I 
Reverse AAA CTGCAG -TTACTCCACCCGATAACCGCG Pst I 

087 Forward AAA GAATTC -ATGGGCGGTAAAACCTTTATGC Eco RI 
• Reverse AAA CTGCAG -TTACGCCGCACACGCAATCGC Pst I 

087a Forward AAAAAA GAATTC -AAGCTATTAGGCGTGCCGATTGTGATTCA Eco RI 

Reverse AAAAAA CTGCAG -TTACGCCTGCAAGATGCCCAGCTTGCC Pst I 

088 Forward AAAAAA GAATTC -ATGTTTTTATGGCTCGCACATTTCAG Eco RI 
Reverse AAAAAA CTGCAG -TCAGCGGATTTTGAGGGTACTCAAACC Pst I 

089 Forward CGC GGATCCCATATG -CCGCCCAAAATCAC BamHI- 

Ndel 

Reverse CCCG CTCGAG -TGCGCATACCAAAGCCA Xhol 

090 Forward CGC GGATCCCATATG -CGCATAGTCGAGCA BamHI- 

Ndel 

Reverse CCCG CTCGAG -AGCAAAACGGCGGTACG Xhol 

091 Forward AAA GAATTC -ATGGAAATACCCGTACCGCCGAGTCC Eco RI 
Reverse AAA CTGCAG -TCAGCGCAGGGGGTAGCCCAAGCC Pst I 

092 Forward AAA G AATTC - ATGT'l I' 1 "1 "I ATTTC AATCCG Eco RI 
Reverse AAA CTGCAG -TCAAATCTGTTTCGACAATGC Pst I 

093 Forward AAA GAATTC -ATGCAGAATTTTGGCAAAGTGGC Eco RI 
Reverse AAA CTGCAG -CTATGGCTCGTCATACCGGGC Pst I 

094 Forward AAA GAATTC -ATGCCGTCACGGAAGCGCATCAACTC Eco RI 
Reverse AAA CTGCAG -TTATCCCGGCCATACCGCCGAACA Pst I 

095 Forward AAA GAATTC -ATGTCCTTTCATTTGAACATGGACGG Eco RI 
Reverse AAA CTGCAG -TCAACGCCGCAGGCACTAACGCCC Pst I 

096 Forward AAAGAATTC-ATGGCTCGTCATACCGGGCAGGG Eco RI 
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Reverse AAA CTGCAG -TCAAAGGAAAAGGCCGTCTGAAAAGCG Pst 1 

097 Forward AAA G AATTC - ATGG AC ACTTC AAAAC AAAC ACTGTTG EcoRI 
Reverse AAACTGCAG-TCAGCCCAAATACCAGAATTTCAG Pst I 

098 Forward AAA GAATTC -GATGAACGCAGCCCAGCATGGATACG Eco RI 
Reverse AAA CTGCAG -TTACGACATTCTGATTTGGCA Pst I 

102 Forward AAAAAA GAATTC -GGCCTGATGATTTTGGAAGTCAACAC Eco RJ 

Reverse AAAAAA CTGCAG -TTATCCTTTAAATACGGGGACGAGTTC Pst I 
105 Forward CGC GGATCCCATATG -TCCGCAAACGAATACG BarnHI- 

Ndel 

Reverse CCCGCTCGAG-GTGTTCTGCCAGTTTCAG Xhol 

107 Forward AAAAAA GAATTC - Eco RI 

CTGATGATTTTGGAAGTCAACACCCATTATCC 
Reverse AAAAAA CTGCAG -TTATCCTTTAAATACGGGGACGAGTTC Pst I 
107b Forward AAAAAA GAATTC - Eco RI 

GATACCCAAGCCCCCGCCGGCACAAACTACTG 
Reverse AAAAAA CTGCAG - Pst 1 

TTACGCGTCGCCTTTAAAGTATTTGAGCAGGCTGGAGAC 

108 Forward AAA GAATTC -ATGTTGCCGGGCTTCAACCG Eco RI 
Reverse AAA CTGCAG -TTAGCGGTACAGGTGTTTGAAGCA Pst I 

108a Forward AAAAAA GAATTC -GGTAACACATTCGGCAGCTTAGACGGTGG Eco RI 
Reverse AAA CTGCAG -TTAGCGGTACAGGTGTTTGAAGCA Pst I 

109 Forward AAA GAATTC -ATGTATTATCGCCGGGTTATGGG EcoRI 
Reverse AAA CTGCAG -CTAGCCCAAAGATTTGAAGTGTTC Pst I 

1 1 1 Forward CGC GGATCCCATATG -TGTTCGGAACAAACCGC BamHI- 

Ndel 

Reverse CCCG CTCGAG -GCGGAGCAGTTTTTCAAA Xhol 
114 Forward CGC GGATCCCATATG -GCTTCCATCACTTCGC BamHI- 

Ndel 

Reverse CCCG CTCGAG -CATCCGCGAAATCGTC Xhol 

1 1 7 Forward AAAAAA GGTACC -ATGGTCGAAGAACTGGAACTGCTG Kpn I 
Reverse AAA CTGCAG -TTAAAGCCGGGTAACGCTCAATAC Pst I 

1 1 8 Forward AAA GTCGAC ATGTGTGAGTTCAAGGATATTATAAG Sal I 
Reverse AAA GCATGC -CTATTTTTTGTTGTAATAATCAAATC Sph I 

121 Forward CGC GGATCCCATATG -GAAACACAGCTTTACAT BamHI- 

Ndel 

Reverse CCCG CTCGAG -ATAATAATATCCCGCGCCC Xhol 

122 Forward CGC GGATCCCATATG -GTCATGATTAAAATCCGCA BamHI- 

Ndel 

Reverse CCCG CTCGAG -AATCTTGGTAGATTGGATTT Xhol 

125 Forward AAA GAATTC -ATGTCGGGCAATGCCTCCTCTCC EcoRI 
Reverse AAA CTGCAG -TCACGCCGTTTCAAGACG Pst I 

125a Forward AAAAAA GAATTC - ACGGCAGGCAGCACCGCCGCACAGGTTTC EcoRI 
Reverse AAAAAA CTGCAG - Pst I 

TTATTTTGCCACGTCGGTTTCTCCGGTGAACAACGC 

126 Forward CGC GGATCCCATATG -CCGTCTGAAACCC BamHI- 
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Ndel 



Reverse CCCG CTCGAG -ATATTCCGCCGAATGCC Xhol 

127 Forward AAA GAATTC -ATGGAAATATGGAATATGTTGGACACTTG Eco RI 
Reverse AAA CTGCAG -TTAAAGTGTTTCGGAGCCGGC Pst I 

127a Forward AAAAAA GAATTC -AAGGAACTGATTATGTGTCTGTCGGG Eco RI 

Reverse AAA CTGCAG -TTAAAGTGTTTCGGAGCCGGC Pst I 

128 Forward CGC GGATCCCATATG -ACTGACAACGCACT BamHI- 

Ndel 

Reverse CCCG CTCGAG -GACCGCGTTGTCGAAA Xhol 

130 Forward CGC GGATCCCATATG -AAACAACTCCGCGA BamHI- 

Ndel 

Reverse CCCG CTCGAG -GAATTTTGCACCGGATTG Xhol 

132 Forward AAA GAATTC -ATGGAACCCTTCAAAACCTTAATTTG Eco RI 

Reverse AAAAAA CTGCAG -TCACCATGTCGGCATTTGAAAAAC Pst I 

134 Forward CGC GGATCCCATATG -TCCCAAGAAATCCTC BamHI- 

Ndel 

Reverse CCCG CTCGAG -CAGTTTGACCGAATGTTC Xhol 

135 Forward CGC GGATCCCATATG -AAATACAAAAGAATCGTATT BamHI- 

Ndel 

Reverse CCCG CTCGAG -AAATTCGGTCAGAAGCAGG Xhol 

137 Forward AAAAAA GGTACC -ATGATTACCCATCCCCAATTCGATCC Kpn I 
Reverse AAAAAA CTGCAG -TCAGTGCTGTTTTTTCATGCCGAA Pst I 

137a Forward AAAAAA GAATTC -GGCCGCAAACACGGCATCGGCTTCCT Eco RI 

Reverse AAAAAA CTGCAG -TTAAGCGGGATGACGCGGCAGCATACC Pst I 

138 Forward AAAAAA GAATTC -AACTCAGGCGAAGGAGTGCTTGTGGC Eco RI 
Reverse AAAAAA TCTAGA- TCAGTTTAGGGATAGCAGGCGTAC Xba I 

141 Forward AAA GAATTC -ATGAGCTTCAAAACCGATGCCGAAATCGC Eco RI 
Reverse AAA CTGCAG -TCAGAACAAGCCGTGAATCACGCC Pst I 

142 Forward CGC GGATCCCATATG -CGTGCCGATTTCATG BamHI- 

Ndel 

Reverse CCCG CTCGAG -AAACTGCTGCACATGGG Xhol 

143 Forward AAAAAAGAATTC- Eco RI 

ATGCTCAGTTTCGGCTTTCTCGGCGTTCAGAC 

Reverse AAAAAA CTGCAG -TCAAACCCCGCCGTGTGTTTCTTTAAT Pst I 

144 Forward AAAAAA GAATTC -GGTCTGATCGACGGGCGTGCCGTAAC Eco RI 
Reverse AAAAAA TCTAGA -TCGGCATCGGCCGGCATATGTCCG Xba I 

146 Forward AAAAAA GAATTC - Eco RI 

CGCCAAGTCGTCATTGACCACGACAAAGTC 

Reverse AAAAAA CTGCAG -TTAGGCATCGGCAAATAGGAAACTGGG Pst I 

147 Forward AAAAAA GAATTC -ACTGAGCAATCGGTGGATTTGGAAAC Eco RI 
Reverse AAAAAA TCTAGA -TTAGGTAAAGCTGCGGCCCATTTGCGG Xba I 

148 Forward AAAAAA GAATTC - Eco RI 

ATGGCGTTAAAAACATCAAACTTGGAACACGC 

Reverse AAAAAA TCTAGA -TCAGCCCTTCATACAGCCTTCGTTTTG Xba I 

149 Forward CGCGGATCCCATATG-CTGCTTGACAACAAAGT BamHI- 
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Ndel 




Reverse 


CCCGCTCGAG-AAACTTCACGTTCACGCC 


Xhol 


150 


Forward 


CGCGGATCCCATATG-CAGAACACAAATCCG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-ATAAACATCACGCTGATAGC 


Xhol 


151 


Forward 


AAAAAAGAATTC- 


Eco RJ 






ATGAAACAAATCCGCAACATCGCCATCATCGC 






Reverse 


AAAAAACTGCAG-TCAATCCAGCTTTTTAAAGTGGCGGCG 


Pst I 


152 


Forward 


AAAAAA GAATTC- 


Eco RJ 






ATGAAAAACAAAACCAAAGTCTGGGACCTCCC 






Reverse 


AAAAAACTGCAG-TCAGGACAGGAGCAGGATGGCGGC 


Pst I 


153 


Forward 


AAAAAAGAATTC-ATGGCGTTTGCTTACGGTATGAC 


EcoRJ 




Reverse 


AAAAAACTGCAG-TCAGTCATGTTTTTCCGTTTCATT 


Pst I 


153a 


Forward 


AAAAAA GAATTC-CGGACTTCGGTATCGGTTCCCCAGCATTG 


EcoRI 




Reverse 


AAAAAACTGCAG- 


Pst I 






TTACGCCGACGAAA'i AC 1CAG ACT! J 1 CGG 




154 


Forward 


CGCGGATCCCATATG-ACTGACAACAGCCC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TCGGCTTCCTTTCGGG 


Xhol 


155 


Forward 


AAAAAA GAATTC-ATGAAAATCGGTATCCCACGCGAGTC 


EcoRI 




Reverse 


AAAAAACTGCAG-TTACCCTTTCTTAAj^CATATTCAGCAT 


Pst I 


156 


Forward 


AAAAAAGAATTC-GCACAGCAAAACGGTTTTGAAGC 


Eco RI 




Reverse 


AAAAAACTGCAG-TCAAGCAGCCGCGACAAACAGCCC 


Pst I 


157 


Forward 


CGCGGATCCCATATG-AGGAACGAGGAAAAAC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-AAAACACAATATCCCCGC 


Xhol 


158 


Forward 


AAAAAA GAATTC-GCGGAGCAGTTGGCGATGGC AAA TTCTGC 


EcoRI 




Reverse 


AAAAAATCTAGA-TTATCCACAGAGATTGTTTCCCAGTTC 


Xbal 


160 


Forward 


CGCGGATCCCATATG-GACATTCTGGACAAAC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TTTTTGCCCGCCTTCTTT 


Xhol 


163 


Forward 


AAAAAAGGTACC-ACCGTGCCGGATCAGGTGCAGATGTG 


Kpnl 




Reverse 


AAAAAATCTAGA-TTACTCTGCCAATTCCACCTGCTCGTG 


Xbal 


163a 


Forward 


AAAAAAGAATTC-CGGCTGGTGCAGATAATGAGCCAGAC 


Eco RI 




Reverse 


AAAAAATCTAGA-TTACTCTGCCAATTCCACCTGCTCGTG 


Xbal 


164 


Forward 


CGCGGATCCCATATG-AACCGGACTTATGCC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TTTGTTTCCGTCAAACTGC 


Xhol 


165 


Forward 


CGCGGATCCGCTAGC-GCTGAAGCGACAGACG 


BamHI- 








Nhel 




Reverse 


CCCGCTCGAG-AATATCCAATACTTTCGCG 


Xhol 


206 


Forward 


CGCGGATCCCATATG-AAACACCGCCAACCGA 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TTCTGTAAAAAAAGTATGTGC 


Xhol 


209 


Forward 


CGCGGATCCCATATG-CTGCGGCATTTAGGA 


BamHI- 








Ndel 



SUBSTITUTE SHEET (RULE 26) 



WO 99/057280 



PCT/US99/09346 



146 





Reverse 


CCCGCTCGAG-TACCCCTGAAGGCAAC 


Xhol 


211 


Forward 


AAAAAAGAATTC-ATGTTGCGGGTTGCTGCTGC 


Eco RJ 




Reverse 


AAAAAACTGCAG-CTATCCTGCGGATTGGCATTGAAA 


Pst I 


212 


Forward 


CGCGGATCCCATATG-GACAATCTCGTATGG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-AGGGGTTAGATCCTTCC 


Xhol 


215 


Forward 


CGCGGATCCCATATG-GCATGGTTGGGTCGT 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-CATATCTTTTGTATCATAAATC 


Xhol 


216 


Forward 


CGCGGATCCCATATG-GCAATGGCAGAAAACG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TACAATCCGTGCCGCC 


Xhol 


217 


Forward 


CGCGGATCCCATATG-GCGGATGACGGTGTG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-ACCCCGAATATCGAATCC 


Xhol 


218 


Forward 


CGCGGATCCCATATG-GTCGCGGTCGATC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TAACTCATAGAATCCTGC 


Xhol 


219 


Forward 


CGCGGATCCGCTAGC-ACGGCAAGGTTAAG 


BamHI- 








Nhel 




Reverse 


CCCGCTCGAG-TTTAAACCATCTCCTCAAAAC 


Xhol 


223 


Forward 


CGCGGATCCCATATG-GAATTCAGGCACCAAGTA 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-GGCTTCCCGCGTGTC 


Xhol 


225 


Forward 


CGCGGATCCCATATG-GACGAGTTGACCAACC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-GTTCAGAAAGCGGGAC 


Xhol 


226 


Forward 


AAAGAATTC-CTTGCGATTATCGTGCGCACGCG 


EcoRl 




Reverse 


AAACTGCAG-TCAAAATCCCAAAACGGGGAT 


Pst I 


228 


Forward 


CGCGGATCCCATATG-TCGCAAGAAGCCAAACAG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TTTGGCGGCATCTTTCAT 


Xhol 


229 


Forward 


CGCGGATCCCATATG-CAAGAGGTTTTGCCC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-ACACAATATAGCGGATGAAC 


Xhol 


230 


Forward 


CGCGGATCCCATATG-CATCCGGGTGCCGAC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-AAGTTTGGCGGCTTCGG 


Xhol 


232 


Forward 


AAAAAAGAATTC-ATGTACGCTAAAAAAGGCGGTTTGGG 


Eco RI 




Reverse 


AAAAAACTGCAG-TCAAGGTTTTTTCCTGATTGCCGCCGC 


Pst I 


232a 


Forward 


AAAAAAGAATTC-GCCAAGGCTGCCGATACACAAATTGA 


EcoRI 




Reverse 


AAAAAACTGCAG-TTAAACATTGTCGTTGCCGCCCAGATG 


Pst I 


233 


Forward 


CGCGGATCCCATATG-GCGGACAAACCCAAG 


BamHl- 








Ndel 




Reverse 


CCCGCTCGAG-GACGGCATTGAGCAG 


Xhol 


234 


Forward 


CGCGGATCCCATATG-GCCGTTTCACTGACCG 


BamHI- 
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Ndel 




Reverse 


GCCCAAGCTT-ACGGTTGGATTGCCATG 


Hind III 


235 


Forward 


CGCGGATCCCATATG-GCCTGCCAAGTTCAAA 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TTTGGGCTGCTCTTC 


Xhol 


236 


Forward 


CGCGGATCCCATATG-GCGCGTTTCGCCTT 


BamHl- 








Ndel 




Reverse 


CCCGCTCGAG-ATGGGTCGCGCGCCGT 


Xhol 


238 


Forward 


CGCGGATCCGCTAGC-AACGGTTTGGATGCCCG 


BamHI- 








Nhel 




Reverse 


CCCGCTCGAG-TTTGTCTAAGTTCCTGATATG 


Xhol 


239 


Forward 


CCGGAATTCTACATATG-CTCCACCATAAAGGTATTG 


EcoRI- 








Ndel 




Reverse 


CCCGCTCGAG-TGGTGAAGAGCGGTTTAG 


Xhol 


240 


Forward 


CGCGGATCCCATATG-GACGTTGGACGATTTC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-AAACGCCATTACCCGATG 


Xhol 


241 


Forward 


CCGGAATTCTACATATG-CCAACACGTCCAACT 


EcoRI- 








Ndel 




Reverse 


CCCGCTCGAG-GAATGCGCCTGTAATTAATC 


Xhol 


242 


Forward 


CGCGGATCCCATATG-ATCGGCAAACTTGTTG 


BamHI- 








Ndel 




Reverse 


GCCCAAGCTT-ACCGATACGGTCGCAG 


Hindin 


243 


Forward 


CGCGGATCCCATATG-ACGATTTTTTCGATGCTGC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-CGACTTGGTTACCGCG 


Xhol 


244 


Forward 


CGCGGATCCCATATG-CCGTCTGAAGCCC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TTrrri CGGTAGGGGATTT 


Xhol 


246 


Forward 


CGCGGATCCCATATG-GACATCGGCAGTGC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-CCCGCGCTGCTGGAG 


Xhol 


247 


Forward 


CGCGGATCCCATATG-GTCGGATCGAGTTAC 


BamHl- 








Ndel 




Reverse 


CCCGCTCGAG-AAGTGTTCTGTTTGCGCA 


Xhol 


248 


Forward 


CGCGGATCCCATATG-CGCAAACAGAACACT 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-CTCATCATTATTGCTAACA 


Xhol 


249 


Forward 


CGCGGATCCCATATG-AAGAATAATGATTGCTTC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TTCCCGACCTCCGAC 


Xhol 


251 


Forward 


CGCGGATCCCATATG-CGTGCTGCGGTAGT 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TACGAAAGCCGGTCGTG 


Xhol 


253 


Forward 


AAAAAAGAATTC-ATGATTGACAGGAACCGTATGCTGCG 


EcoRI 




Reverse 


AAAAAACTGCAG-TTATTGGTCTTTCAAACGCCCTTCCTG 


PstI 
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253a Forward AAAAAA GAATTG -AAAATCCTTTTGAAAACAAGCGAAAACGG 
Reverse AAAAAA CTGCAG -TTATTGGTCTTTCAAACGCCCTTCCTG 

254 Forward AAAAAA GAATTC -ATGTATACAGGCGAACGCTTCAATAC 
Reverse AAAAAA TCTAGA -TCAGATTACGTAACCGTACACGCTGAC 

255 Forward CGC GGATCCCATATG -GCCGCGTTGCGTTAC 

Reverse CCCG CTCGAG -ATCCGCAATACCGACCAG 

256 Forward CGC GGATCCGCTAGC -TTTTAACACCGCCGGAC 

Reverse CCCG CTCGAG -ACGCCTGTTTGTGCGG 

257 Forward CGC GGATCCCATATG -GCGGTTTCTTTCCTG 

Reverse CCCG CTCGAG -GCGCGTGAATATCGCG 

258 Forward AAAAAA GAATTC -GATTATTTCTGGTGGATTGTTGCGTTCAG 
Reverse AAAAAA CTGCAG -CTACGCATAAGT1 I I 1 ACCGTTTTTGG 

258a Forward AAAAAA GAATTC -GCGAAGGCGGTGGCGCAAGGCGA 

Reverse AAAAAA CTGCAG -CTACGCATAAGTTTTTACCGTTTTTGG 

259 Forward CGC GGATCCCATATG -GAAGAGCTGCCTCCG 

Reverse CCCG CTCGAG -GGCTTTTCCGGCGTTT 

260 Forward CGC GGATCCCATATG -GGTGCGGGTATGGT 

Reverse CCCG CTCGAG -AACAGGGCGACACCCT 

261 Forward AAAAAA GAATTC -CAAGATACAGCTCGGGCATTCGC 
Reverse AAAAAA CTGCAG -TCAAACCAACAAGCCTTGGTCACT 

263 Forward CGC GGATCCCATATG -GCACGTTTAACCGTA 

Reverse CCCG CTCGAG -GGCGTAAGCCTGCAATT 

264 Forward AAAAAA GGTACC -GCCGACGCAGTGGTCAAGGCAGAA 
Reverse AAA CTGCAG -TCAGCCGGCGGTCAATACCGCCCG 

265 Forward AAAAAA GAATTC -GCGGAGGTCAAGAGAAGGTGTTTG 
Reverse AAAAAA CTGCAG -TTACGAATACGTCGTCAAAATGGG 

266 Forward AAAGAATTC-CTCATCTTTGCCAACGCCCCCTTC 
Reverse AAA CTGCAG -CTATTCCCTGTTGCGCGTGTGCCA 

267 Forward AAA GAATTC -TTCTTCCGATTCGATGTTAATCG 
Reverse AAA CTGCAG -TTAGTAAAAACCTTTCTGCTTGGC 

269 Forward AAA GAATTC -TGCAAACCTTGCGCCACGTGCCC 
Reverse AAA CTGCAG -TTACGAAGACCGCAACGAAAGGCAGAG 

269a Forward AAAAAA GAATTC -GACTTTATCCAAAACACGGCTTCGCC 
Reverse AAA CTGCAG -TTACGAAGACCGCAACGAAAGGCAGAG 

270 Forward AAA GAATTC -GCCGTCAAGCTCGTTTTGTTGCAATG 
Reverse AAA CTGCAG -TTATTCGGCGGTAAATGCCGTCTG 

271 Forward CGC GGATCCCATATG -CCTGTGTGCAGCTCGAC 

Reverse CCCGCTCGAG-TCCCAGCCCCGTGGAG 



Eco RJ 

Pst I 

Eco RI 

Xbal 

BamHI- 

Ndel 

Xhol 

BamHI- 

Nhel 

Xhol 

BamHI- 

Ndel 

Xhol 

EcoRI 

Pst I 

EcoRI 

Pst I 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

EcoRI 

Pst I 

BamHI- 

Ndel 

Xhol 

Kpnl 

Pst I 

EcoRI 

Pst I 

EcoRI 

Pst I 

EcoRI 

Pst I 

EcoRI 

Pst I 

EcoRI 

Pst I 

EcoRI 

Pst I 

BamHI- 

Ndel 

Xhol 
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272 Forward AAA GAATTC -ATGACCGCAAAGGAAGAACTGTTCGC Eco RJ 
Reverse AAA CTGCAG -TCAGAGCAGTTCCAAATCGGGGCT Pst I 

273 Forward AAA GAATTC -ATGAGTCTTCAGGCGGTATTTATATACCC Eco RJ 
Reverse AAA CTGCAG -TTACGCGTAAGAAAAAACTGC Pst I 

274 Forward CGC GGATCCCATATG -ACAGATTTGGTTACGGAC BamHI- 

Ndel 

Reverse CCCG CTCGAG -TTTGCTTTCAGTATTATTGAA Xhol 

276 Forward AAAAAA GAATTC - Eco RI 

ATGATTTTGCCGTCGTCCATCACGATGATGCG 
Reverse AAAAAA CTGCAG -CTACACCACCATCGGCGAATTTATGGC Pst I 

277 Forward AAAAAAGAATTC- ATGCCCCGCTTTGAGGACAAGCTCGTAGG Eco RI 
Reverse AAAAAA CTGCAG -TCATAAGCCATGCTTACCTTCCAACAA Pst I 

277a Forward AAAAAA GAATTC -GGGGCGGCGGCTGGGTTGGACGTAGG Eco RI 
Reverse AAAAAA CTGC AG -TC AT AA GCC ATGCTT ACCTTCC AAC AA Pst I 

278 Forward AAAAAA GGTACC -GTCAAAGTTGTATTAATCGGGCCTTTGCC Kpn I 
Reverse AAAAAA CTGCAG -TCATTCAACCATATCAAATCTGCC Pst I 

278a Forward AAAAAAGAATTC- AAAACTCTCCTAATTCGTCATAGTCG Eco RI 

Reverse AAAAAA CTGCAG -TCATTCAACCATATCAAATCTGCC Pst I 

279 Forward CGC GGATCCCATATG -TTGCCTGCAATCACGATT BamHI- 

Ndel 

Reverse CCCG CTCGAG -TTTAGAAGCGGGCGGCAA Xhol 

280 Forward AAAAAA GGTACC -GCCCCCCTGCCGGTTGTAACCAG Kpn I 
Reverse AAAAAA CTGCAG -TTATTGCTTCATCGCGTTGGTCAAGGC Pst I 

281 Forward AAAAAA GAATTC -GCACCCGTCGGCGTATTCCTCGTCATGCG Eco RI 
Reverse AAAAAA TCTAGA -GGTCAGAATGCCGCCTTCTTTGCCGAG Xba I 

281a Forward AAAAAA GAATTC -TCCTACCACATCGAAATTCCTTCCGG Eco RI 

Reverse AAAAAA TCTAGA -GGTCAGAATGCCGCCTTCTTTGCCGAG Xba I 

282 Forward AAAAAA GAATTC -CTTTACCTTGACCTGACCAACGGGCACAG Eco RI 
Reverse AAAAAA CTGCAG -TCAACCTGCCAGTTGCGGGAATATCGT Pst I 

283 Forward CGC GGATCCCATATG -GCCGTCTTTACTTGGAAG BamHI- 

Ndel 

Reverse CCCG CTCGAG -ACGGCAGTATTTGTTTACG Xhol 

284 Forward CGC GGATCCCATATG -TTTGCCTGCAAAAGAATCG BamHI- 

Ndel 

Reverse CCCG CTCGAG -CCGACTTTGCAAAAACTG Xhol 

286 Forward CGC GGATCCCATATG -GCCGACCTTTCCGAAAA BamHI- 

Ndel 

Reverse CCCG CTCGAG -GAAGCGCGTTCCCAAG Xhol 

287 Forward CCG GAATTC TA GCTAGC -CTTTCAGCCTGCGGG EcoRI- 

Nhel 

Reverse CCCG CTCGAG -ATCCTGCTCTTTTTTGCC Xhol 

288 Forward CGC GGATCCCATATG -CACACCGGACAGG BamHI- 

Ndel 

Reverse CCCGCTCGAG-CGTATCAAAGACTTGCGT Xhol 
290 Forward CGCGGATCCCATATG-GCGGTTTGGGGCGGA BamHI- 
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Ndel 

Reverse CCCG CTCGAG -TCGGCGCGGCGGGC Xhol 
292 Forward CGC GGATCCCATATG -TGCGGGCAAACGCCC BamHI- 

Ndel 

Reverse CCCGCTCGAG-TTGATTTTTGCGGATGATTT Xhol 

294 Forward AAAAAA GAATTC -GTCTGGTCGATTCGGGTTGTCAGAAC Eco RJ 
Reverse AAAAAA CTGCAG -TTACCAGCTGATATAAAACATCGCTTT Pst I 

295 Forward CGC GGATCCCATATG -AACCGGCCGGCCTCC BamHI- 

Ndel 

Reverse CCCG CTCGAG -CGATATTTGATTCCGTTGC Xhol 

297 Forward AAAAAA GAATTC -GCATACATTGCTTCGACAGAGAG Eco Rl 
Reverse AAAAAA CTGCAG -TCAATCCGATTGCGACACGGT Pst I 

298 Forward AAAAAA GAATTC -CTGATTGCCGTGTGGTTCAGCCAAAACCC Eco Rl 
Reverse AAAAAA CTGCAG -TCATGGCTGTGTACTTGATGGTTGCGT Pst I 

299 Forward CGC GGATCCGCTAGC -CTACCTGTCGCCTCCG BamHI- 

Nhel 

Reverse CCCG CTCGAG -TTGCCTGATTGCAGCGG Xhol 
302 Forward AAAAAA GAATTC -ATGAGTCAAACCGATACGCAACG Eco Rl 

Reverse AAAAAA CTGCAG -TTAAGGTGCGGGATAGAATGTGGGCGC Pst I 

305 Forward AAAAAA GGTACC -GAATTTTTACCGATTTCCAGCACCGGA Kpn I 
Reverse AAAAAA CTGCAG -TCATTCCCAACTTATCCAGCCTGACAG Pst I 

305a Forward AAAAAA GGTACC -TCCCGTTCGGGCAGTACGATTATGGG Kpn I 

Reverse AAAAAA CTGCAG -TTACAAACCGACATCATGCAGGGTGAA Pst I 

306 Forward CGC GGATCCCATATG -TTTATGAACAAATTTTCCC BamHI- 

Ndel 

Reverse CCCG CTCGAG -CCGCATCGGCAGAC Xhol 
308 Forward CGC GGATCCCATATG -TTAAATCGGGTATTTTATC BamHI- 

Ndel 

Reverse CCCGCTCGAG-ATCCGCCATTCCCTGC Xhol 

311 Forward AAAAAA GGTACC -ATGTTCAGTTTTGGCTGGGTGTTT Kpn I 
Reverse AAACTGCAG-ATGTTCATATTCCCTGCCTTCGGC Pst I 

312 Forward AAAAAA GGTACC -ATGAGTATCCCATCCGGCGAAATT Kpn I 
Reverse AAA CTGCAG -TCAGTTTTTCATCGATTGAACCGG Pst I 

313 Forward AAAAAA GAATTC -ATGGACGACCCGCGCACCTACGGATC Eco Rl 
Reverse AAAAAA CTGCAG -TCAGCGGCTGCCGCCGATTTTGCT Pst I 

401 Forward CGC GGATCCCATATG -AAGGCGGCAACACAGC BamHI- 

Ndel 

Reverse CCCG CTCGAG -CCTTACGTTTTTCAAAGCC Xhol 

402 Forward AAAAAA GAATTC -GTGCCTCAGGCATTTTCATTTACCCTTGC Eco RJ 
Reverse AAAAAA TCTAGA -TTAAATCCCTCTGCCGTATTTGTATTC Xba I 

402a Forward AAAAAA GAATTC -AGGCTGATTGAAAACAAACACGG Eco Rl 

Reverse AAAAAA TCTAGA -TTAAATCCCTCTGCCGTATTTGTATTC Xba I 
406 Forward CGC GGATCCCATATG -TGCGGGACACTGACAG BamHI- 

Ndel 

Reverse CCCGCTCGAG-AGGTTGTCCTTGTCTATG Xhol 
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501 


Forward CGCGGATCCCATATG-GCAGGCGGAGATGGC 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-GGTGTGATGTTCACCC 


Xhol 


502 


Forward CGCGGATCCCATATG-GTAGACGCGCTTAAGCA 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-AGCTGCATGGCGGCG 


Xhol 


503 


Forward CGCGGATCCCATATG-TGTTCGGGGAAAGGCG 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-CCGCGCATTCCTCGCA 


Xhol 


504 


Forward CGCGGATCCCATATG-AGCGATATTGAAGTGACG 


BamHI- 






Ndel 




Reverse GCCCAAGCTT-TGATTCAAGTCCTTGCCG 


Hindlll 


505 


Forward CGCGGATCCCATATG-TTTCGTTTACAATTCAGG 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-CGGCGTTTTATAGCGG 


Xhol 


510 


Forward CGCGGATCCCATATG-CCTTCGCGGACAC 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-GCGCACTGGCAGCG 


Xhol 


512 


Forward CGCGGATCCCATATG-GGACATGAAGTAACGGT 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-AGGAATAGCCTTTGACG 


Xhol 


515 


Forward CGCGGATCCCATATG-GAGGAAATAGCCTTCGA 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-AAATGCCGCAAAGCATC 


Xhol 


516 


Forward CGCGGATCCCATATG-TGTACGTTGATGTTGTGG 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-TTTGCGGGCGGCATC 


Xhol 


517 


Forward CGCGGATCCCATATG-GGTAAAGGTGTGGAAATA 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-GTGCGCCCAGCCGT 


Xhol 


518 


Forward AAAGAATTC-GCTTTTTTACTGCTCCGACCGGAAGG 


EcoRI 




Reverse AAACTGCAG-TCAAATTTCAGACTCTGCCAC 


Pst I 


519 


Forward CGCGGATCCCATATG-TTCAAATCCTTTGTCGTCA 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-TTTGGCGGTTTTGCTGC 


Xhol 


520 


Forward CGCGGATCCCATATG-CCTGCGCTTCTTTCA 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-ATAT1TACATTTCAGTCGGC 


Xhol 


521 


Forward CGCGGATCCCATATG-GCCAAAATCTATACCTGC 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-CATACGCCCCAGTTCC 


Xhol 


522 


Forward CGCGGATCCCATATG-ACTGAGCCGAAACAC 


BamHI- 






Ndel 




Reverse GCCCAAGCTT-TTCTGATTTCAAATCGGCA 


Hindlll 


523 


Forward CGCGGATCCCATATG-GCTCTGCTTTCCGCG 


BamHI- 






Ndel 
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Reverse CCCG CTCGAG -AGGGTGTGTGATAATAAGAAG 
525 Forward CGC GGATCCCATATG -GCCGAAATGGTTCAAATC 

Reverse CCCG CTCGAG -GCCCGTGCATATCATAAA 
527 Forward AAA GAATTC -TTCCCTCAATGTTGCCGTTTTCG 

Reverse AAA CTGCAG -TTATGCTAAACTCGAAACAAATTC 

529 Fonvard CGC GGATCCGCTAGC -TGCTCCGGCAGCAAAAC 

Reverse GCCC AAGCTT -ACGCAGTTCGGAATGGAG 

530 Forward CGC GGATCCCATATG -AGTGCGAGCGCGG 

Reverse CCCG CTCGAG -ACGACCGACTGATTCCG 

531 Forward AAAAAA GAATTC -TATGCCGCCGCCTACCAAATCTACGG 
Reverse AAAAAA CTGCAG -TTAAAACAGCGCCGTGCCGACGACAAG 

532 Forward AAAAAA GAATTC -ATGAGCGGTCAGTTGGGCAAAGGTGC 
Reverse AAAAAA CTGCAG -TCAGTGTTCCAAGTGGTCGGTATCAAA 

532a Forward AAAAAA GAATTC -TTGGGTGTCGCGTTTGAGCCGGAAGT 
Reverse AAAAAA CTGCAG -TCAGTGTTCCAAGTGGTCGGTATCAAA 

535 Forward AAA GAATTC -ATGCCCTTTCCCGTTTTCAGAC 
Reverse AAA CTGCAG -TCAGACGACCCCGCCTTCCCC 

537 Forward CGC GGATCCCATATG -CATACCCAAAACCAATCC 

Reverse CCCG CTCGAG -ATCCTGCAAATAAAGGGTT 

538 Forward CGC GGATCCCATATG -GTCGAGCTGGTCAAAGC 

Reverse CCCG CTCGAG -TGGCATTTCGGTTTCGTC 

539 Forward CGC GGATCCGCTAGC -GAGGATTTGCAGGAAA 

Reverse CCCG CTCGAG -TACCAATGTCGGCAAATC 

542 Forward AAA GAATTC -ATGCCGTCTGAAACCGTGTC 
Reverse AAA CTGCAG -TTACCGCGAACCGGTCAGGAT 

543 Forward AAAAAA GAATTC -GCCTTCGATGGCGACGTTGTAGGTAC 
Reverse AAAAAA TCTAGA - 

TTAATGAAGAAGAACATATTGGAATTTTGG 
543a Forward AAAAAA GAATTC -GGCAAAACTCGTCATGAATTTGC 
Reverse AAAAAA TCTAGA - 

TTAATGAAGAAGAACATATTGGAATTTTGG 

544 Forward AAA GAATTC -GCGCCCGCCTTCTCCCTGCCCGACCTGCACGG 
Reverse AAACTGCAG-CTATTGCGCCACGCGCGTATCGAT 

544a Forward AAAAAA GAATTC - 

GCAAATGACTATAAAAACAAAAACTTCCAAGTACTTGC 
Reverse AAA CTGCAG -CTATTGCGCCACGCGCGTATCGAT 

547 Forward AAA GAATT C-ATGTTCGTAGATAACGGATTTAATAAAAC 
Reverse AAA CTGCAG -TTAACAACAAAAAACAAACCGCTT 

548 Forward AAAGAATTC-GCCTGCAAACCTCAAGACAACAGTGCGGC 



Xhol 

BamHI- 

Ndel 

Xhol 

Eco RJ 
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Ndel 
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Reverse 


AAACTGCAG-TCAGAGCAGGGTCCTTACATCGGC 


Pstl 


550 


Forward 


AAAAAAGTCGAC- 


Sail 






ATGATAACGGACAGGTTTCATCTCTTTCATTTTCC 






Reverse 


AAACTGCAG-TTACGCAAACGCTGCAAAATCCCC 


Pst I 


550a 


Forward 


AAAAAAGAATTC-GTAAATCACGCCTTTGGAGTCGCAAACGG 


Eco RJ 




Reverse 


AAACTGCAG-TTACGCAAACGCTGCAAAATCCCC 


Pstl 


552 


Forward 


AAAAAAGAATTC-TTGGCGCGTTGGCTGGATAC 


EcoRJ 




Reverse 


AAACTGCAG-TTATTTCTGATGCCTTTTCCCAAC 


Pstl 


554 


Forward 


CGCGGATCCCATATG-TCGCCCGCGCCCAAC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-CTGCCCTGTCAGACAC 


Xhol 


556 


Forward 


AAAGAATTC-GCGGGCGGTTTTGTTTGGACATCCCG 


Eco RI 




Reverse 


AAACTGCAG-TTAACGGTGCGGACGTTTCTGACC 


Pstl 


557 


Forward 


CGCGGATCCCATATG-TGCGGTTTCCACCTGAA 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TTCCGCCTTCAGAAAGG 


Xhol 


558 


Forward 


AAAGAATTC-GAGCTTTATATGTTTCAACAGGGGACGGC 


Eco RI 




Reverse 


AAACTGCAG-CTAAACAATGCCGTCTGAAAGTGGAGA 


Pstl 


558a 


Forward 


AAAAAAGAATTC-ATTAGATTCTATCGCCATAAACAGACGGG 


Eco RI 




Reverse 


AAAAAACTGCAG-CTAAACAATGCCGTCTGAAAGTGGAGA 


Pstl 


560 


Forward 


AAAAAAGAATTC- 


Eco RI 






TCGCCTTTCCGGGACGGGGCGCACAAGATGGC 






Reverse 


AAAAAACTGCAG-TCATGCGGTTTCAGACGGCATTTTGGC 


Pstl 


561 


Forward 


CCGGAATTCTACATATG-ATACTGCCAGCCCGT 


EcoRI- 








Ndel 




Reverse 


CCCGCTCGAG-TTTCAAGCTTTCTTCAGATG 


Xhol 


562 


Forward 


CGCGGATCCCATATG-GCAAGCCCGTCGAG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-AGACCAACTCCAACTCGT 


Xhol 


565 


Forward 


CGCGGATCCCATATG-AAGTCGAGCGCGAAATAC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-GGCATTGATCGGCGGC 


Xhol 


566 


Forward 


CGCGGATCCCATATG-GTCGGTGGCGAAGAGG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-CGCATGGGCGAAGTCA 


Xhol 


567 


Forward 


CCGGAATTCTACATATG-AGTGCGAACATCCTTG 


EcoRI- 








Ndel 




Reverse 


CCCGCTCGAG-TTTCCCCGACACCCTCG 


Xhol 


568 


Forward 


CGCGGATCCCATATG-CTCAGGGTCAGACC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-CGGCGCGGCGTTCAG 


Xhol 


569 


Forward 


AAAAAAGAATTC-CTGATTGCCTTGTGGGAATATGCCCG 


Eco RI 




Reverse 


AAAAAACTGCAG-TTATGCATAGACGCTGATAACGGCAAT 


Pstl 


570 


Forward 


CGCGGATCCCATATG-GACACCTTCCAAAAAATCG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-GCGGGCGTTCATTTCTTT 


Xhol 



SUBSTITUTE SHEET (RULE 26) 



WO 99/057280 



PCT/US99/09346 



154 



571 


Forward 


AAAAAAGAATTC- 


Eco RJ 






ATGGGTATTGCCGGCGCCGTAAATGTTTTGAACCC 






Reverse 


AAAAAACTGCAG-TTATGGCCGACGCGCGGCTACCTGACG 


Pst I 


572 


Forward 


CGCGGATCCCATATG-GCGCAAAAAGGCAAAACC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-GCGCAGTGTGCCGATA 


Xhol 


573 


Forward 


CGCGGATCCCATATG-CCCTGTTTGTGCCG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-GACGGTGTCATTTCGCC 


Xhol 


574 


Forward 


CGCGGATCCCATATG-TGGTTTGCCGCCCGC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-AACTTCGATTTTATTCGGG 


Xhol 


575 


Forward 


CGCGGATCCCATATG-GTTTCGGGCGAGG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-CATTCCGAATCTGAACAG 


Xhol 


576 


Forward 


CGCGGATCCCATATG-GCCGCCCCCGCATCT 


BarnHI- 








Ndel 




Reverse 


CCCGCTCGAG-ATTTACTTTTTTGATGTCGAC 


Xhol 


577 


Forward 


CGCGGATCCCATATG-GAAAGGAACGGTGTATTT 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-AGGCTGTTTGGTAGATTCG 


Xhol 


578 


Forward 


CGCGGATCCCATATG-AGAAGGTTCGTACAG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-GCCAACGCCTCCACG 


Xhol 


579 


Forward 


CGCGGATCCCATATG-AGATTGGGCGTTTCCAC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-AGAATTGATGATGTGTATGT 


Xhol 


580 


Forward 


CGCGGATCCCATATG-AGGCAGACTTCGCCGA 


BamHl- 








Ndel 




Reverse 


CCCGCTCGAG-CACTTCCCCCGAAGTG 


Xhol 


581 


Forward 


CGCGGATCCCATATG-CACTTCGCCCAGC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-CGCCGTTTGGCTTTGG 


Xhol 


582 


Forward 


AAAAAAGAATTC-TTTGGAGAGACCGCGCTGCAATGCGC 


EcoRJ 




Reverse 


AAAAAATCTAGA-TCAGATGCCGTCCCAGTCGTTGAA 


Xbal 


583 


Forward 


AAAAAAGAATTC-ACTGCCGGCAATCGACTGCATAATCG 


Eco RI 




Reverse 


AAAAAACTGCAG-TTAACGGAGGTCAATATGATGAAATTG 


Pst I 


584 


Forward 


AAAAAAGAATTC- 


Eco RI 






GCGGCTGAAGCATTGAATTACAATATTGTC 






Reverse 


AAAAAACTGCAG-TCAGAACTGAACCGTCCCATTGACGCT 


Pst I 


585 


Forward 


AAAAAAGGTACC-TCTTTCTGGCTGGTGCAGAACACCCTTGC 


Eco RI 




Reverse 


AAAAj^CTGCAG-TCAGTTCGCACTTTTTTCTGTTTTGGA 


Pst I 


586 


Forward 


CGCGGATCCCATATG-GCAGCCCATCTCG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TTTCAGCGAATCAAGTTTC 


Xhol 


587 


Forward 


CGCGGATCCCATATG-GACCTGCCCTTGACGA 


BamHI- 
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Reverse CCCG CTCGAG -AAATGTATGCTGTACGCC 

588 Forward AAAAAA GAATTC -GCCGTCCTGACTTCCTATCAAGAACCAGG 
Reverse AAAAAA CTGCAG -TTATTTGTTTTTGGGCAGTTTCACTTC 

589 Forward AAAAAA GAATTC - 

ATGCAACAAAAAATCCGTTTCCAAATCGAAGG 
Reverse AAAAAA CTGCAG -CTAATCGATTTTTACCCGTTTCAGGCG 

590 Forward AAAAAA GAA7TC -ATGAAAAAACCTTTGATTTCAGTTGCGGC 
Reverse AAAAAACTGCAG-TTACTGCTGCGGCTCTGAAACCAT 

591 Forward AAAAAA GAATTC -CACTACATCGTTGCCAGATTGTGCGG 
Reverse AAAAAA CTGCAG -CTAACCGAGCAGCCGGGTAACGTCGTT 

592a Forward AAAAAA GAATTC -CGCGATTACACCGCCAAGCTGAAAATGGG 

Reverse AAAAAA CTGCAG -TTACCAAACGTCGGATTTGATACG 
593 Forward CGC GGATCCGCTAGC -CTTGAACTGAACGGACTC 

Reverse CCCG CTCGAG -GCGGAAGCGGACGATT 
594a Forward AAAAAA GAATTC -GGTAAGTTCGCCGTTCAGGCCTTTCA 
Reverse AAAAAA CTGCAG -TTACGCCGCCGTTTCCTGACACTCGCG 

595 Forward AAAAAA GAATTC -TGCCAGCCGCCGGAGGCGGAGAAAGC 
Reverse AAAAAA CTGCAG -TTATTTCAAGCCGAGTATGCCGCG 

596 Forward CGC GGATCCCATATG -TCCCAACAATACGTC 

Reverse CCCGCTCGAG-ACGCGTTACCGGTTTGT 

597 Forward CGC GGATCCCATATG -CTGCTTCATGTCAGC 

Reverse GCCCAAGCTT-ACGTATCCAGCTCGAAG 

601 Forward CGC GGATCCCATATG -ATATGTTCCCAACCGGCAAT 

Reverse CCCG CTCGAG -AAAACAATCCTCAGGCAC 

602 Forward CGC GGATCCGCTAGC -TTGCTCCATCAATGC 

Reverse CCCG CTCGAG -ATGCAGCTGCTAAAAGCG 

603 Forward AAAAAA GAATTC -CTGTCCTCGCGTAGGCGGGGACGGGG 
Reverse AAAAAA CTGCAG -CTACAAGATGCCGGCAAGTTCGGC 

604 Forward CGC GGATCCGCTAGC -CCCGAAGCGCACTT 

Reverse CCCGCTCGAG-GACGGCATCTGCACGG 
606a Forward AAAAAA GAATTC -CGCGAATACCGCGCCGATGCGGGCGC 
Reverse AAAAAA CTGCAG -TTAAAGCGATTTGAGGCGGGCGATACG 

607 Forward AAAAAA GAATTC -ATGCTGCTCGACCTCAACCGCTTTTC 
Reverse AAAAAA CTGCAG -TCAGACGGCCTTATGCGATCTGAC 

608 Forward AAAAAA GAATTC -ATGTCCGCCCTCCTCCCCATCATCAACCG 
Reverse AAAAAA CTGCAG -TTAGTCTATCCAAATGTCGCGTTC 

609 Forward CGCGGATCCCATATG-GTTGTGGATAGACTCG 



Ndel 
Xhol 
Eco RJ 
Pst 1 
Eco RJ 

Pst I 

Eco RJ 

Pst I 

Eco RJ 

Pst I 

Eco RJ 

Pst I 

BamHI- 

Nhel 

Xhol 

Eco RJ 

Pst I 

Eco RJ 

Pst I 

BamHI- 

NdeJ 

Xhol 

BamHI- 

Ndel 

Hindlll 

BamHI- 

Ndel 

Xhol 

BamHI- 

Nhel 

Xhol 

Eco RI 

Pst I 

BamHI- 

Nhel 

Xhol 

Eco RJ 

Pst I 

Eco RJ 

Pst I 
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Pst I 

BamHJ- 

Ndel 
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Reverse CCC GCTCGAG -CTGGATTATGATGTCTGTC 

610 Forward CGC GGATCCCATATG -ATTGGAGGGCTTATGCA 

Reverse CCCG CTCGAG -ACGCTTCAACATCTTTGCC 

611 Forward CGC GGATCCCATATG -CCGTCTCAAAACGGG 

Reverse CCCG CTCGAG -AACGACTTTGAACGCGCAA 

613 Forward CGC GGATCCCATATG -TCGCGTTCGAGCCG3 

Reverse CCCG CTCGAG -AGCCTGTAAAATAAGCGGC 

614 Forward CGC GGATCCCATATG -TCCGTCGTGAGCGGC 

Reverse CCCG CTCGAG -CCATACTGCGGCGTTC 
616 Forward AAAAAA GAATTC -ATGTCAAACACAATCAAAATGGTTGTCGG 

Reverse AAAAAA TCTAGA -TTAGTCCGGGCGGCAGGCAGCTCG 
619a Forward AAAAAA GAATTC -GGGCTTCTCGCCGCCTCGCTTGC 

Reverse AAAAAA CTGCAG -TCATTTTTTGTGTTTTAAAACGAGATA 
622 Forward CGC GGATCCCATATG -GCCGCCCTGCCTAAAG 

Reverse CCCG CTCGAG -TTTGTCCAAATGATAAATCTG 

624 Forward CGC GGATCCCATATG -TCCCCGCGCTTTTACCG 

Reverse CCCG CTCGAG -AGATTCGGGCCTGCGC 

625 Forward CGC GGATCCCATATG -TTTGCAACCAGGAAAATG 

Reverse CCCG CTCGAG -CGGCAAAATTACCGCCTT 
627a Forward AAAAAA GAATTC -AAAGCAGGCGAGGCAGGCGCGCTGGG 

Reverse AAAAA ACTGCAG - 

TTACGAATGAAACAGGGTACCCGTCATCAAGGC 
628 Forward AAAAAA GGTACC -GCCTTACAAACATGGATTTTGCGTTC 

Reverse AAAAAA CTGCAG -CTACGCACCTGAAGCGCTGGCAAA 
629a Forward AAAAAA GAATTC -GCCACCTTTATCGCGTATGAAAACGA 

Reverse AAAAAA CTGCAG -TTACAACACCGCCGTCCGGTTCAAACC 
630a Forward AAAAAA GAATTC -GCGGCTTTGGGTATTTCTTTCGG 

Reverse AAAAAA CTGCAG -TTAGGAGACTTCGCCAATGGAGCCGGG 
635 Forward AAAAAA GAATTC - 

ATGACCCAGCGACGGGTCGGCAAGCAAAACCG 

Reverse AAAAAA CTGCAG -TTAATCCACTATAATCCTGTTGCT 

638 Forward AAAAAA GAATTC -ATGATTGGCGAAAAGTTTATCGTAGTTGG 
Reverse AAAAAA CTGCAG -TCACGAACCGATTATGCTGATCGG 

639 Forward CGC GGATCCCATATG -ATGCTTTATTTTGTTCG 

Reverse CCCG CTCGAG -ATCGCGGCTGCCGAC 
642 Forward CGC GGATCCCATATG -CGGTATCCGCCGCAAT 

Reverse CCCGCTCGAG-AGGATTGCGGGGCATTA 



Xhol 

BamHI- 

Ndel 

Xhol 

BamHl- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

Eco RI 

Xbal 

Eco RI 

PstI 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

EcoRI 

PstI 

Kpnl 

PstI 

Eco RI 

PstI 

EcoRI 

PstI 

EcoRI 

PstI 

Eco RI 

PstI 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 
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643 Forward CGC GGATCCCATATG -GCTTCGCCGTCGGCAG 

Reverse CCCGCTCGAG-AACCGAAAAACAGACCGC 

644 Forward AAAAAA GAATTC - 

ATGCCGTCTGAAAGGTCGGCGGATTGTTGCCC 
Reverse AAAAAA TCTAGA -CTACCCGCAATATCGGCAGTCCAATAT 

645 Forward AAAAAA GAATTC -GTGGAACAGAGCAACACGTTAAATCG 
Reverse AAAAAA CTGCAG -CTACGAGGAAACCGAAGACCAGGCCGC 

647 Forward AAAAAA GAATTC -ATGCAAAGGCTCGCCGCAGACGG 
Reverse AAAAAA CTGCAG -TTAGATTATCAGGGATATCCGGTAGAA 

648 Forward AAAAAAGAATTC- 

ATGAACAGGCGCGACGCGCGGATCGAACG 
Reverse AAAAAA CTGCAG -TCAAGCTGTGTGCTGATTGAATGCGAC 

649 Forward AAAAAA GAATTC -GGTACGTCAGAACCCGCCCACCG 
Reverse AAAAAA CTGCAG -TTAACGGCGGAAACTGCCGCCGTC 

650 Forward AAAAAA GAATTC -ATGTCCAAACTCAAAACCATCGC 
Reverse AAAAAA CTGCAG -TCAGACGGCATGGCGGTCTGTTTT 

652 Forward AAAAAA GGTACC - 

GCTGCCGAAGACTCAGGCCTGCCGCTTTACCG 
Reverse AAAAAA CTGCAG -TTATTTGCCCAGTTGGTAGAATGCGGC 

653 Forward AAAAAA GAATTC -GCGGCTTTGCCGGTAATTTTCATCGG 
Reverse AAAAAACTGCAG-CTATGCCGGTCTGGTTGCCGGCGGCGA 

656a Forward AAAAAA GAATTC -CGGCCGACGTCGTTGCGTCCTAAGTC 
Reverse AAAAAA CTGCAG -CTACGATTTCGGCGATTTCCACATCGT 

657 Forward AAAAAA GAATTC -GCAGAATTTGCCGACCGCCATTTGTGCGC 
Reverse AAAAAA CTGCAG -TTATAGGGACTGATGCAGTTTTTTTGC 

658 Forward CGC GGATCCCATATG -GTGTCCGGAATTGTG 

Reverse CCCG CTCGAG -GGCAGAATGTTTACCGTT 
661 Forward AAAAAAGAATTC- 



Reverse 



ATGCACATCGGCGGCTATTTTATCGACAACCC 
AAAAAACTGCAG-TCACGACGTGTCTGTTCGCCGTCGGGC 



663 Forward CGC GGATCCCATATG -TGTATCGAGATGAAATT 

Reverse CCCGCTCGAG-GTAAAAATCGGGGCTGC 

664 Forward CGC GGATCCCATATG -GCGGCTGGCGCGGT 

Reverse CCCG CTCGAG -AAATCGAGTTTTACACCAC 

665 Forward AAAAAA GAATTC -ATGAAATGGGACGAAACGCGCTTCGG 
Reverse AAAAAA CTGCAG -TCAATCCAAAATTTTGCCGACGATTTC 

666 Forward AAAAAA GAATTC -AACTCAGGCGAAGGAGTGCTTGTGGC 
Reverse AAAAAA TCTAGA -TCAGTTTAGGGATAGCAGGCGTAC 

667 Forward AAAAAAGAATTC- 

CCGCATCCGTTTGATTTCCATTTCGTATTCGTCCG 
Reverse AAAAAACTGCAG-TTAATGACACAATAGGCGCAAGTC 



BamHl- 
Ndel 
Xhol 
Eco RI 

PstI 
Eco Rl 
Pstl 
Eco RJ 
Pstl 
Eco RJ 

Pstl 
Eco RI 
Pstl 
Eco RI 
Pst I 
Kpnl 

Pstl 

Eco Rl 

Pstl 

Eco Rl 

Pstl 

Eco RI 

Pstl 

BamHI- 

Ndel 

Xhol 

EcoRI 

Pstl 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

EcoRI 

Pst I 

EcoRI 

Xbal 

EcoRI 

Pst I 
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669 Forward AAAAAA GAATTC -ATGCGCCGCATCATTAAAAAACACCAGCC 
Reverse AAAAAA CTGCAG -TTACAGTATCCGTTTGATGTCGGC 

670a Forward AAAAAA GAATTC -AAAAACGCTTCGGGCGTTTCGTCTTC 
Reverse AAAAAA CTGCAG - 

TTAGGAGCTTTTGGAACGCGTCGGACTGGC 

671 Forward CGC GGATCCCATATG -ACCAGCAGGGTAAC 

Reverse CCCG CTCGAG -AGCAACTATAAAAACGCAAG 

672 Forward CGC GGATCCCATATG -AGGAAAATCCGCACC 

Reverse CCC GCTCGAG -ACGGGATAGGCGGTTG 

673 Forward AAAAAA GAATTC -ATGGATATTGAAACCTTCCTTGCAGG 
Reverse AAAAAA CTGCAG -CTACAAACCCAGCTCGCGCAGGAA 

674 Forward AAAAAA GAATTC -ATGAAAACAGCCCGCCGCCGTTCCCG 
Reverse AAAAA ACTGCAG -TCAACGGCGTTTGGGCTCGTCGGG 

675 Forward CGC GGATCCCATATG -AACACCATCGCCCC 

Reverse CCCG CTCGAG -TTCTTCGTCTTCAAACTGT 
677a Forward AAAAAA GAATTC -AGACGGCATTCCCGATCAGTCGATTTTGA 
Reverse AAAAAACTGCAG-TTACGTATGCGCGAAATCGACCGCCGC 

680 Forward CGC GGATCCGCTAGC -ACGAAGGGCAGTTCGG 

Reverse CCCG CTCGAG -CATCAAAAACCTGCCGC 

681 Forward AAAAAA GAATTC -ATGACGACGCCGATGGCAATCAGTGC 
Reverse AAAAAA CTGCAG -TTACCGTCTTCCGCAAAAAACAGC 

683 Forward CGC GGATCCCATATG -TGCAGCACACCGGACAA 

Reverse CCC GCTCG AG -GAG 1 1T1 T1TCCGCATACG 

684 Forward CGC GGATCCCATATG -TGCGGTACTGTGCAAAG 

Reverse CCC GCTCGAG -CTCGACCATCTGTTGCG 

685 Forward CGC GGATCCCATATG -TGTTTGCTTAATAATAAACATT 

Reverse CCC GCTCGAG -CTTTTICCCCGCCGCA 

686 Forward CGC GGATCCCATATG -TGCGGCGGTTCGGAAG 

Reverse CCCG CTCGAG -CATTCCGATTCTGATGAAG 

687 Forward CGC GGATCCCATATG -TGCGACAGCAAAGTCCA 

Reverse CCCG CTCGAG -CTGCGCGGCTTTTTGTT 

690 Forward CGC GGATCCCATATG -TGTTCTCCGAGCAAAGAC 

Reverse CCCG CTCGAG -TATTCGCCCCGTGTTTGG 

691 Forward CGC GGATCCCATATG -GCCACGGCTTATATCCC 

Reverse CCCGCTCGAG-TTTGAGGCAGGAAGAAAG 



Eco RI 
Pst I 
Eco RI 
Pst I 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

Eco RI 

Pst I 

Eco RI 

Pst I 

BamHI- 

Ndel 

Xhol 

Eco RI 

Pst I 

BamHI- 

Nhel 

Xhol 

Eco RI 

Pst I 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

Bamffl- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 
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694 Forward CGC GGATCCCATATG -TTGGTTTCCGCATCCGG 

Reverse CCCG CTCGAG -TCTGCGTCGGTGCGGT 

695 Forward CGC GGATCCCATATG -TTGCCTCAAACTCGTCCG 

Reverse CCCG CTCGAG -TCGTTTGCGCACGGCT 

696 Forward CGC GGATCCCATATG -TTGGGTTGCCGGCAGG 

Reverse CCCG CTCGAG -TTGATTGCCGCAATGATG 
700a Forward AAAAAA GAATTC -GCATCGACAGACGGTGTGTCGTGGAC 
Reverse AAAAAACTGCAG-TTACGCTACCGGCACGACTTCCAAACC 

701 Forward CGC GGATCCCATATG -AAGACTTGTTTGGATACTTC 

Reverse CCCG CTCGAG -TGCCGACAACAGCCTC 

702 Forward AAAAAA GAATTC -ATGCCGTGTTCCAAAGCCAGTTGGATTTC 
Reverse AAAAAA CTGCAG -TTAACCCCATTCCACCCGGAGAACCGA 

703 Forward CGC GGATCCGCTAGC -CAAACGCTGGCAACCG 

Reverse CCCG CTCGAG -TTTTGCAGGTTTGATGTTTG 
704a Forward AAAAAAGAATTC-GCTTCTACCGGTACGCTGGCGCG 
Reverse AAAAAA CTGCAG - 

TTAGTTTTGCCGGATAATATGGCGGGTGCG 

707 Forward CGC GGATCCGCTAGC -GAAATTATTAACGATGCAGA 

Reverse CCCG CTCGAG -GAAACTGTAATTCAAGTTGA 

708 Forward CGC GGATCCGCTAGC -CCTTTTAAGCCATCCAAAA 

Reverse CCCG CTCGAG -TTGACCGGTGAGGACG 
710 Forward CGC GGATCCCATATG -GAAACCCACGAAAAAATC 

Reverse CCCG CTCGAG -AACGGTTTCGGTCAG 
714 Forward CGC GGATCCCATATG -AGCTATCAAGACATCTT 

Reverse CCCG CTCGAG -GCGGTAGGTAAATCGGAT 
716 Forward CG CGGATCCCATATG -GCCAACAAACCGGCAAG 

Reverse CCCGCTCGAG-TTTAGAACCGCATTTGCC 
718 Forward CGC GGATCCCATATG -G AGCCG ATAATGGCAA A 

Reverse CCCGCTCGAG-GGCGCGGGCATGGTCTTGTCC 
720 Forward CGC GGATCCCATATG -AGCGGATGGCATACC 

Reverse CCCGCTCGAG-TTTTGCATAGCTGTTGACCA 
723 Forward CGC GGATCCCATATG -CGACCCAAGCCCC 

Reverse CCCGCTCGAG-AATGCGAATCCGCCGCC 



BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

BamHl- 

Ndel 

Xhol 

Eco Rl 

Pst I 

BamHI- 

Ndel 

Xhol 

Eco RI 

Pst I 

BamHI- 

Nhel 

Xhol 

Eco RI 

Pst I 

BamHI- 

Nhel 

Xhol 

BamHI- 

Nhel 

Xhol 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 

BamHI- 

Ndel 

Xhol 
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725 


Forward 

a \j\ w sax v* 


CGCGGATCCCATATG-GTGCGCACGGTTAAA 


BamHI- 

U til* XJL A A 








Ndel 




Reverse 


CCCGCTCGAG-TTGCTTATCCTTAAGGGTTA 


Xhol 


726 


Forward 

1 VII IT Hi VI 


CGCGGATCCCATATG-ACCATCTATTTCAAAAAC 

V-y VJ V^y VJ V J 7 L J ^— <^ V,*" L ^ X * A J * V-*' V-* / V A V-/ A X L X A A V* jjW u fc V-* 


BamHI- 

AJ CXI 1 LA A A 








Ndel 




Reverse 


CCCGCTCGAG-GCCGATGTTTAGCGTCC 


Xhol 


728 


Forward 


CGCGGATCCCATATG-TTTTGGCTGGGAACGGG 

\^V,l\/\J V_J j» A V^ V*-' V-** J m. A J m. A VJ A A A A VJ VJ V" A VJ VJ VJ *fUT» V-* VJ VJ VJ 


BamHI- 

U Oiili Jl A 








Ndel 




Reverse 


CCCGCTCGAG-GTGAGAAAGGTCGCGC 


Xhol 


729 


Forward 

1 VJJ TT Hi VI 


CGCGGATCCCATATG-TGCACCATGATTCCCCA 

V< VI V/ VI Vll» * W V^ y ». A ^ *- A VJ A VI V*,/* V< V> a ± Vlil X J. V> V^ V/ V/ii 


BamHI- 

I~J OJ. 1 La a l 








Ndel 




Reverse 


GCCCAAGCTT-TTTGTCGGTTTGGGTATC 


Hindlll 




Forward 


CGCGGATCCGCTAGC-GCCGTGCCGGAGG 

v/vivivivim v* v_j \_/ i rvvjv viv/vvj -i vj v-» v^ vj vj /*■ vj vj 


BamHI- 

AJ CLl 1 LI 11 








Nhel 




Reverse 


CCCGCTCGAG-ACGGGCGCGGCAG 


Xhol 


732 


Forward 

J vM W cU u 


rCGGAATTCTACATATG-TCGAAArrTGTTTTTAAGAA 

V/V>VJ VJ/ui 1 1 A *V V' J* A / V l VJ J v- VJ rwuVVy v^ X vl 11111 l\~r*. VJ ^i-jV 


FroRT- 

I^V/VJIVI ~ 








Ndel 




Reverse 


CCCGCTCGAG-CTTCTTATCTTTTTTATCTTTC 


Xhol 


733 


Foi-u/arH 

I Ul VV CU u 


CGCGG ATCCC AT ATG-GCCTGCGGCGGO A A 

V/VJV' vl VJi* 1 \^V/v/\ 1 i\ 1 VJ \Jvrv> 1 VJ V' VJ VJ V/'VJ VJ v- 


BamHT- 








Ndel 




Reverse 


CCCGCTCGAG-TCGCTTGCCTCCTTTAC 


Xhol 


734 


Fru^w/arH 

I UJ W di VI 


CGCGGATCCC AT ATG-GCCG AT ACTTACGGCT AT 

V- VJ V/ VJ vl/i 1 V- V^ V< 1 ./Y 1 VJ Uv/v-VJA 1 Av J. J AV>\JVJV' 1 ^\ 1 


DoJ 11111— 








Ndel 




Reverse 


CCCGCTCGAG-TTTGAGATTTTGAATCAAAGAG 


Xhol 


735 


Forward 

A V/l VV CU VI 


CGCGGATCCCATATG-AAGCAGCAGGCGGTCA 

v\J\/\j\j/i i v^v-v-i* a j \ a vi fxnvi vnwnvj VJ V/\J VJ i V-* 1 * 


BamHI- 

JLJOIIU A.X 








Ndel 




Reverse 


CCCGCTCGAG-ATTTCCGTAGCCGAGGG 


Xhol 


737 


Forward 

1 VJ 1 W OJ u 


CGCGGATCCCATATG-CACCACGACGGACACG 

V\JV<VI\Jiil V^V*V^^» A ^ 1 A V| V><kV< V*riVVlJiV\JVJiiV-ilVrVJ 


aj al 1 Li aa 








Ndel 




Reverse 


CCCGCTCGAG-GTCGTCGCGGCGGGA 


Xhol 


739 


Forward 


CGCGGATCCCATATG-GCAAAAAAACCGAACA 

V*s VJ w VJ VJ d jV A V** V-** V^* J » A J V A VJ \J V*i l^t \*U\**lm~l 1*-* k_/ jWV*' v\Jifcifc w*» 


RamHT- 








Ndel 




Reverse 


CCCGCTCGAG-GAAGAGTTTGTCGAGAATT 


Xhol 


740 


Forward 


CGCGGATCCCATATG-GCCAATCCGCCCGAAG 


BamHI- 

AJ £U M LA. JtJL 








Ndel 




Reverse 


CCCGCTCGAG-AAACGCGCCAAAATAGTG 


Xhol 


741 


Forward 


CGCGGATCCCATATG-TGCAGCAGCGGAGGG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TTGCTTGGCGGCAAGGC 


Xhol 


743 


Forward 


CGCGGATCCCATATG-GACGGTGTTGTGCCTGTT 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-CTTACGGATCAAATTGACG 


Xhol 


745 


Forward 


CGCGGATCCCATATG-rnTGGCAACTGACCG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-CAAATCAGATGCCTTTAGG 


Xhol 


746 


Forward 


CGCGGATCCCATATG-TCCGAAAACAAACAAAAC 


BamHI- 
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Ndel 




Reverse 


CCCGCTCGAG-TTCATTCGTTACCTGACC 


Xhol 


747 


Forward 


CCGGAATTCTAGCTAGC-CTGACCCCTTGGG 


EcoRl- 








Nhel 




Reverse 


GCCC AAGCTT-TTTTGA 1 1 ' 1 1 AATTGACTATAGAAC 


Hindlll 


749 


Forward 


CGCGGATCCCATATG-TGCCAGCCGCCG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TTTCAAGCCGAGTATGC 


Xhol 


750 


Forward 


CGCGGATCCCATATG-TGTTCGCCCGAACCTG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-CTTTTTCCCCGCCGCAA 


Xhol 


758 


Forward 


CGCGGATCCCATATG-AACAATCTGACCGTGTT 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TGGCTCAATCCTTTCTGC 


Xhol 


759 


Forward 


CGCGGATCCGCTAGC-CGCTTCACACACACCAC 


BamHI- 








Nhel 




Reverse 


CCCGCTCGAG-CC AGTTGTAGCCT A 1"! "1 " J G 


Xhol 


763 


Forward 


CGCGGATCCCATATG-CTGCCTGAAGCATGGCG 


BamHl- 








Ndel 




Reverse 


CCCGCTCGAG-TTCCGCAAATACCGTTTCC 


Xhol 


764 


Forward 


CGCGGATCCCATATG-TTTTTCTCCGCCCTGA 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TCGCTCCCTAAAGCTTTC 


Xhol 


765 


Forward 


CGCGGATCCCATATG-TTAAGATGCCGTCCG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-ACGCCGACG1 1 ITITATTAA 


Xhol 


767 


Forward 


CGCGGATCCCATATG-CTGACGGAAGGGGAAG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TTTCTGTACAGCAGGGG 


Xhol 


768 


Forward 


CGCGGATCCCATATG-GCCCCGCAAAAACCCG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TTTCATCCCTTTTTTGAGC 


Xhol 


770 


Forward 


CGCGGATCCCATATG-TGCGGCAGCGGCGAA 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-GCGTTTGTCGAGATTTTC 


Xhol 


771 


Forward 


CGCGGATCCCATATG-TCCGTATATCGCACCTTC 


BamHl- 








Ndel 




Reverse 


CCCGCTCGAG-CGGTTCTTTAGGTTTGAG 


Xhol 


772 


Forward 


CGCGGATCCCATATG-TTTGCGGCGTTGGTGG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-CAATGCCGACATCAAACG 


Xhol 


774 


Forward 


CGCGGATCCCATATG-TCCGTTTCACCCGTTCC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TCGTTTGCGCACGGCT 


Xhol 


790 


Forward 


CGCGGATCCCATATG-GCAAGAAGGTCAAAAAC 


BamHI- 








Ndel 
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Reverse 


CCCGCTCGAG-GGCGTTGTTCGGATTTCG 




900 


Forward 


CGCGGATCCCATATG-CCGTCTGAAATGCCG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-ATATGGAAAAGTCTGTTGTC 


Xhol 


901 


Forward 


CGCGGATCCCATATG-CCCGATTTTTCGATG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-AAAATGGAACAATACCAGG 


Xhol 


902 


Forward. 


CCGGAATTCTACATATG-TTGCACTTTCAAAGGATAATC 


EcoRI- 




2 




Ndel 




Reverse 


CCCGCTCGAG-AAAAATGTACAATGGCGTAC 


Xhol 


903 


Forward 


CCGGAATTCTAGCTAGC-CAGCGTCAGCAGCACAT 


EcoRJ- 








Nhel 




Reverse 


CCCGCTCGAG-GAAACTGTAATTCAAGTTGAA 


Xhol 


904 


Forward 


AAAAAAGGTACC-ATGATGCAGCACAATCGTTTC 


Kpnl 




Reverse 


AAACTGCAG-TTAATATCGATAGGTTATATG 


PstI 


904a 


Forward 


AAAAAAGAATTC-CGGCTCGGCATTGTGCAGATGTTGCA 


Eco Rl 




Reverse 


AAACTGCAG-TTAATATCGATAGGTTATATG 


PstI 


905 


Forward 


CGCGGATCCCATATG-AACAAAATATACCGCATC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-CCACTGATAACCGACAGAT 


Xhol 


907 


Forward 


CGCGGATCCCATATG-GGCGCGCAACGTGAG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-ACGCCACTGCCAGCG 


Xhol 


908 


Forward 


AAAGAATTC-GCAGAGTTAGTAGGCGTTAATAAAAATAC 


Eco RI 




Reverse 


AAACTGCAG-TTAATATGGTTTTGTCGTTCG 


PstI 


909 


Forward 


CGCGGATCCCATATG-TGCGCGTGGGAAACTTAT 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TCGG 1 1 1 1 GAAACTTTGGTTTT 


Xhol 


910 


Forward 


AAAGAATTC-GCATTTGCCGGCGACTCTGCCGAGCG 


EcoRI 




Reverse 


AAACTGCAG-TCAGCGATCGAGCTGCTCTTT 


PstI 


911 


Forward 


AAAGAATTC-GCTTTCCGCGTGGCCGGCGGTGC 


EcoRI 




Reverse 


AAAAAACTGCAG-GTCGACTTATTCGGCGGCTTTTTCCGC 


PstI 


912 


Forward 


AAAAAAGAATTC- 


Eco RI 






CAAATCCGTCAAAACGCCACTCAAGTATTGAG 






Reverse 


AAAAAACTGCAG-TTACAGTCCGTCCACGCCTTTCGC 


PstI 


913 


Forward 


CGCGGATCCCATATG-GAAACCCGCCCCGC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-AGGTTGTGTTCCAGGTTG 


Xhol 


915 


Forward 


CGCGGATCCCATATG-TGCCGGCAGGCGGAA 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TTTGAAAATATAGGTATCAGG 


Xhol 


914 


Forward 


AAAGAATTC-GACAGAATCGGCGATTTGGAAGCACG 


EcoRI 




Reverse 


AAACTGCAG-CTATATGCGCGGCAGGACGCTCAACGG 


PstI 


916 


Forward 


CGCGGATCCCATATG-GCAATGATGGCGGCTG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TTTGGCGGCATCTTTCAT 


Xhol 
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917 Forward AAAAAA GAATTC -CCTGCCGAAAAACCGGCACCGGC Eco RI 

Reverse AAAAAA CTGCAG -TTATTTCCCCGCCTTCACATCCTG Pst I 

919 Forward CGC GGATCCCATATG -TGCCAAAGCAAGAGCATC BamHI- 

Ndel 

Reverse CCCG CTCGAG -CGGGCGGTATTCGGG Xhol 

920 Forward CGC GGATCCCATATG -CACCGCGTCTGGGTC BamHI- 

Ndel 

Reverse CCCG CTCGAG -ATGGTGCGAATGACCGA Xhol 

921 Forward AAAAAA GAATTC -TTGACGGAAATCCCCGTGAATCC Eco RI 
Reverse AAAAAA CTGCAG -TCATTTCAAGGGCTGCATCTTCAT Pst I 

922 Forward. CGC GGATCCGCTAGC -TGTACGGCGATGGAGGC BamHI- 

2 Nhel 
Reverse CCCGCTCGAG-CAATCCCGGGCCGCC Xhol 

923 Forward CGC GGATCCCATATG -TGTTACGCAATATTGTCCC BamHI- 

Nhel 

Reverse CCCG CTCGAG -GGACAAGGCGACGAAG Xhol 

925 Forward CGC GGATCCCATATG -AAACAAATGCTTTTAGCCG BamHI- 

Ndel 

Reverse CCCG CTCGAG -GCCGTTGCATTTGATTTC Xhol 

926 Forward CGC GGATCCCATATG -TGCGCGCAATTACCTC BamHI- 

Ndel 

Reverse CCC GCTCGAG -TCTCGTGCGCGCCG Xhol 

927 Forward CGC GGATCCCATATG -TGCAGCCCCGCAGC BamHI- 

Ndel 

Reverse CCCG CTCGAG -GTTTTTTGCTGACGTAGT Xhol 
929a Forward AAAAAAGAATTC-CGCGGTTTGCTCAAAACAGGGCTGGG Eco RJ 

Reverse AAAAAATCTAGA-TTAAGAAAGACGGAAACTACTGCC Xba I 

931 Forward AAAAAA GAATTC -GCAACCCATGTTTTGATGGAAAC Eco RI 

Reverse AAAAAA CTGCAG -TTACTGCCCGACAACAACGCGACG Pst I 

935 Forward A AAAAA G AATTC - . Eco Rl 

GCGGATGCGCCCGCGATTTTGGATGACAAGGC 
Reverse AAAAAA CTGCAG -TCAAAACCGCCAATCCGCCGACAC Pst I 

936 Forward CGC GGATCCCATATG -GCCGCCGTCGGCGC BamHI- 

Ndel 

Reverse CCCG CTCGAG -GCGTTGGACGTAGTTTTG Xhol 

937 Forward AAAAAA GAATTC -CCGGTTTACATTCAAACCGGCGCAAC Eco RI 
Reverse AAAAAA CTGCAG -TTAAAATGTATGCTGTACGCCAAA Pst I 

939a Forward AAAAAA GAATTC -GGTTCGGCAGCTGTGATGAAACC Eco RI 

Reverse AAAAAA CTGCAG -TTAACGCAAACCTTGGATAAAGTTGGC Pst I 
950 Forward CGC GGATCCCATATG -GCCAACAAACCGGCAAG BamHI- 

Ndel 

Reverse CCCG CTCGAG -TTTAGAACCGCATTTGCC Xhol 
953 Forward CGC GGATCCCATATG -GCCACCTACAAAGTGGAC BamHI- 

Ndel 

Reverse CCCG CTCGAG -TTGTTTGGCTGCCTCGAT Xhol 
957 Forward CGCGGATCCCATATG-TTTTGGCTGGGAACGGG BamHI- 
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Ndel 




Reverse CCCGCTCGAG-GTGAGAAAGGTCGCGC 


Xhnl 

yVIlUx 


958 


Forward CGCGGATCCCATATG-GCCGATGCCGTTGCG 


oairLtii- 






IN (J CI 




Reverse GCCCAAGCTT-GGGTCGTTTGTTGCGTC 


Hindlll 


959 


Forward CGCGGATCCCATATG-CACCACGACGGACACG 


BamHl- 






Ndel 




Reverse CCCGCTCGAG-GTCGTCGCGGCGGGA 


Xhol 


961 


Forward CGCGGATCCCATATG-GCCACAAGCGACGACG 


R arn UT. 

O £1111X1 J 






Ndel 




Reverse CCCGCTCGAG-CCACTCGTAATTGACGC 


Xhol 


972 


Forward AAAAAAGAATTC- 


J--vU IvX 




TTGACTAACAGGGGGGGAGCGAAATTAAAAAC 






Reverse AAAAAATCTAGA-TTAAAAATAATCATAATCTACATTTTG 


Xba I 


973 


Forward AAAAAAGAATTC-ATGGACGGCGCACAACCGAAAAC 


IZrCO xvl 




Reverse AAAAAACTGCAG-TTACTTCACGCGGGTCGCCATCAGCGT 


Pst I 


982 


Forward CGCGG ATCCC ATATG-GC AGC A A A AG A CCiT A P 


Joanirii- 






Ndel 




Reverse CCCGCTCGAG-CATCATGCCGCCCATCC 


Xhol 


983 


Forward CGCGGATCCCATATG-TTAGCTGTTGCAACA AC AC 


oarririi- 






\TJ 0 T 

JNaei 




Reverse CCCGCTCGAG-GAACCGGTAGCCTACG 


Xhol 


987 


Forward CGCGGATCCCATATG-CCCCCACTGGAAGAAC 


oamrii- 






INaei 




Reverse CCCGCTCGAG-TAATAAACCTTCTATGGGC 


Xhol 


988 


Forward CGCGG ATCCCATATG-TCTTTAAATTTACGGG A A A A AH 


oanirii- 






in del 




Reverse GCCCAAGCTT-TGATTTGCCTTTCCGTTTT 


Hindlll 

X XX11UXXX 


989 


Forward CCGGAATTCTACATATG-GTCCACGrATrmoPTA 


CCOKJ- 






XU *T 

Ndel 




Reverse CCCGCTCGAG-TTTGAATTTGTAGGTGTATTGC 


Xhol 


990 


Forward. CGCGGATCCGCTAGC-TTCAGAGCTCAGCTT 


oanirxJ- 




2 


Nnel 




Reverse CCCGCTCGAG-AAACAGCCATTTGAGCGA 


Xhol 


992 


Forward CGCGGATCCCATATG-GACGCGCCCGCCCG 


DaiTLrii- 






Ndel 




Reverse CCCGCTCGAG-CCAAATGCCCAACCATTC 


Xhol 


993 


Forward CGCGGATCCCATATG-GCAATGCTGATTGAAATCA 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-GAACACATCGCGCCCG 


Xhol 


996 


Forward CGCGGATCCCATATG-TGCGGCAGAAAATCCGC 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-TCTAAACCCCTGTTTTCTC 


Xhol 


997 


Forward CCGGAATTCTAGCTAGC-CGGCACGCCGACGTT 


EcoRI- 




Reverse CCCGCTCGAG-GACGGCATCGCTCAGG 


Nhel 




Xhol 
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Underlined sequences indicate restriction recognition sites. 

The following partial DNA sequence was identified in TV. gonorrhoeae <SEQ ED 1>: 

gOOl . seq 

1 ATGCTGCCGC AGGGGAAGGC GGCGCGGAGG GTGTCGGCGA ACGAGGTGTC 

51 CGGCAGGGCT TGCGCCCGGA TGGTGCTGGT CATCTGCCAG ACGCTGCCGA 

101 AACGCGATAC TTTAAACGGC TCGGGTACGC ATACTTTACC GGTTTGGGCG 

151 ATTTTGCCGA GGTCGTTGCG CAGCAAATCG ACAATCATCA CGTTTTCGGC 

201 GCGGTTTTTC GGGTCGGTTT GTAACTCGGC GGCGCGGCGT TCGTCTTGTC 

2 51 CGTCGCCCAA AATCGGCGCG GTGCCTTTCA TCGGTTCGGT GCTGATGGTG 
301 CCGTCTGAAG CGATGT TGAG GAAGAGTTCG GGCGAGAAAC ACAGCGTCCA 
351 CGCGGATTGC CCGGCTTCAT CGGGCAGGTG GGACAATACG GCATAG 

This corresponds to the amino acid sequence <SEQ ID 2; ORF 001. ng>: 

gOOl .pep 

1 MLPQGKAARR VSANEVSGRA CA RMVLVICQ TLPKRDTLNG SGTHTLPVWA 

51 ILPRSLRSKS TIITFSARFF GSVCNSAARR SSCPSPKIGA VPFIGSVLMV 

101 PSEAMLRKSS GEKHSVHADC PASSGRWDNT A* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3>: 

rnOOl . seq 

1 ATGCTGCCGC AGGGGAAGGC GGCGCGGAGG ATGTCGGCGA ACGAGGTGTG 

51 CGGcAssCTT ss . GCTTGGA yGGTGCTGGT CATCTGCCAA ACGCTGCCGA 

101 AACGCGATAC TTTAAACGGT TCGGGTACGC ATACTGTGCC GGTTTGGGCG 

151 ATTTTGCCGA GATCGTTACG CAGCAAATCG ACAATCATCA CGTTTTCGGC 

201 GCGGTTTTTC GGGTCTGCTT GCAACTCGGC GGCGCGGCGT TCGTCTTGTC 

251 CGTCGCCCAA AATCGGCGCG GTGCCTTTCA TCGGTTCGGT GCTGATGGTG 

3 01 CCGTCCGAAC CGATTTTGAG GAAGAGTTCG GGCGAGAAAC ACAGCGTCCA 
351 CGCGGATTGC CCCTCCGCAT CGGGCAGGTG GGACAAGACG GCATAG 

This corresponds to the amino acid sequence <SEQ ID 4; ORF 001 >: 

mOOl .pep 

1 MLPQGKAARR MSANEVCGXL XA WXVLVICQ TLPKRDTLNG SGTHTVPVWA 

51 ILPRSLRSKS TIITFSARFF GSACNSAARR SSCPSPKIGA VPFIGSVLMV 

101 PSEPILRKSS GEKHSVHADC PSASGRWDKT A* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 5>: 

aOOl . seq 

1 ATGCTGCCGC AGGGGAAGGC GGCGCGGAGG ATGTCGGCGA ACGAGGTGTG 

51 CGGCAAGGCT TGGGCTTGGA TGGTGCTGGT CATCTGCCAA ACGCTGCCGA 

101 AACGCGATAC TTTAAACGGT TCGGGTACGC ATACTGTGCC GGTTTGGGCG 

151 ATTTTGCCGA GGTCGTTACG CAGCAAATCG ACAATCATCA CGTTTTCGGC 

201 GCGGTTTTTC GGGTCTGCTT GCAACTCGGC GGCGCGGCGT TCGTCTTGTC 

2 51 CGTCGCCCAA AATCGGCGCG GTGCCTTTCA TCGGTTCGGT GCTGATGGTG 

301 CCGTCCGAAC CGATTTTGAG GAAGAGTTCG GGCGAGAAAC ACAGCGTCCA 

351 CGCGGATTGC CCTTGTGCAT CGGGCAGGTG GGACAAAACG GCATAG 

This corresponds to the amino acid sequence <SEQ ID 6; ORF 001. a>: 

aOOl -pep 

1 MLPQGKAARR MSANEVCGKA W AWMVLVICQ TLPKRDTLNG SGTHTVPVWA 

51 ILPRSLRSKS TIITFSARFF GSACNSAARR SSCPSPKIGA VPFIGSVLMV 

101 PSEPILRKSS GEKHSVHADC PCASGRWDKT A* 

mOOl/aOOl 96-2% identity over a 131 aa overlap 

10 20 30 40 50 60 

mOOl . pep MLPQGKAARRMSANEVCGXLXAWXVLVICQTLPKRDTLNGSGTHTVPVWAILPRSLRSKS 

I I I M I I I I I I I I I I I I I II I I I I I II II I I I I I I I I I I I I I II I II I I I I 

aOOl . pep MLPQGKAARRMSANEVCGKAWAWMVLVICQTLPKRDTLNGSGTHTVPVWAILPRSLRSKS 
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10 20 30 40 50 60 

70 80 90 100 110 120 

mOOl . pep TIITFSARFFGSACNSAARRSSCPSPKIGAVPFIGSVLMVPSEPILRKSSGEKHSVHADC 
I I I I I I I I I I I I I I I I I I I I M I I I I I I I M I M I M 1 I I I I I I II I M I I I I II I t i I I 
aOOl .pep TIITFSARFFGSACNSAARRSSCPSPKIGAVPFIGSVLMVPSEPILRKSSGEKHSVHADC 

70 80 90 100 110 120 

130 

mOOl . pep PSASGRWDKTAX 
I I I U I I t I I I 
aOOl . pep PCASGRWDKTAX 

130 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 001 shows 89.3% identity over a 131 aa overlap with a predicted ORF (ORF 001. ng) 
from N. gonorrhoeae: 

mOOl/gOOl 

10 20 30 40 50 60 

mOOl . pep MLPQGKAARRMSANEVCGXLXAWXVLVICQTLPKRDTLNGSGTHTVPVWAILPRSLRSKS 
M I II I I ! I I : I I I I I I I II I I I I I M M I I I I I I I I I I : I I I M II I I I I I 1 I 
gOOl MLPQGKAARRVSANEVSGRACARMVLVICQTLPKRDTLNGSGTHTLPVWAILPRSLRSKS 

10 20 30 40 50 60 

70 80 90 100 110 120 

mOOl .pep TIITFSARFFGSACNSAARRSSCPSPKIGAVPFIGSVLMVPSEPILRKSSGEKHSVHADC 
I | | I | | I | I I M : I I I II I 1 I I I I I I I I I I I I I I I I I I I I I I I : M I I I II I I I M I I I 
gOOl TIITFSARFFGSVCNSAARRSSCPSPKIGAVPFIGSVLMVPSEAMLRKSSGEKHSVHADC 

70 80 90 100 110 120 



130 

mOOl .pep PSASGRWDKTAX 
I :: I I I I I : I I I 
gOOl PASSGRWDNTAX 

130 



The following partial DNA sequence was identified in TV. gonorrhoeae <SEQ ED 7>: 

g003 . seq 

1 ATGGTCGTAT TCGTGGCTGA AGGCGTATTC GGTCGCGCTG TTTTGGGTCA 

51 CTTGGTATTG CTCTTCGGTC AGGGTGCGTT TGAGTTCGGC GTCACTCGGT 

101 TTTTTATACG TTGCCGCGTC GAAGCCTTTG CCTTGCGGTG CGGCTTTGGT 

151 TTTGCCCGGC AGCGGTTCGT CGGCTTTGCG GATGTCGATG TGGCAGTAGC 

201 CGTTGGGGTT TTTAATCAGG TAGTCCTGAT GGTATTCCTC GGCGTCGTAG 

251 AAGTTTTTCA GCGGTTCGTT TTCAACAACG AGGGGCAGTT GGTATTTTTG 

301 CTGCTCGCGT TTGAGGGCGG CGGCGATGAC GGCTTTTTCG GCGGGGTCGG 

351 TGTAGTACAC GCCGCTGCGG TATTGCGTGC CGGTGTCGTT ACCCTGTTTG 

4 01 TTGAGGCTGG TCGGATCAAC GACGCGGAAA TAATATTGCA GGATGTCGTC 

4 51 CAGgCTGagt TTGTCGGCAT CGTaggtcac tTTGACGGTC TCGGCATGAC 

501 CCGTATGGCG GTaggacact tctTCgtanc TcGGGtTTTC CGTGttGCCG 

551 TTGGCgttac cGGATACCGC gtcaACCACG CCGTcgatgc gttggaAATa 

601 ggCTTCCAAg ccccaaaagc agccgccggc gaagtaaatg gtgcccgtgt 

651 tcatgattGC TGa 

This corresponds to the amino acid sequence <SEQ ED 8; ORF 003. ng>: 

g003 .pep 

1 MWFVAEGVF GRAVLGHLVL LFGQGAFEFG VTRFFIRCRV EAFALRCGFG 

51 FARQRFVGFA DVDV AVAVGV FNQVVLMVFL GWEVFQRFV FNNEGQLVFL 

101 LLAFEGGGDD GFFGGVGWH AAAVLRAGW TLFVEAGRIN DAEIILQDW 
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151 QAEFVGIVGH FDGLGMTRMA VGHFFVRVFR VAVGVTGYRV NHAVDALEIG 

201 FQAPKAAAGE VNGARVHDC 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 9>: 

m003 . seq 

1 ATGGTCGTAT TCGTGGCTGA AGGCATATTC GGTCGCGCTG TTTTGGGTAA 

51 CTTGsTATTG CTCTTCGGTC AGGGTGCGTT TGAGTTCGGC GTCACTCGGT 

101 TTTTTATACG TTGCCGCGTC GAAGCCTTTG CCTTGCGGGG CGGTCTTGGT 

151 TTTGCCCGGC AGCGGTTCGT CAGCkTTGCG GATGTCGATG TGGCAGTAGC 

201 CGTTGGGGTT TTTAATCAAG TAGTCCTGAT GGTATTCCTC GGCATCGTAG 

251 AAGTTTTtCA GCGGCTCGTT TTCAACAACG AGGGGCAGTT GGTATTTTTG 

301 CTGCTCGCGT TTGAGGGCGk CGGCGATGAC GGCTTTTTCG kCGGGGTCGG 

351 TGTAGTACAC GCCGCTGCGG TATTGCGTAC CGGTGTCGTT GCCCTGTTTG 

4 01 TTGAGGCTGG TCGGATCAAC GACGCGGAAG AAATATTGCA GGATGTCGTC 

4 51 TAGGCTGAGT TTGTCGGCAT CGTAGGTCAC TTTGACGGTT TCGGCGTGGC 

501 CCGTATGGCG GTAGGACACG TCTTCATAGC TCGGATTTTT CGTGTTGCCG 

551 TTGGCGTAGC CGGATACCGC GTCAACCACG CCGTCGATGC GTTGGAAATA 

601 GGCTTCCAAG CCCCAGAAGC AGCg . CCGGC GAGGTAAATG GTGCGCGTGT 

651 TCATGATTTT TGA 

This corresponds to the amino acid sequence <SEQ ED 10; ORF 003>: 

m003.pep Length: 221 

1 MWFVAEGIF GRAVLGNLXL LFGQGAFEFG VTRFFIRCRV EAFALRGGLG 

51 FARQRFVSXA DVDV AVAVGV FNQWLMVFL GIVEVFQRLV FNNEGQLVFL 

101 LLAFEGXGDD GFFXGVG WH AAA V LRT GW ALFV EAGRIN DAEEILQDW 

151 * AEFVGIVGH FDGFGVARMA VGHVFIARIF RVAVGVAGYR VNHAVDALEI 

201 GFQAPEAAXG EVNGARVHDF * 

The following partial DNA sequence was identified in N, meningitidis <SEQ ID 1 1>: 

a003 . seq 

1 ATGGTCGTAT TCGTGGCTGA AGGCATATTC GGTCGCGCTG TTTTGGGTAA 

51 CTTGGTATTG CTCTTCGGTC AGGGTGCGTT TGAGTTCGGC GTCACTCGGT 

101 TTTTTATACG TTGCCGCGTC GAAGCCTTTG CCTTGCGGTG CGGTCTTGGT 

151 TTTGCCCGGC AGCGGTTCGT CGGCTTTGCG GATATCGATG TGGCAGTAGC 

201 CGTTGGGGTT TTTAATCAAG TAGTCCTGAT GGTATTCCTC GGCATCGTAG 

251 AAGTTTTTCA GCGGCTCGTT TTCAACAACG AGGGGCAGTT GGTATTTTTG 

301 CTGCTCGCGT TTGAGGGCGG CGGCGATGAC GGCTTTTTCG GCGGGGTCGG 

351 TGTAGTACAC GCCGCTGCGG TATTGCGTAC CGGTGTCGTT GCCCTGTTTG 

4 01 TTGAGGCTGG TCGGATCAAC GACGCGGAAG AAATATTGCA GGATGTCGTC 

4 51 TAGGCTGAGT TTGTCGGCAT CGTAGGTCAC TTTGACGGTT TCGGCGTGGC 

501 CCGTATGGCG GTAGGACACG TCTTCATAGC TCGGATTTTT CGTGTTGCCG 

551 TTGGCGTAGC CGGATACCGC GTCAACCACG CCGTCGATGC GTTGGAAATA 

601 GGCTTCCAAG CCCCAGAAGC AGCCGCCGGC GAGGTAGATG GTGCGCGTGT 

651 TCATGATTTT TGA 

This corresponds to the amino acid sequence <SEQ ID 12; ORF 003. a>: 

a003 .pep 

1 MWFVAEGIF GRAVLGNLVL LFGQGAFEFG VTRFFIRCRV EAFALRCGLG 

51 FARQRFVGFA DID VAVAVGV FNQWLMVFL GIVEVFQRLV FNNEGQLVFL 

101 LLAFEGGGDD GFFGGVG VVH AAAVLRTGVV ALFV EAGRIN DAEEILQDW 

151 * AEFVGIVGH FDGFGVARMA VGHVFIARIF RVAVGVAGYR VNHAVDALEI 

201 GFQAPEAAAG EVDGARVHDF * 

moo3/aoo3 95.9% identity over a 220 aa overlap 

10 20 30 40 50 60 

m003 . pep MWFVAEGIFGRAVLGNLXLLFGQGAFE FGVTRFFIRCRVEAFALRGGLG FARQRFVSXA 

t I I I I I I I I I I I I I I It I I I I I II I I I I I II I I I I I I I I II I I I I I II I I I I I I I : I 
a003 MWFVAEGIFGRAVLGNLVLLFGQGAFE FGVTRFFIRCRVEAFALRCGLG FARQRFVGFA 

10 20 30 40 50 60 
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70 80 90 100 110 120 

m003 - pep DVDVAVAVGVFNQVVLMVFLGIVEVFQRLVFNNEGQLVFLLLAFEGXGDDGFFXGVGVVH 
I : M M II I I I It t I I I I I I II I I I I M I M I ! I I I I I I I I I I I I I I 1 I I I I MINI 
a 003 DIDVAVAVGVFNQVVLMVFLGIVEVFQRLVFNNEGQLVFLLLAFEGGGDDGFFGGVGVVH 

70 80 90 100 110 120 

130 140 150 160 170 180 

m003.pep AAAVLRTGWALFVEAGRINDAEEILQDWXAEFVGIVGHFDGFGVARMAVGHVFIARI F 

I M I M M I I I i I M I M I I I I I I 1 M I I I I M I I I II I I II I I I I I I I I I I I I I I I I I I 
a 003 AAAVLRTGWALFVEAGRINDAEEILQDWXAEFVGIVGHFDGFGVARMAVGHVFIARIF 

130 140 150 160 170 180 

190 200 210 220 

m003 . pep RVAVGVAGYRVNHAVDALEIGFQAPEAAXGEVNGARVHDFX 

II M I I t I I I I M I I I I I I I I I I I I I I I M I : 1 I I I I M I 
a 0 0 3 RVAVG VAG YR VN HAV DALE I G FQAPEAAAGEVDGARVH D FX 

190 200 210 220 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 003 shows 88.6% identity over a 219 aa overlap with a predicted ORF (ORF 003.ng) 
from TV. gonorrhoeae: 

m003/g003 

10 20 30 40 50 60 

m003 . pep MWFVAEGIFGRAVLGNLXLLFGQGAFEFGVTRFFIRCRVEAFALRGGLGFARQRFVSXA 
I I I I I I I I : 1 I I I I U : I I I M I I I I I M II I I I I I I I I I I I I I I I : I I II M I I : I 
g 003 MVVFVAEGVFGRAVLGHLVLLFGQGAFEFGVTRFFIRCRVEAFALRCGFGFARQRFVGFA 

10 20 30 40 50 60 

70 80 90 100 110 120 

m003 . pep DVDVAVAVGVFNQWLMVFLGIVEVFQRLVFNNEGQLVFLLLAFEGXGDDGFFXGVGWH 
I I I I I I I I I I II I I I I M I I I : I I I I I I : I I I M I I I I I I I I I I I I I I I I I I MINI 
g 003 DVDVAVAVGVFNQVVLMVFLGWEVFQRFVFNNEGQLV FLLLAFEGGGDDGFFGGVGVVH 

70 80 90 100 110 120 

130 140 150 160 170 180 

m003 . pep AAAVLRTGWALFVEAGRINDAEEILQDVVXAEFVGIVGHFDGFGVARMAVGHVFIARIF 
I | I I I I : M I : I I I II I 11 I I I I I I I I I I I I I I M I I I I I I : I M I I I I I : hi 
g 003 AAAVLRAGWTLFVEAGRINDAEIILQDWQAEFVGIVGHFDGLGMTRMAVGHFFV-RVF 

130 140 150 160 170 180 

190 200 210 220 

m003 . pep RVAVGVAGYRVNHAVDALEIGFQAPEAAXGEVNGARVHDFX 

I I I I I I : I I I I M I I II I I I I I I I I : I I I I I I I I I I I I 
g003 RVAVGVTGYRVNHAVDALEIGFQAPKAAAGEVNGARVHDC 

190 200 210 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 13>: 

g0O4 . seq 

1 ATGgtagAAC GGCATATCCA GCATTTGCGG AACGGTCATC TTCATTTGAT 

51 GCGCCCATGC CAACAagtga gccaAAtgtT CGGCGGCAGG GCCTacgatT 

101 TCCGCGCCGA TAAagcggcc gGTGgctTTT tcgGCataca ggcgcaTatg 

151 gCCTTTGTTT ACCAgcatca cgcggctgcg accttgaTTT TTGAACGATA 

2 01 CTTCGCCgaT GACAAATTCG TCGGCTTGGT ATTGCGCGGC AACCTGCGCG 

2 51 TATTTCAAAC CGACAAAGCC GATTTGCgga ctggtaaACA CCACGCCAAT 

301 GGTgctgcgg cGCAAACCGC TGCCGATATt cgGgtagcgg ccccgcgtta 

351 ttgcccggca atcttacctt ggtcggcggc ttcatGCAGC AGGGGCagtt 

4 01 ggttggacgc gtcgcccgca ataAAGATAT GCGGAATgct ggtCTGCATg 

4 51 gtCAGCGGAT CGGCAACGGG tacgccgcgc gcgtctttgT CGATATTGAT 

501 GTTTTCCAAA CCGATATtgT CAACGTTCGG ACGGCgACCT ACGGCTGCCA 
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551 AC AT AT ATT C GGCAACAAAT ACGCCTTTTT CGCCATCCTG CTCCCAATGG 

601 ACTtctACAT TGCCGTCTGC GTCGAGTTTG ACCTCGGTTT TAGCATCCAG 

651 ATGCAGTTTC AATtctTCTC CGAACACGGC TTTCGCCTCG TCTGAAACAA 

7 01 CGGGGTCGGA AATGCCGCCG ATGATTCCGC CCAAACCGAA AATTTCAACT 

7 51 TTCACACCCA AACGGTGCAA TGCCTGA 



This corresponds to the amino acid sequence <SEQ ID 14; ORF 004. ng>: 

g004 .pep 

1 MVERHIQHLR NGHLHLMRPC QQVSQMFGGR AYDFRADKAA GGFFGIQAHM 

51 AFVYQHHAAA TLIFERYFAD DKFVGLVLRG NLRVFQTDKA DLRTGKHHAN 

101 GAAAQTAADI RVAAPRYCPA ILPWSAASCS RGSW LDASPA IKICGMLVCM 

151 VSGSATGTPR ASLSILMFSK PILSTFGRRP TAANIYSATN TPFSPSCSQW 

201 TSTLPSASSL TSVLASRCSF NSSPNTAFAS SETTGSEMPP MIPPKPKIST 

2 51 FTPKRCNA* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 15>: 

m0O4 . seq 

1 ATGGTAGAAC GGCATATCCA GCATTTGCGG AACGGTCATC TTCATTTGAT 

51 GTGCCCAAGC CAACAGGTGC GCCAAATGTT CGGCGGCAGG GCCTACGATT 

101 TCCGCGCCGA TAAAGCGGCC GGTGGCTTTT TCGGCATACA GGCGCATATG 

151 GCCTTTGTTC AC C AG CATC A CGCGGCTGCG GCCTTGGTTT TTGAACGATA 

201 CTTCGCCGAT GACAAATTCG TCGGCTTGGT ATTGCGCGGC AACCTGCGCG 

251 TATTTCAGAC CGACAAAGCC GATTTGCGGA CTGGTAAACA CCACGCCGAT 

301 GGTGCTGCGC CGCAAACCGC CGCCGATATT CGGGTAGCGG CCGCGTTATC 

3 51 GCCGGCAATC TTGCCTTGGT CGGCAGCTTC ATGCAGCAGA GGCAGTTGGT 

4 01 TGGACGCATC GCCTGCGATG AAGATATGCG GAATACTGGT CTGCATGGTC 
4 51 AGCGGGTCGG CAACAGGTAC GCCGCGCGCA TCTTTTTCGA TATTGATATT 
501 TTCCAAACCG ATATTGTCAA CGTTCGGACG GCGGCCCACG GCTGCCAGCA 
551 TATATTCGGC AACAAATACG CCTTTTTCGC CATCCTGCTC CCAATGGACT 
601 TCTACATTGC CGTCTGCATC GAGTTTGACC TCGGTTTTAG CATCCAGATG 
651 CAGTTTCAAT TCTTCGCCGA ACACGGCGTT CGCCTCGTCT GAAACGACGG 
7 01 GGTCGGAAAT GCCGCCGATG ATTCCGCCCA AACCGAAAAT TTCAACTTTC 
7 51 ACGCCCAAAC GGTGCAATGC CTGA 



This corresponds to the amino acid sequence <SEQ ID 16; ORF 004>: 

m004 .pep 

1 MVERHIQHLR NGHLHLMCPS QQVRQMFGGR AYDFRADKAA GGFFGIQAHM 

51 AFVHQHHAAA ALVFERYFAD DKFVGLVLRG NLRVFQTDKA DLRTGKHHAD 

101 GAAPQTAADI RVAAALSPAI LPWSAASCSR GSW LDASPAM KICGILVCMV 

151 SGSATGTPRA SFSILIFSKP ILSTFGRRPT AASIYSATNT PFSPSCSQWT 

201 STLPSASSLT SVLASRCSFN SSPNTAFASS ETTGSEMPPM IPPKPKISTF 

251 TPKRCNA* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 17>: 

a004 . seq 

1 ATGGTAGAAC GGCATATCCA GCATTTGCGG AACGGTCATC TTCATTTGAT 

51 GTGCCCAAGC CAACAGGTGC GCCAAATGTT CGGCGGCCGG ACCTACGATT 

101 TCTGCGCCGA TGAAGCGGCC GGTGGCTTTT TCGGCATACA GGCGCATATG 

151 GCCTTTGTTT ACCAGCATCA CGCGGCTGCG GCCTTGGTTT TTGAACGATA 

201 CTTCGCCGAT GACAAATTCG TCGGCTTGGT ATTGCGCGGC AACCTGCGCG 

251 TATTTCAAAC CGACAAAGCC GATTTGCGGA CTGGTGAACA CTACGCCGAT 

301 GGTGCTGCGG CGCAAACCGC CGCCGATATT CGGGTAGCGG CCGCGTTATC 

351 GCCGGCAATC TTGCCTTGGT CGGCGGCTTC ATGCAGCAGG GGCAGTTGGT 

4 01 TGGACGCGTC GCCCGCAATA AAGATATGCG GAATACTGGT CTGCATAGTC 

4 51 AGCGGATCGG CAACGGGTAC GCCGCGCGCA TCTTTTTCGA TATTGATGTT 

501 TTCCAAACCG ATATTGTCAA CGTTCGGACG GCGGCCTACG GCTGCCAGCA 

551 TATATTCGGC AACAAATACG CCTTTTTCGC CATCCTGCTC CCAATGGACT 

601 TCTACATTGC CGTCTGCGTC GAGTTTGGCC TCGGTTTTAG CATCCAAATG 

651 CAGTTTCAAT TCTTCACCGA ACACGGCTTT CGCCTCGTCT GAAACGACGG 

701 GGTCGGAAAT GCCGCCGATG ATGCCACCCA AACCGAAAAT TTCAACTTTC 

7 51 ACGCCCAAAC GGTGCAATGC CTGA 
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This corresponds to the amino acid sequence <SEQ ED 18; ORF 004. a>: 

a004 .pep 

1 MVERHIQHLR NGHLHLMCPS QQVRQMFGGR TYDFCADEAA GGFFGIQAHM 

51 AFVYQHHAAA ALVFERYFAD DKFVGLVLRG NLRVFQTDKA DLRTGEHYAD 

101 GAAAQTAADI RVAAALSPAI LPWSAASCSR GSWL DASPAI KICGILVCIV 

151 SGSATGTPRA SFSILMFSKP ILSTFGRRPT AASIYSATNT PFSPSCSQWT 

201 STLPSASSLA SVLASKCSFN SSPNTAFASS ETTGSEMPPM MPPKPKISTF 

251 TPKRCNA* 

m004/a004 94.9% identity over a 257 aa overlap 

10 20 30 40 50 60 

m00 4 . pep MVERHIQHLRNGHLHLMCPSQQVRQMFGGRAYDFRADKAAGGFFGIQAHMAFVHQHHAAA 
II I M I M I I I II I I I I I 1 M I i M M M I : II I I I : I I t I I I I 1 11 I I I I I : I I I I I I 
a 0 0 4 MVERH IQHLRNGHLHLMC PS QQVRQM FGGRT YD FCADEAAGG FFG I QAHMAFVYQHHAAA 

10 20 30 40 50 60 

70 80 90 100 110 120 

m004 . pep ALV FERY FADDKFVGLV LRGNLRVFQT DKADLRTGKHHADGAAPQTAADI RVAAALS PAI 

I | 1 I | I | | I I I I I I I I I II I I I I I I I I I 1 I I M I I : I : I I II I I I I M I I I I I I I M I I 
a004 ALVFERYFADDKFVGLVLRGNLRVFQTDKADLRTGEHYADGAAAQTAADI RVAAALS PAI 

70 80 90 100 110 120 

130 140 150 160 170 180 

m004 . pep LPWSAASCSRGSWLDASPAMKI CGI LVCMVSGSATGTPRASFS I LI FSKP ILSTFGRRPT 

I M I I I M I I I I I M I I I !: M i I I M I : I I I I I I I I I I I I II I I : I I I I I I I M I I I I I 
a004 LPWSAASCSRGSWLDASPAIKICG I LVCIVSGSATGTPRASFS I LMFSKP ILSTFGRRPT 

130 140 150 160 170 180 

190 200 210 220 230 240 

m004 . pep AASIYSATNTPFSPSCSQWTSTLPSASSLTSVLASRCSFNSSPNTAFASSETTGSEMPPM 
I I II I !■ I I I I I I I I I I I I I II I I I I M I I : M I I I : I M I I I I I I I I I I I I II I I I I I I I 
a 004 AASIYSATNTPFSPSCSQWTSTLPSASSLASVLASKCSFNSSPNTAFASSETTGSEMPPM 

190 200 210 220 230 240 

250 

mO 0 4 . pep IP PKPKI ST FT PKRCNAX 

: I I I I I I I I I I M M I M 
a004 MP PKPKI ST FT PKRCNAX 

250 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from TV. gonorrhoeae 

ORF 004 shows 93.4% identity over a 258 aa overlap with a predicted ORF (ORF 004 .ng) 
from TV. gonorrhoeae: 

m004/g004 

10 20 30 40 50 60 

m004 . pep MVERHIQHLRNGHLHLMCPSQQVRQMFGGRAYDFRADKAAGGFFGIQAHMAFVHQHHAAA 
II I I I I I I I I I I II M I I III I I II I I I I I I I I I I I I I I I I I M I I I I I I : I I I I I 1 
g 004 MVERH IQHLRNGHLHLMRPCQQVSQMFGGRAYDFRADKAAGG FFG I QAHMAFVYQHHAAA 

10 20 30 40 50 60 

70 80 90 100 110 119 

m004 . pep ALVFERYFADDKFVGLVLRGNLRVFQTDKADLRTGKHHADGAAPQTAADIRVAAA-LSPA 
: I : I! I I I I I I I I I I I I M I I I I I I I 1 ! M I I I I I I I II : I I t MINIMI! II 
g004 TLIFERYFADDKFVGLVLRGNLRVFQTDKADLRTGKHHANGAAAQTAADIRVAAPRYCPA 

70 80 90 100 110 120 



m004 . pep 



120 130 140 150 160 170 179 

I LPWSAASCSRGSWLDASPAMKICG I LVCMVSGSATGTPRASFS I LI FSKPILSTFGRRP 
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I I M I I M I I I 1 M I I I I I I : I I ! I : i I I I I I I I M I I I I I I H l 1 : I I I I I I I I I I I I I 
g 004 ILPWSAASCSRGSWLDASPAIKICGMLVCMVSGSATGTPRASLSILMFSKPILSTFGRRP 

130 140 150 160 170 180 

180 190 200 210 220 230 239 

m004 .pep TAASIYSATNTPFSPSCSQWTSTLPSASSLTSVLASRCSFNSSPNTAFASSETTGSEMPP 

I I I : I I t I t I I I I I I I I I ! I I I I I I 1 I I I I I I I M I i M i I I 1 I I I I I I t i I I I I 

g 004 TAANIYSATNTPFSPSCSQWTSTLPSASSLTSVLASRCSFNSSPNTAFASSETTGSEMPP 

190 200 210 220 230 240 

240 250 
m0O4 .pep MI PPKPKI ST FT PKRCNAX 

I I I I t I I I M I I I t I I I ! 
g004 MI PPKPKISTFTPKRCNA 

250 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 19>: 

g005 . seq 

1 ATGGGGATGG AC AAT AT T G A TATGTTCATG CCTGAACAAG AGGAAATCCA 

51 ATCAATGTGG AAAGAAATTT TACTGAATTA CGGTATTTTC CTGCTCGAAC 

101 TGCTTACCGT GTTCGGCGCA ATTGCGCTGA TTGTGTTGGC TATCGTACAG 

151 AGTAAGAAAC AGTCGGAAAG CGGCAGTGTC GTACTGACAG ATTTTTCGGA 

201 AAATTATAAA AAACAGCGGC AATCGTTTGA AACATTCTTT TTAAGCGAGG 

2 51 AAGAGACAAA ACATCAGGAA AAAAAAGAAA AGAAAAAGGA AAAGGCGGAA 

301 G C CAAAGC AG AGAAAAAGCG TTTGAAGGAG GGCGGGGAGA AATCTGCCGA 

351 AACGCAAAAA TCCCGCCTTT TTGTGTTGGA TTTTGACGGC GATTTGTATG 

4 01 CACACGCCGT AGAATCCTTG CGTCATGAGA TTACGGCGGT GCTTTTGATT 

4 51 GCCAAGCCTG AAGATGAGGT TCTGCTCAGA TTGGAAAGTC CGGGCGGCGT 

501 GGTTCACGGT TACGGTTTGG CGGCTTCGCA GCTTAGGCGT TTGCGCGAAC 

551 GCAATATTCC GCTGAccgtc gccgTCGATA AGGTCGCGGC AAGCGgcggc 

601 tatatgatgg cgtgtgtgGC GG AT AAAATT GTTTCCGCtc cgtttgcggt 

651 catcggttcg gtgggtgtgg tgGcggaagt gcCGAATATC CAccgCctGT 

7 01 TGAAAAAACA TGATATTGAT GTGGATGTGA TGACGGCGGG CGAATTTAAG 

7 51 CGCACGGTTA CTTTTATGGG TGAAAATACG GAAAAGGGCA AACAGAAATT 
801 CCGGCAGGAA CTGGAGGAAA CGCATCAGTT GTTCAAGCAG TTTGTCAGTG 

8 51 AAAACCGCCC CGGGTTGGAT ATTGAAAAAA TAGCGACGGG CGAGCATT GG 
901 TTCGGCCGGC AGGCGTTGGC GTTGAACTTG ATTGACGAGA TTTCGACCAG 
951 TGATGATTTG TTGTTGAAAG CGTTTGAAAA CAAACAGGtt aTCGAAGTGA 

1001 AATATCAGGA GAAGCGAAGC CTGATCCAGC GCATTGGTTT GCAGGCGGAA 

1051 GCTTCCGTTG AAAAGTTGTT TGCCAAACTT GTCAACCGGC GAGCGGATGT 

1101 GATGTAG 

This corresponds to the amino acid sequence <SEQ ID 20; ORF 005. ng>: 

g005 .pep 

1 MGMDNIDMFM PEQEEIQSMW KEILLNYG IF LLELLTVFGA IALIV LAIVQ 

51 SKKQSESGSV VLTDFSENYK KQRQSFETFF LSEEETKHQE KKEKKKEKAE 

101 AKAEKKRLKE GGEKSAETQK SRLFVLDFDG DLYAHAVESL RHEITAVLLI 

151 AKPEDEVLLR LESPGGWHG YGLAASQLRR LRERNIPLTV AVDKVAASGG 

201 YMMACVADK I VSAPFAVIGS VGWAE VPNI HRLLKKHDID VDVMTAGEFK 

251 RTVT FMGENT EKGKQKFRQE LEETHQLFKQ FVSENRPGLD IEKIATGEHW 

301 FGRQALALNL IDEISTSDDL LLKAFENKQV IEVKYQEKRS LIQRIGLQAE 

351 ASVEKLFAKL VNRRADVM* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 21>: 

m005 . seq 

1 ATGGACAATA TTGACATGTT CATGCCTGAA CAAG AG G AAA TCCAATCAAT 

51 GTGGAAAGAA ATTTTACTGA ATTACGGTAT TTTCCTGCTC GAACTGCTTA 

101 CCGTGTTCGG CGCAATTGCG CTGATTGTGT TGGCTATCGT ACAGAGTAAG 

151 AAACAGTCGG AwAGCGGCAG TGTCGTACTG ACGGATTTTT CGGAAAATTA 

201 TAAAAAACAG CGGCAATCGT TTGAAGCATT CTTTTTAAGC GGGGAAGAGG 

2 51 C AC AAC AT C A GGAAAAAGAG GAAAAGAAAA AGGAAAAGGC GGAAGCCAAA 
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301 GCAGAGAAAA A . CGTTTGAA GGAGGGTGGG GAGAAATCTG CCGAAACGCA 

3 51 nAAATCACGC CTTTTTGTGT TGGANNNNNN NNNNNNNNNN NNNNNNNNNN 

4 01 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
4 51 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNGCGAGCGG CGGTTATATG 
601 ATGGCGTGTG TGGCGGATAA AATTGCTTCC GCTCCGTTTG CGATTGTCGG 
651 TTCGGTGGGT GTGGTGGCGG AAGTACCGAA TATCCACCGC CTGTTGAAAA 
7 01 AAC AT GAT AT TGATGTGGAT GTGATGACGG CGGGCGAATT TAAGCGCACG 

7 51 GTTACTTTTA TGGGTGAAAA TACGGAAAAG GGCAAACAGA AATTCCGACA 

8 01 GGAACTGGAG GAAACGCATC AGTTGTTCAA GCAGTTTGTC AG C GAG AAC C 
851 GCCCTCAATT GGATATTGAG GAAGTGGCAA CGGGCGAGCA TTGGTTCGGT 
901 CGGCAGGCGT TGGCGTTGAA CTTGATTGAC GAGATTTCGA CCAGTGATGA 
951 TTTGTTGTTG AAAGCGTTTG AAAACAAACA GGTTATCGAA GTGAAATATC 

1001 AGGAGAAGCA AAGCCTGATC CAGCGCATTG GTTTGCAGGC GGAAGCTTCT 

1051 GTTGAAAAGT TGTTTGCCAA ACTTGTCAAC CGGCGGGCGG ATGTGATGT A 

1101 G 

This corresponds to the amino acid sequence <SEQ ID 22; ORF 005>: 

m005 . pep 

1 MDNIDMFMPE QEEIQSMWKE ILLNYG IFLL ELLTVFGAIA LIV LAIVQSK 

51 KQSXSGSWL TDFSENYKKQ RQSFEAFFLS GEEAQHQEKE EKKKEKAEAK 

101 AEKXRLKEGG EKSAETXKSR LFVLXXXXXX XXXXXXXXXX XXXXXXXXXX 

151 XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXASGGYM 

201 MACVADK IAS APFAIVGSVG VVAEV PNIHR LLKKHDIDVD VMT AGE FKRT 

251 VTFMGENTEK GKQKFRQELE ETHQLFKQFV SENRPQLDIE EVATGEHWFG 

301 RQALALNLID EISTSDDLLL KAFENKQVIE VKYQEKQSLI QRIGLQAEAS 

351 VEKLFAKLVN RRADVM* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 23>: 

a005 . seq 

1 ATGGACAATA 

51 GTGGAAAGAA 

101 CCGTGTTCGG 

151 AAACAGTCGG 

201 TAAAAAACAG 

2 51 CAAAACATCA 

301 GCAGAGAAAA 

351 AAAATCCCGC 

4 01 CCGTAGAATC 

4 51 CCTGAAGATG 

501 CGGTTACGGT 

551 TTCCGCTGAC 

601 ATGGCGTGTG 

651 TTCGGTGGGT 

7 01 AACATGATAT 

7 51 GTTACTTTTA 

801 GGAACTGGAG 

851 GCCCTCAATT 

901 CGGCAGGCGT 

951 TTTGTTGTTG 

1001 AGGAGAAGCA 

1051 GTTGAAAAGT 

1101 G 

This corresponds to the amino acid sequence <SEQ ID 24; ORF 005. a>: 

aOOS.pep 

1 MDNIDMFMPE QEEIQSMWKE ILLNYG IFLL ELLTVFGAIA LIV LAIVQSK 

51 KQSESGSWL TDFSENYKKQ RQSFEAFFLS GEEAKHQEKE EKKKEKAEAK 

101 AEKKRLKEGG EKSSETQKSR LFVLDFDGDL YAHAVESLRH EITAVLLIAK 

151 PEDEVLLRLE SPGGWHGYG LAASQLRRLR ERNI PLTVAV DKVAASGGYM 

201 MACVADK IVS APFAIVGSVG VVAEV PNIHR LLKKHDIDVD VMTAGE FKRT 

251 VTFMGENTEK GKQKFRQELE ETHQLFKQFV SENRPQLDIE EVATGEHWFG 



TTGACATGTT CATGCCTGAA 
ATTTTACTGA ATTACGGTAT 
CGCAATTGCG CTGATTGTGT 
AAAGCGGCAG TGTCGTACTG 
CGGCAATCGT TTGAAGCATT 
GGAAAAAGAG GAAAAGAAAA 
AGCGTTTGAA GGAGGGTGGG 
CTTTTTGTGT TGGATTTTGA 
CTTGCGTCAT GAGATTACGG 
AGGTTCTGCT TAGATTGGAA 
TTGGCGGCTT CGCAGCTTAG 
CGTCGCCGTC GATAAGGTGG 
TGGCGGATAA AATTGTTTCC 
GTTGTAGCGG AAGTACCGAA 
TGATGTGGAT GTGATGACGG 
TGGGTGAAAA TACGGAAAAG 
GAAACGCATC AGTTGTTCAA 
GGATATTGAG GAAGTGGCAA 
TGGCGTTGAA CTTGATTGAC 
AAAGCGTTTG AAAACAAACA 
AAGCCTGATC CAGCGCATTG 
TGTTTGCCAA ACTTGTCAAC 



CAAGAGGAAA TCCAATCAAT 
TTTCCTGCTC GAACTGCTTA 
TGGCTATCGT ACAGAGTAAG 
ACGGATTTTT CGGAAAATTA 
CTTTTTAAGC GGGGAAGAGG 
AGGAAAAGGC GGAAGCCAAA 
GAGAAATCTT CCGAAACGCA 
CGGCGATTTG TATGCACACG 
CGGTGCTTTT GATTGCCAAG 
AGTCCGGGCG GCGTGGTTCA 
GCGTTTGCGC GAACGCAATA 
CGGCGAGCGG TGGTTATATG 
GCTCCGTTTG CGATTGTCGG 
TATCCACCGC CTGTTGAAAA 
CGGGCGAATT TAAGCGCACG 
GGCAAACAGA AATTCCGACA 
GCAGTTTGTC AGCGAGAACC 
CGGGCGAGCA TTGGTTCGGT 
GAGATTTCGA CCAGTGATGA 
GGTTATCGAA GTGAAATATC 
GTTTGCAGGC GGAAGCTTCT 
CGGCGGGCGG ATGTGATGTA 
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301 RQALALNLID EISTSDDLLL KAFENKQVIE VKYQEKQSLI QRIGLQAEAS 
351 VEKLFAKLVN RRADVM* 

moos/aoos 79.2% identity over a 366 aa overlap 

10 20 30 40 50 60 

MDNIDMFMPEQEEIQSMWKEILLNYGIFLLELLTVFGAIALIVLAIVQSKKQSXSGSWL 
I I M I I I I I I I M I I I I I I I I I I I I I t I I I I I II I It I I I I I I I I 1 II I I I I I MUM 
MDNIDMFMPEQEEIQSMWKEILLNYGIFLLELLTVFGAIALIVLAIVQSKKQSESGSVVL 
10 20 30 40 50 - 60 

70 80 90 100 110 120 

TDFSENYKKQRQSFEAFFLSGEEAQHQEKEEKKKEKAEAKAEKXRLKEGGEKSAETXKSR 
M I I I I I II It I I II I M I I I ! I I : I I I I I I I I I II II I I I I I 111111111:11 I I I 
TDFSENYKKQRQSFEAFFLSGEEAKHQEKEEKKKEKAEAKAEKKRLKEGGEKSSETQKSR 
70 80 90 100 110 120 

130 140 150 160 170 180 

LFVLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 
MM : 

LFVLDFDGDLYAHAVESLRHEITAVLLIAKPEDEVLLRLESPGGWHGYGLAASQLRRLR 
130 140 150 160 170 180 

190 200 210 220 230 240 

XXXXXXXXXXXXXXASGGYMMACVADKIASAPFAIVGSVGVVAEVPNIHRLLKKHDIDVD 
: II I II I II I M II I :\ M II I I I II I II I II M I II II II I I I II I 

ERNIPLTVAVDKVAASGGYMMACVADKIVSAPFAIVGSVGVVAEVPNIHRLLKKHDIDVD 
190 200 210 220 230 240 

250 260 270 280 290 300 

VMTAGEFKRTVT FMGENTEKGKQKFRQELEETHQLFKQFVSENRPQLDIEEVATGEHWFG 
M M II I II I I I II II I I II II II II I M I II II I I I M I II II I II I I I I II I II I I II 
VMTAGEFKRTVT FMGENTEKGKQKFRQELEETHQLFKQFVSENRPQLDIEEVATGEHWFG 
250 260 270 280 290 300 

310 320 330 340 350 360 

RQALALNLI DEI ST SDDLLLKAFENKQVIEVKYQEKQSLI QRIGLQAEAS VEKLFAKLVN 

I M I I I I II II I I I I I I M I II I II II II I II I 11 II II M I I I II I II II M I! I I II I 
RQALALNLI DEI ST SDDLLLKAFENKQVIEVKYQEKQSLI QRIGLQAEAS VEKLFAKLVN 
310 320 330 340 350 360 



RRADVMX 

I I II I M 
RRADVMX 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from TV. gonorrhoeae 

ORF 005 shows 77.0% identity over a 366 aa overlap with a predicted ORF (ORF 005. ng) 
from N. gonorrhoeae: 

m005/g005 

10 20 30 40 50 

m005 . pep MDNIDMFMPEQEEIQSMWKEILLNYGIFLLELLTVFGAIALIVLAIVQSKKQSXSGSV 
I II I I I M II II I I I II M I I II M II I II II I II II I II I I I I I I II II I I! II I I 
g 005 MGMDNIDMFMPEQEEIQSMWKEILLNYGIFLLELLTVFGAIALIVLAIVQSKKQSESGSV 

10 20 30 40 50 60 

60 70 80 90 100 110 

m005 . pep VLTDFSENYKKQRQSFEAFFLSGEEAQHQEKEEKKKEKAEAKAEKKRLKEGGEKSAETXK 

II I II I I II I II I I II I :\ I M I I : M I II M II I II I II I II I II II II I M II I I I 
g005 VLTDFSENYKKQRQSFETFFLSEEETKHQEKKEKKKEKAEAKAEKKRLKEGGEKSAETQK 

70 80 90 100 110 120 



m005 . pep 
a005 

m005 . pep 
a005 

m005 .pep 
a005 

m005 . pep 
a005 

m005 . pep 
a005 

m005 -pep 
a005 

mOOS .pep 
a005 
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120 130 140 150 160 170 

mOOS . pep SRLFVLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 

MINI : 
g 005 SRLFVLDFDGDLYAHAVESLRHEITAVLLIAKPEDEVLLRLESPGGWHGYGLAASQLRR 

130 140 150 160 170 180 



180 190 200 210 220 230 

m005 . pep XXXXXXXXXXXXXXXXASGGYMMACVADKIASAPFAIVGSVGWAEVPNIHRLLKKHDID 

: M I I I f I I 1 I I I I I : I I I I I :: I I M 1 I I I I M ! I 1 I I I 1 M I I 

g005 LRERNIPLTVAVDKVAASGGYMMACVADKIVSAPFAVIGSVGWAEVPNIHRLLKKHDID 

190 200 210 220 230 240 



240 250 260 270 280 290 

m005 . pep VDVMTAGEFKRTVTFMGENTEKGKQKFRQELEETHQLFKQFVSENRPQLDIEEVATGEHW 
I I I I 1 I I I I I I M M I I I I I I I I I I M I I I I I 1 I I I M I I I I I I M I I I M : : I I I I M 
g005 VDVMTAGEFKRTVTFMGENTEKGKQKFRQELEETHQLFKQFVSENRPGLDIEKIATGEHW 

250 260 270 280 290 300 



300 310 320 330 340 350 

m005 .pep FGRQALALNLIDEISTSDDLLLKAFENKQVIEVKYQEKQSLIQRIGLQ7VEASVEKLFAKL 
I | | | M I II t I I I I I I I I I I I I II I I I M I I I I I I I I I : M I i I I I I I I II I I I I I II I I 
g005 FGRQALALNLIDEISTSDDLLLKAFENKQVIEVKYQEKRSLIQRIGLQAEASVEKLFAKL 

310 320 330 340 350 360 



360 

mO 0 5 . pep VNRRADVMX 
I I I I I I I I I 
g005 VNRRADVMX 



The following partial DNA sequence was identified in TV. gonorrhoeae <SEQ ID 25>: 

g0O6 . seq 

1 ATGCTGCTGG TGCTggaatt ttggttCGGc gtGtCGGCGG TGGGCatact 

51 tgCGTTGTTT TTATGGCttt TGCCACGTTT TGCCGCCATC AGCGAAAACC 

101 TGTATTTCCG CCTGAACAAC AGCTTGGAAC gcgACAACCA CTTTATCCGA 

151 AAAGGCGACG AGCGGCAGCT GTACCGCCAT TACGGACTGG TTTCGCGCCT 

201 GCGTGTGCTG ATTTCCAACC GCGAAGCCTT CGGCTATCTC TGCGTCGGCG 

251 CGGCGATGGG TATTTTGTTC GGCTTTGCTT TTGTGATGAT GACGCTCAAA 

301 GGCTACGGCA GCGCGGGGCA TATTTATTCG GTCGGCACTT ATCTGTGGAT 

351 GTTTGCCATG AGTTTGGACG ATGTGCCGCG ATTGGTCGAA CAATATTCCA 

401 ATTTGAAAGA CATCGGACAA CGGATAGAGT GGTCGGAACG GAACATCAAA 

4 51 GCCGGAACTT GA 

This corresponds to the amino acid sequence <SEQ ID 26; ORF 006.ng>: 

g006 .pep 

1 MLLVLE FWFG VSAVGILALF LWLLPRFAAI SENLYFRLNN SLERDNHFIR 
51 KGDERQLYRH YGLVSRLRVL ISNREAFGYL CVGAAMGILF GFAFVMM TLK 
101 GYGSAGHIYS VGTYLWMFAM SLDDVPRLVE QYSNLKDIGQ RIEWSERNIK 
151 AGT* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ED 27>: 

m00 6 . seq 

1 ATGCTGCTGG TGCTGGAATT TTGGGTCGGC GTGTCGGCGG TGGGCATACT 

51 TGCGTTGTTT TTATGGCTTT TGCCACGTTT TGCCGCCATC AGCGAAAACC 

101 TGTATTTCCG CCTGAACAAC AGCTTGGAAC GCGACAACCA CTTTATCCGA 

151 AAAGGCGACC GGCGGCAGCT GTACCGCCAT TACGGACTGC TTGCGCGCCT 

2 01 GCGTGTGCTG ATTTCCAACC GCGAAGCCTT CGGCTATCTC TGCGTCGGCA 
251 CGGCGATGGG TATTTTGTTC GGCTTTGCTT TTGTGATGAT GACGCTCAAA 

3 01 GGCTACAGCA GCGCGGGGCA TGTCTATTCG GTCGGCACTT ATCTGTGGAT 
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3 51 GTTTGCCATG AGTTTGGACG ACGTGCCGCG ATTGGTCGAA CAATATTCCA 

4 01 ATTTGAAAGA CATCGGACAA CGGATAGAGT GGTCGGAACG GAACATCAAA 
4 51 GCCGGAACTTGA 

This corresponds to the amino acid sequence <SEQ ID 28; ORF 006>: 

m006 . pep 

1 MLLVLE FWVG VSAVGILALF LWLLPRFAAI SENLYFRLNN SLERDNHFIR 
51 KGDRRQLYRH YGLLARLRVL I SNREAFGYL CVGTAMGILF GFAFVM MTLK 
101 GYSSAGHVYS VGTYLWMFAM SLDDVPRLVE QYSNLKDIGQ RIEWSERNIK 
151 AGT* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 29>: 

a006 . seq 

1 ATGCTGCTGG TGCTGGAATT TTGGGTCGGC GTGTCGGCGG TGGGCATACT 

51 TGCGTTGTTT TTATGGCTTT TGCCACGTTT TGCCGCCATC AGCGAAAACC 

101 TGTATTTCCG CCTGAAGAAC AGCTTGGAAC GCGACAACCA CTTTATCCGA 

151 AAAGGCGACG AGCGGCAGCT GGACCGCCAT TACGGACTGC TTGCGCGCCT 

201 GCGTGTGCTG ATTTCCAACC GCGAAGCCTT CGGCTATCTC TGCGTCGGCA 

251 CGGCGATGGG TATTTTGTTC GGCTTTGCTT TTGTGATGAT GACGCTCAAA 

301 GGCTACAGCA GCGCGGGGCA TGTCTATTCG GTCGGCACTT ATCTGTGGAT 

351 GTTTGCCATA AGTTTGGACG ACGTGCCGCG ATTGGTCGAA CAATATTCCA 

401 ATTTGAAAGA CATCGGACAA CGGATAGAGT GGTCGAAACG GAACATCAAA 

4 51 GCCGGAACTT GA 

This corresponds to the amino acid sequence <SEQ ID 30; ORF 006.a>: 

a006 .pep 

1 MLLVLEFWVG VSAVGILALF LWLLPRFAAI SENLYFRLKN SLERDNHFIR 
51 KGDERQLDRH YGLLARLRVL I SNREAFGY L CVGTAMGILF GFAFVM MTLK 
101 GYSSAGHVYS VGTYLWMFAI SLDDVPRLVE QYSNLKDIGQ RIEWSKRNIK 
151 AGT* 

m006/a006 96.7% identity over a 153 aa overlap 

10 20 30 40 50 60 

m006 . pep MLLVLE FWVGVSAVG I LALFLWLLPRFAAISENLYFRLNNSLERDNHFIRKGDRRQLYRH 

M I I I I I I j II I I I I I I I I M I I I M I I I I I I M I I I I : I 1 I 1! i I II I I I 1 t : I I ! I I 
a 006 MLLVLE FWVGVSAVG I LALFLWLLPRFAAISENLYFRLKNSLERDNHFIRKGDERQLDRH 

10 20 30 40 50 60 

70 80 90 100 110 120 

m006 . pep YGLLARLRVLISNREAFGYLCVGTAMGILFGFAFVMMTLKGYSSAGHVYSVGTYLWMFAM 
I I I I I M I I I II I ! I I I I I I I I I I I I It I I I t I I I I I I II M I I I I M I I I I I M I I I t : 
a 00 6 YGLLARLRVLISNREAFGYLCVGTAMGILFGFAFVMMTLKGYSSAGHVYSVGTYLWMFAI 

70 80 90 100 110 120 

130 140 150 

m00 6 . pep SLDDVPRLVEQYSNLKDIGQRIEWSERNIKAGTX 
I I I I I I II I I I I I I I M I I I I I I I 1 : I I I I I I M 
a 006 SLDDVPRLVEQYSNLKDIGQRIEWSKRNIKAGTX 

130 140 150 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 006 shows 95.4% identity over a 153 aa overlap with a predicted ORF (ORF 006.ng) 
from N. gonorrhoeae: 

zn006/g006 

10 20 30 40 50 60 

m006 . pep MLLVLE FWVGVSAVG I LALFLWLL PR FAAISENLYFRLNNSLERDNHF I RKGDRRQLYRH 
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g006 




10 20 30 40 50 60 



m006 . pep 



70 SO 90 100 110 120 

YGLLARL.RVL I SNREAFGYLCVGTAMG I LFGFAFVMMTLKG YS S AGHVYS VGT YLWMFAM 



g006 




70 80 90 100 110 120 



m006 . pep 



130 140 150 

S LDDV PRLVEQ YSNLKD I GQR I E W S ERN I KAGTX 



gll8 




LDDVPRLVEQYSNLKDI GQRI EWSERNI KAGT 
130 140 150 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 31>: 

g006-l.seq 



1 ATGTGGAAAA TGTTGAAACA CATAGCCAAA ACCCACCGCA AG C G AT T GAT 

51 TGGCACATTT TCCCCGGTCG GACTGGAAAA CCTTTTGATG CTGGGGTATC 

101 CGGTGTTTGG CGGCTGGGCG ATTAATGCCG TGATTGCGGG GAGGGTGTGG 

151 CAGGCGTTGC TGTACGCTTT GGTTGTATTT TTGATGTGGC TGGTCGGTGC 

201 GGCACGGCGG ATTGCCGATA CGCGCACGTT TACGCGGATT TATACCGAAA 

251 TCGCCGTGCC GGTTGTGTTG GAACAACGGC AGCGGCAAGT CCCGCATTCA 

301 GCGGTAACTG CACGGGTTGC CCTGTCGCGT GAATTTGTCA GCTTTTTTGA 

351 AGAACACCTG CCGATTGCCG CGACATCCGT CGT AT C CAT A TTCGGCGCGT 

4 01 GCATCATGCT GCTGGTGCTG GAATTTTGGG TCGGCGTGTC GGCGGTGGGC 

4 51 ATACTTGCGT TGTTTTTATG GCTTTTGCCA CGTTTTGCCG CCATCAGCGA 

501 AAACCTGTAT TTCCGCCTGA ACAACAGCTT GGAACGCGAC AACCACTTTA 

551 TCCGAAAAGG CGACGAGCGG CAGCTGTACC GCCATTACGG ACTGGTTTCG 

601 CGCCTGCGTG TGCTGATTTC CAACCGCGAA GCCTTCGGCT ATCTCTGCGT 

651 CGGCGCGGCG ATGGGTATTT TGTTCGGCTT TGCTTTTGTG ATGATGACGC 

7 01 TCAAAGGCTA CGGCAGCGCG GGGCATATTT ATTCGGTCGG CACTTATCTG 

7 51 TGGATGTTTG CCATGAGTTT GGACGATGTG CCGCGATTGG TCGAACAATA 

801 TTCCAATTTG AAAG AC AT CG GACAACGGAT AGAGTGGTCG GAACGGAACA 

851 TCAAAGCCGG AACTTGA 



This corresponds to the amino acid sequence <SEQ ID 32; ORF 006-1. ng>: 

g006-l . pep 

1 MWKMLKHIAK THRKRLIGTF SPVGLENLLM LGYPVFGGWA INAVIAGRVW 
51 Q ALLYALVVF LMWLVGAA RR IADTRTFTRI YTEIAVPWL EQRQRQVPHS 



101 AVTARVALSR EFVSFFEEHL PIA ATSWSI FGACIMLLVL EFWVGVSAVG 

151 ILALFLWLLP RFAAISENLY FRLNNSLERD NHFIRKGDER QLYRHYGLVS 

201 RLRVLISNRE AFGY LCVGAA MGILFGFAFV MM TLKGYGSA GHIYSVGTYL 

2 51 WMFAMSLDDV PRLVEQYSNL KDIGQRIEWS ERN I KAGT* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 33>: 

m006-l . seq 



1 ATGTGGAAAA TGTTGAAACA CATAGCCCAA ACCCACCGCA AGCGATTGAT 

51 TGGCACATTT TCCCTGGTCG GACTGGAAAA CCTTTTGATG CTGGTGTATC 

101 CGGTGTTTGG CGGCCGGGCG ATCAATGCCG TGATTGCGGG GGAGGTGTGG 

151 CAGGCGTTGC TGTACGCTTT GGTTGTGCTT TTGATGTGGC TGGTCGGTGC 

201 GGTGCGGCGG ATTGCCGATA CGCGCACGTT TACGCGGATT TATACCGAAA 

251 TCGCCGTGCC GGTCGTGTTG GAACAGCGGC AGCGACAAGT CCCGCATTCG 

301 GCGGTAACTG CGCGGGTTGC CCTGTCGCGT GAGTTTGTCA GCTTTTTTGA 

351 AGAACACCTG CCGATTGCCG CGACATCCGT CGTATCCATA TTCGGCGCGT 

4 01 GCATCATGCT GCTGGTGCTG GAATTTTGGG TCGGCGTGTC GGCGGTGGGC 

4 51 ATACTTGCGT TGTTTTTATG GCTTTTGCCA CGTTTTGCCG CCATCAGCGA 

501 AAACCTGTAT TTCCGCCTGA ACAACAGCTT GGAACGCGAC AACCACTTTA 

551 TCCGAAAAGG CGACCGGCGG CAGCTGTACC GCCATTACGG ACTGCTTGCG 

601 CGCCTGCGTG TGCTGATTTC CAACCGCGAA GCCTTCGGCT ATCTCTGCGT 



WO 99/057280 



PCT/US99/09346 

177 



651 CGGCACGGCG ATGGGTATTT TGTTCGGCTT TGCTTTTGTG ATGATGACGC 

7 01 TCAAAGGCTA CAGCAGCGCG GGGCATGTCT ATTCGGTCGG CACTTATCTG 

7 51 TGGATGTTTG CCATGAQTTT GGACGACGTG CCGCGATTGG TCGAACAATA 
801 TTCCAATTTG AAAGACATCG GACAACGGAT AGAGTGGTCG GAACGGAACA 

8 51 TCAAAGCCGG AACTTGA 

This corresponds to the amino acid sequence <SEQ ED 34; ORF 006- 1>: 

m00 6-l . pep 

1 MWKMLKHIAQ THRKRLIGTF SLVGLENLLM LVYPVFGGRA INAVIAGEVW 

Q ALLYALVVL LMWLVGAV RR IADTRTFTRI YTEIAVPWL EQRQRQVPHS 

AVTARVALSR EFVSFFEEHL PIA ATSWSI FGACIMLLVL E FWVGVSAVG 

ILALFLWLLP RFAAISENLY FRLNNSLERD NHFIRKGDRR QLYRHYGLLA 



51 
101 
151 
201 
251 



RLRVLISNRE AFGY LCVGTA MGILFG FAFV M MTLKGYSSA GHVYSVGTYL 
WMFAMSLDDV PRLVEQYSNL KDIGQRIEWS ERNIKAGT* 



moo6-i/goo6-i 95.5% identity in 288 aa overlap 



10 20 30 40 50 60 

m00 6-l.pep MW KMLKH I AQTHRKRL I GT FSLVGLENLLMLVYPVFGGRAIN AVI AGE VW QALLYALVVL 
I M I I I M I : I I 1 I 1 I I I t I I MINIMI MINI II II M I h M MM II III: 
g00 6-l MWKMLKHIAKTHRKRLIGTFSPVGLENLLMLGYPVFGGWAINAVIAGRVWQALLYALVVF 

10 20 30 40 50 60 

70 80 90 100 110 120 

m006-l .pep LMWLVGAVRR I ADTRTFTR I YTE I AVPWLEQRQRQVPHSAVTARVALSREFVSFFEEHL 
M M I M : I M I M M M I I I M I M I I II M I I I M I I M I I M M I I I M M M I I M 
g00 6-l LMWLVGAARRI ADTRTFTR I YTEIAVPWLEQRQRQVPHSAVTARVALSREFVSFFEEHL 

70 80 90 100 110 120 

130 140 150 160 170 180 

m006-l .pep PIAATSWSIFGACIMLLVLEFWVGVSAVGILALFLWLLPRFAAISENLYFRLNNSLERD 
I M II 1,1 M I M I II I M M I I M I I M I M I M M 11 I M II I M 11 M I M II I II II 
g006-l PIAATSWSI FGACIMLLVLEFWVGVSAVGILALFLWLLPRFAAISENLYFRLNNSLERD 

130 140 150 160 170 180 

190 200 210 220 230 240 

m006-l .pep NHFI RKGDRRQLYRHYG LLARLRVLI SNREAFGYLCVGTAMG I LFGFAFVMMTLKG YS S A 
M i I M M : M M M M I : : M 11 M I M M I II I M I M M M M M I I M It II I : M 
g006-l NHFIRKGDERQLYRHYGLVSRLRVLISNREAFGYLCVGAAMGILFGFAFVMMTLKGYGSA 

190 200 210 220 230 240 



250 260 270 280 289 

m00 6-l .pep GHVYSVGTYLWMFAMSLDDVPRLVEQYSNLKDIGQRIEWSERNIKAGTX 
I I : II II I II M 11 I 1 I II 1 I I I M I 11 M I 1 II II II M M II I I I 1 I 
g006-l GHIYSVGTYLWMFAMSLDDVPRLVEQYSNLKDIGQRIEWSERNIKAGTX 

250 260 270 280 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 35>: 

a006-l.seq (partial) 

1 . . AGCCAAAACC ACCGCAAGCG ATTGATTGGC ACATTTTTTC TGGTCGGACT 

51 GGAAAACCTT TTGATGCTGG TGTATCCGGT GTTTGGCGGC TGGGCGATTA 

101 ATGCCGTGAT TGCGGGGCAG GCGTGGCAGG CGTTGCTGTA CGCTTTGGTT 

151 GTGCTTTTGA TGTGGCTGGT CGGTGCGGCG CGGCGGATTG CCGATACGCG 

201 CACGTTTACG CGGATTTATA CCGAAATCGC CGTGCCGGTT GTGTTGGAAC 

251 AGCGGCAGCG GCAAGTCCCG CATTCGGCGG TAACTGCGCG GGTTGCCCTG 

301 TCGCGTGAGT TTGTCAGCTT TTTTGAAGAA CACCTGCCGA TTGCCGCGAC 

351 ATCCGTCGTA TCCATATTCG GCGCGTGCAT CATGCTGCTG GTGCTGGAAT 

4 01 TTTGGGTCGG CGTGTCGGCG GTGGGCATAC TTGCGTTGTT TTTATGGCTT 

4 51 TTGCCACGTT TTGCCGCCAT CAGCGAAAAC CTGTATTTCC GCCTGAAGAA 

501 CAGCTTGGAA CGCGACAACC ACTTTATCCG AAAAGGCGAC GAGCGGCAGC 

551 TGGACCGCCA TTACGGACTG CTTGCGCGCC TGCGTGTGCT GATTTCCAAC 

601 CGCGAAGCCT TCGGCTATCT CTGCGTCGGC ACGGCGATGG GTATTTTGTT 

651 CGGCTTTGCT TTTGTGATGA TGACGCTCAA AGGCTACAGC AGCGCGGGGC 
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7 01 ATGTCTATTC GGTCGGCACT TATCTGTGGA TGTTTGCCAT AAGTTTGGAC 

7 51 GACGTGCCGC GATTGGTCGA ACAATATTCC AATTTGAAAG ACATCGGACA 

8 01 ACGGATAGAG TGGTCGAAAC GGAACATCAA AGCCGGAACT TGA 

This corresponds to the amino acid sequence <SEQ ID 36; ORF 006-1. a>: 

a006-l.pep (partial) 

1 . . SQNHRKRLIG TFFLVGLENL LMLVYPVFGG WAINAVIAGQ AW QALLYALV 

51 VLLMWLVGAA RRIADTRTFT RIYTEIAVPV VLEQRQRQVP HSAVTARVAL 

101 SREFVSFFEE HLPIA ATSVV SI FGACIMLL VL E FWVGVSA VGILALFLWL 

151 LPRFAAI SEN LYFRLKNSLE RDNHFIRKGD ERQLDRHYGL LARLRVLISN 

2 01 REAFGY LCVG TAMGILFGFA FVM MTLKGYS SAGHVYSVGT YLWMFAISLD 

2 51 DVPRLVEQYS NLKDIGQRIE WSKRNIKAGT * 

a006-l/m006-l 95.7% identity in 280 aa overlap 

10 20 30 40 50 

SQNHRKRLIGTFFLVGLENLLMLVYPVFGGWAINAVIAGQAWQALLYALWL 

: I : M I I I I M I I I I I I I II It I M I ! 1 I I I I I I M I : : I I I I I I 1 I I I I 
MWKMLKHIAQTHRKRLIGTFSLVGLENLLMLVYPVFGGRAINAVIAGEVWQALLYALVVL 
10 20 30 40 50 60 

60 70 80 90 100 110 

LMWLVGAARRIADTRTFTRI YTEIAVPVVLEQRQRQVPHSAVTARVALSREFVSFFEEHL 

I I I I I I I : I I I I I I I I 1 I I M II I I I I I I M I I I I I I II I I I I I I I I I I M I I M I I I II 
LMWLVGAVRRIADTRTFTRIYTEIAVPWLEQRQRQVPHSAVTARVALSREFVSFFEEHL 
70 80 90 100 110 120 

120 130 140 150 160 170 

P I AATSWS I FGACIMLLVLEFWVGVS AVGI LALFLWLLPRFAAI SENLYFRLKNSLERD 
I I M I I M I I I I I I I I I I I M I I I I I I II II I I M I I I I I I I I I I I I I I I I I I : I I I I I I 
P I AATSWS I FGACIMLLVLEFWVGVS AVGI LALFLWLLPRFAAI SEN LYFRLNNSLERD 
130 * 140 150 160 170 180 

180 190 200 210 220 230 

NHFIRKGDERQLDRHYGLLARLRVLISNREAFGYLCVGTAMGILFGFAFVMMTLKGYSSA 

I [ I I I M I : I I I I I I I I II I 1 I I I I M I I I I M I I I I I II I I M I I I I I I I I II 

NH FIRKGDRRQLYRHYGLLARLRVLI SNREAFG YLCVGTAMG I LFG FAFVMMTLKG YS S A 
190 200 210 220 230 240 

240 250 260 270 280 

GHVYSVGTYLWMFAISLDDVPRLVEQYSNLKDIGQRIEWSKRNIKAGTX 
M I I I I I I I I I M I : I I II I I I I I I I I IE I I I I I I I M I I : 1 I I I I I I I 
GHVYSVGTYLWMFAMSLDDVPRLVEQYSNLKDIGQRIEWSERNIKAGTX 
250 260 270 280 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 37>: 

g007 . seq 

1 atgaACACAA CCCGACTGCC GACCGCCTTC ATCTTGTGCT GCCTCTGcgC 

51 CGCcGCTTCT GCCGccgaca acAGCatcat gaCaAAAGGG CAAAAAGTGT 

101 ACGAATCcAa ctGCATCGCC TGCCACGGCA AGAAAGGGGA AGGGCGCGGC 

151 ACTGCGtTTC CTccgctTTT CCggtcgGac tgtattatga acaAACCGCa 

201 cgTCCtgctg cacagcatgg tcaaaggcAt cgacgggaca ttcaaagtgg 

251 agcggcaaaa cctacgacgg atttatgCcc gcaaccgcca tcagcgATGC 

301 GGACATTGCC GCCGTCGCCA CTTATATCAT GAACGCCTTT GA 

This corresponds to the amino acid sequence <SEQ ID 38; ORF 007. ng>: 

g007 .pep 

1 MNTTRLPTAF ILCCLCAAAS AA DNSIMTKG QKVYESNCIA CHGKKGEGRG 

51 TAFPPLFRSD CIMNKPHVLL HSMVKGIDGT FKVERQNLRR IYARNRHQRC 

101 GHCRRRHLYH ERL* 



a006-l . pep 
m006-l 

a006-l -pep 
m006-l 

a006-l . pep 
m006-l 

a006-l . pep 
m006-l 

a006-l . pep 
m006-l 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 39>: 
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m007 . seq 

1 ATGAACACAA CCCGhCTGCC 

51 CGCCGCTTCT GCCGCCGACA 

101 ACGAATCCAA CTGCGTCGCC 

151 ACCATGTTTC CGCCGCTCTA 

201 GGTGCTGCTG CACAGCATGG 

2 51 AACGGCAAAA CCTACAACGG 

301 GGACATTGCC GCCGTCGCCA 



GACCGCCCTC GTCTTGGGCT GCTTCTGCGC 
ACAGCATCAT GACAAAAGGG CAAAAAGTGT 
TGCCACGGCA AAAAGGGCGA AGGCCGCGGA 
CCGCTCCGAC TTCATCATGA AAAAACCGCA 
T C AAAGGC AT CAACGGTACA ATCAAAGTC. 
ATTCATGCCC GCAACCGCCA TCAGCGATGC 
CT TAT AT CAT GAACGCCTTT GA 



This corresponds to the amino acid sequence <SEQ ID 40; ORF 007>: 

m007 .pep 

1 MNTTRLPTAL VLGCFCAAAS A ADNSIMTKG QKVYESNCVA CHGKKGEGRG 

51 TMFPPLYRSD FIMKKPQVLL HSMVKGINGT IKVXRQNLQR IHARNRHQRC 

101 GHCRRRHLYH ERL* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 41>: 

a007 . seq 

1 ATGAACACAA CCCGACTGCC GACCGCCCTC GTCTTGGGCT GCCTCTGCGC 

51 CGCCGCTTCT GCCGCCGACA ACAGCATCAT GACAAAAGGG CAAAAAGTGT 

101 ACGAATCCAA CTGCGTCGCC TGCCACGGCA AAAAGGGCGA AGGCCGCGGA 

151 ACCATGTTTC CGCCGCTCTA CCGCTCCGAC TTCATCATGA AAAAACCGCA 

201 GGTGCTGCTG CACAGCATGG T C AAAGGC AT CAACGGTACA ATCAAAGTC. 

251 AACGGCAAAA CCTACAACGG ATTCATGCCC GCCACTGCCA TCAGCGATGC 

301 GGACATTGCC GCCGTCGCCA CTTATATCAT GAACGCCTTT GA 

This corresponds to the amino acid sequence <SEQ ED 42; ORF 007. a>: 

a007 .pep 

1 MNTTRLPTAL VLGCLCAAAS A ADNSIMTKG QKVYESNCVA CHGKKGEGRG 

51 TMFPPLYRSD FIMKKPQVLL HSMVKGINGT IKVXRQNLQR IHARHCHQRC 

101 GHCRRRHLYH ERL* 

moo7/aoo7 97.3% identity over a 1 13 aa overlap 

10 20 30 40 50 60 

m007 . pep MNTTRLPTALVLGCFCAAASAADNSIMTKGQKVYESNCVACHGKKGEGRGTMFPPLYRSD 
I M M I I I M I I I I : i I I I I I I I I I I M I I I I I I I I I I I M I I I I I I I f I M I I I I I I I I 
a 007 MNTTRLPTALVLGCLCAAASAADNSIMTKGQKVYESNCVACHGKKGEGRGTMFPPLYRSD 

10 20 30 40 50 60 

70 80 90 100 110 

ra007 . pep FIMKKPQVLLHSMVKGINGTIKVXRQNLQRIHARNRHQRCGHCRRRHLYHERLX 
I I I I I M I I M 1 M I I M I I I I I I M I 1 I I I 1 I t : II I t I 1 I 1! I I 1 I I II 1 I 
a007 FIMKKPQVLLH SMVKG I NGT I KVXRQNLQRI HARHCHQRCGHCRRRHLYHERLX 

70 80 90 100 110 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 007 shows 86.7% identity over a 1 13 aa overlap with a predicted ORF (ORF 007.ng) 
from N. gonorrhoeae: 

m007/g007 

10 20 30 40 50 60 

m007 . pep MNTTRLPTALVLGCFCAAASAADNSIMTKGQKVYESNCVACHGKKGEGRGTMFPPLYRSD 
I I I I t I I I I :: I I : I I I I I I I I I I I I I I I I I I I I 1 II : I I I I I 1 I I I I M I I I I : I I I 
g 007 MNTTRLPTAFILCCLCAAASAADNSIMTKGQKVYESNCIACHGKKGEGRGTAFPPLFRSD 

10 20 30 40 50 60 



70 80 90 100 110 

m007 .pep FIMKK PQVLLHSMVKGINGT I KVXRQNLQRI RARNRHQRCGHCRRRHLYHERLX 

I I : I I : I I I I II I I I I : I I : I I I I I I : I I : I I I I II I II I I I I I II I I I I I 
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g007 CIMNKFHVLLHSMVKGIDGTFKVERQNLRRI YARNRHQRCGHCRRRHLYHERL 

70 80 90 100 110 

The following partial DNA sequence was identified in TV. gonorrhoeae <SEQ ID 43>: 

g007-l.seq (partial) 

1 ATGAACACAA CCCGACTGCC GACCGCCTTC ATCTTGTGCT GCCTCTGCGC 

51 CGCCGCTTCT GCCGCCGACA ACAGCATCAT GACAAAAGGG CAAAAAGTGT 

101 ACGAATCCAA CTGCATCGCC TGCCACGGCA AGAAAGGGGA AGGGCGCGGC 

151 ACTGCGTTTC CTCCGCTTTT CCGGTCGGAC T AT AT TAT G A ACAAACCGCA 

201 CGTCCTGCTG CACAGCATGG TCAAAGGCAT CAACGGTACA ATCAAAGTCA 

251 ACGGCAAAAC CTACAACGGA TTCATGCCCG CAACCGCCAT CAGCGATGCG 

301 GACATTGCCG CCGTCGCCAC TTATATCATG AACGCCTTTG ACAACGGCGG 

351 CGGAAGCGTT ACCGAAAAAG ACGTAAAACA GGCAAAAGGC AAAAAAAAC . 

This corresponds to the amino acid sequence <SEQ ID 44; ORF 007-l.ng>: 

g007-l.pep (partial) 

1 MNTTRLPTAF 1LCCLCAAAS A ADNSIMTKG QKVYESNCIA CHGKKGEGRG 
51 TAFPPLFRSD YIMNKPHVLL HSMVKGINGT IKVNGKTYNG FMPATAISDA 
101 DIAAVATYIM NAFDNGGGSV TEKDVKQAKG KKN . . . 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 45>: 

m007-l . seq 

1 ATGAACACAA CCCGACTGCC GACCGCCCTC GTCTTGGGCT GCTTCTGCGC 

51 CGCCGCTTCT GCCGCCGACA ACAGCATCAT GACAAAAGGG CAAAAAGTGT 

101 ACGAATCCAA CTGCGTCGCC TGCCACGGCA AAAAGGGCGA AGGCCGCGGA 

151 ACCATGTTTC CGCCGCTCTA CCGCTCCGAC TTCATCATGA AAAAACCGCA 

201 GGTGCTGCTG CACAGCATGG TCAAAGGCAT CAACGGTACA ATCAAAGTCA 

251 ACGGCAAAAC CTACAACGGA TTCATGCCCG CAACCGCCAT CAGCGATGCG 

301 GACATTGCCG CCGTCGCCAC TTATATCATG AACGCCTTTG ACAACGGCGG 

351 CGGAAGCGTT ACCGAAAAAG ACGTAAAACA GGCAAAAAGC AAAAAAAACT 
401 AA 

This corresponds to the amino acid sequence <SEQ ID 46; ORF 007- 1> 

m007-l . pep 

1 MNTTRLPTAL VLGCFCAAAS A ADNSIMTKG QKVYESNCVA CHGKKGEGRG 
51 TMFPPLYRSD FIMKKPQVLL HSMVKGINGT IKVNGKTYNG FMPATAISDA 
101 DIAAVATYIM NAFDNGGGSV TEKDVKQAKS KKN* 

m007-i / g007-i 9 1 .7% identity in 133 aa overlap 

10 20 30 40 50 60 

m007-l . pep MNTTRLPTALVLGCFCAAASAADNSIMTKGQKVYESNCVACHGKKGEGRGTMFPFLYRSD 
I II I II I I I : : I I : I I I I I I I I I I I I I I I I I I I I I I I : I I I 11 I I I I I II I I I I : I I I 
g007-l MNTTRLPTAFILCCLCAAASAADNSIMTKGQKVYESNCIACHGKKGEGRGTAFPPLFRSD 

10 20 30 40 50 60 

70 80 90 100 110 120 

m007-l .pep FIMKKPQVLLHSMVKGINGTIKVNGKTYNGFMPATAISDADIAAVATYIMNAFDNGGGSV 
: I I : I I : I I I I I I I I I I M I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I i 
g007-l YIMNKPHVLLHSMVKGINGTIKVNGKTYNGFMPATAISDADIAAVATYIMNAFDNGGGSV 

70 80 90 100 110 120 

130 

m007-l .pep TEKDVKQAKSKKNX 

I I I I I II II : i i I 
g0 0 7 - 1 TEKDVKQAKGKKN 

130 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 47>: 

a007-l.seq (partial) 

1 ATGAACACAA CCCGACTGCC GACCGCCCTC GTCTTGGGCT GCCTCTGCGC 

51 CGCCGCTTCT GCCGCCGACA ACAGCATCAT GACAAAAGGG CAAAAAGTGT 

101 ACGAATCCAA CTGCGTCGCC TGCCACGGCA AAAAGGGCGA AGGCCGCGGA 
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151 ACCATGTTTC CGCCGCTCTA CCGCTCCGAC TTCATCATGA AAAAACCGCA 

2 01 GGTGCTGCTG CACAGCATGG TCAAAGGCAT CAACGGTACA ATCAAAGTCA 

2 51 ACGGCAAAAC CTACAACGGA TTCATGCCCG CCACTGCCAT CAGCGATGCG 
301 GACATTGCCG CCGTCGCCAC T TAT AT CAT G AACGCCTTTG ACAACGGCGG 

3 51 CGGAAGCGTT ACCGAAAAAG ACGTAAAACA GGCAAAAAAC AAAAAA . . 

This corresponds to the amino acid sequence <SEQ ID 48; ORF 007-1. a>: 

a007-l.pep (partial) 

1 MNTTRLPTAL VLGCLCAAAS A ADNSIMTKG QKVYESNCVA CHGKKGEGRG 

51 TMFPPLYRSD FIMKKPQVLL HSMVKGINGT IKVNGKTYNG FMPATAISDA 

101 DIAAVATYIM NAFDNGGGSV TEKDVKQAKN KK . . 

moo7-i/aoo7-i 98.5% identity in 132 aa overlap 

10 20 30 40 50 60 

m007-l . pep MNTTRLPTALVLGCFCAAASAADNSIMTKGQKVYESNCVACHGKKGEGRGTMFPPLYRSD 
I II I I t I I I I I I 1 I : I I I II II I I I M I M I II t I M If I I I I I I I I M I M M I I i I I I 
a007-l IVINTTRLPTALVLGCLCAAASAADNSIMTKGQKVYESNCVACHGKKGEGRGTMFPPLYRSD 

10 20 30 40 50 60 

70 80 90 100 110 120 

m007-l .pep FIMKKPQVLLHSMVKGINGTIKVNGKTYNGFMPATAISDADTAAVATYIMNAFDNGGGSV 
I I i I I II I II II I I I I I I I I I I I M I I I I I I I I II I II I I I I I I I I I I I I I I i M M I I I 
a007-l FIMKKPQVLLHSMVKGINGTIKVNGKTYNGFMPATAISDADIAAVATYIMNAFDNGGGSV 

70 80 90 100 110 120 

130 

m007-l . pep TEKDVKQAKSKKNX 

I I I I I I M I : I I 
a007-l TEKDVKQAKN KK 

130 



The following partial DNA sequence was identified in TV. gonorrhoeae <SEQ ID 49>: 

g008 . seq 

1 ATGAACAACA GACATTTTGC CGTCAtcgCC TTGGGCAGCA ACCTTGACAA 

51 CCCCGCACAA CAAATacgcg gcgcattaga cgcgctctcg tcccatcctg 

101 acatccggct tgaaCaggtt tcctcactgt aTatgaccgc acctgtcggt 

151 tacgAcaaTC agcccgATTT CATCaatgcc gTCTgcaccg TTTCCACCAC 

201 CtTGGACGGC ATTGcccTGC TTGCCgaACT CAAccgTATC GAAGCCGATT 

251 TCGGACGCGA aCGCAGTTTC CGCAATGCAC CGCGCACATT GGATTTGGAC 

301 ATTATCGACT TTGACGGCAT CTCCAGCGAC GACCCCCGCC TTACCCTGCC 

351 GCATCCGCGC GCGCACGAAC GCAGTTTCGT CATACGCCCT TT GGCAG AAA 

4 01 TCCTCCCTGA TTTTATTTTG GGAAAATACG GAAAGGTTGT CGAATTGTCA 

4 51 AAACGGCTGG GCAATCAAGG CATCCGTCTT TTACCGGACA GGTAA 



This corresponds to the amino acid sequence <SEQ ID 50; ORF 008.ng>: 

g008 .pep 

1 MNNRH FAVIA LGSNLDN PAQ QIRGALDALS SHPDIRLEQV SSLYMTAPVG 

51 YDNQPDFINA VCTVSTTLDG IALLAELNRI EADFGRERSF RNAPRTLDLD 

101 IIDFDGISSD DPRLTLPHPR AHERSFVIRP LAEILPDFIL GKYGKVVELS 

151 KRLGNQGIRL LPDR* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 51>: 

m008 . seq 

1 ATGAACAACA GACATTTTGC CGTCATCGCC CTGGGCAGTA ATCTTGAAAA 

51 CCCTGCTCAA CAGGTACGCG CCGCATTGGA CACGCTGTCG TCCCATCCTG 

101 ACATCCGTCT TAAACAGGCT TCCTCACTGT ATATGACCGC GCCCGTCGGT 

151 TACGACAATC AGCCCGATTT TGTCAATGCC GTCTGCACCG TTTCCACCAC 

201 TCTGGACGGC ATTGCCyTGC TTGCCGAACT CAACCGTATC GAGGCTGATT 

251 TCGGACGCGA ACGCAGCTTC CGCAACGCGC CGCGCACATT GkATTTGGAC 

301 ATTATCGACT TTGACGGCAT CTCCAGCGAC GACACsCGAC TcACCtTGCC 
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351 GCATCCGCGC GCGCACGAAC GCAGTTTCGT CATCCGCCCT TTGGCAGAAA 
4 01 TCCTCCCTGA TTTTGTTTTA GGAAAACACG GAAAGGTTGC CGAATTGTCA 
4 51 AAACGGyTGG GCAATCAAGG TATCCGTCTT TTACCGGACA GGTAATT 

This corresponds to the amino acid sequence <SEQ ED 52; ORF 008>: 

m008 .pep 

1 MNNRHFAVIA LGSNLENPAQ QVRAALDTLS SHPDIRLKQA SSLYMTAPVG 

51 YDNQPDFVNA VCTVSTTLDG IALLAELNRI EADFGRERSF RNAPRTLXLD 

101 IIDFDGISSD DTRLTLPHPR AHERSFVIRP LAEILPDFVL GKHGKVAELS 

151 KRLGNQGIRL LPDR* 

The following partial DNA sequence was identified in N. meningitidis<SEQ ID 53>: 

a008 . seq 

1 ATGAACAACA GACATTTTGC CGTCATCGCC CTGGGCAGTA ATCTTGAAAA 

51 CCCTGCCCAA CAGGTACGCG CCGCATTGGA CACGCTGTCG TCCCATCCTG 

101 ACATCCGTCT TAAACAGGCT TCCTCACTGT ATATGACCGC GCCCGTCGGT 

151 TACGACAATC AGCCCGATTT CGTCAATGCC GTCTGCACCG TTTCCACCAC 

2 01 CTTGGACGGC ATTGCCCTGC TTGCCGAACT CAACCGTATC GAAGCCGATT 

2 51 TCGGACGCGA ACGCAGCTTC CGCAACGCGC CGCGCACATT GGATTTGGAC 

3 01 ATTATCGACT T T G AC GG CAT CTCCAGCGAC GACCCCCGAC TCACCCTGCC 
351 GCATCCGCGC GCGCACGAAC GCAGTTTCGT CATACGCCCT TTGGCAGAAA 

4 01 TCCTCCCTGA TTTTATTTTG GGAAAACACG GAAAGGTTGC CGAATTGTCA 
4 51 AAACGGCTGG GCAATCAAGG CATCCGTCTT TTACCGGATA AGTAA 

This corresponds to the amino acid sequence <SEQ ID 54; ORF 008. a>: 

a008 .pep 

1 MNNRHFAVIA LGSNLENPAQ QVRAALDTLS SHPDIRLKQA SSLYMTAPVG 

51 YDNQPDFVNA VCTVSTTLDG IALLAELNRI EADFGRERSF RNAPRTLDLD 

101 IIDFDGISSD DPRLTLPHPR AHERSFVIRP LAEILPDFIL GKHGKVAELS 

151 KRLGNQGIRL LPDK* 

m008/a008 97.6% identity over a 164 aa overlap 

10 20 30 40 50 60 

MNNRHFAVIALGSNLENPAQQVRAALDTLSSHPDIRLKQASSLYMTAPVGYDNQPDFVNA 

I I I I I I M I I M I I I I I I I I I I I I ! i I M I I 11 I I I I I i M I f II I I i I I I I I I I I I I I I 

MNNRH FAV I ALGSNLENPAQQVRAALDTLSSHPDIRLKQASSLYMTAPVG YDNQPDFVNA 
10 20 30 40 50 60 

70 80 90 100 110 120 

VCTVSTTLDGIALLAELNRIEADFGRERSFRNAPRTLXLDIIDFDGISSDDTRLTLPHPR 
If I I I I I i I I t I I M I I M I I M II I I I ! I I I I I I I I I I II I I I I I I I I I I I I I I I I I 
VCTVSTTLDG IALLAELNRIEADFGRERSFRNAPRTLDLDI I DFDGISSDDPRLTLPHPR 
70 80 90 100 110 120 

130 140 150 160 

AHERSFVIRPLAEILPDFVLGKHGKVAELSKRLGNQGIRLLPDRX 
I I I I I I I I I I I I M IE It : I I I I I I I I I I M II I I I I I I I I I I : I 
AHERSFVIRPLAEILPDFILGKHGKVAELSKRLGNQGIRLLPDKX 
130 140 150 160 



m008 .pep 
a008 

m008 .pep 
a008 

m008 .pep 
a008 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 008 shows 92.7% identity over a 164 aa overlap with a predicted ORF (ORF008.ng) 
from N. gonorrhoeae: 

m008/g008 

10 20 30 40 50 60 

m008 . pep MNNRHFAVIALGSNLENPAQQVRAALDTLSSHPDIRLKQASSLYMTAPVGYDNQPDFVNA 
I I I I I I I I I I I I I I I : I M I I : I : I I t : I I I I I I I I I : I : I I I I I I | I | | I I I I I I I : I I 
g008 MNNRHFAVIALGSNLDN PAQQIRGALDALSSHPDIRLEQVSSLYMTAPVGYDNQPDFINA 
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10 20 30 40 50 60 

70 80 90 100 110 120 

m008 . pep VCTVSTTLDGIALLAELNRIEADFGRERSFRNAPRTLXLDTIDFDGISSDDTRLTLPHPR 

I I I I I I I I I I I I II I I I i I I I I M M I I II I I I I I M I I I I II I I I M I I I I M I I I I 
g008 VCTVSTTLDGIALLAELNRIEADFGRERSFRNAPRTLDLDI IDFDGISSDDPRLTLPHPR 

70 80 90 100 110 120 

130 140 150 160 

m008 . pep AHERSFVIRPLAEILPDFVLGKHGKVAELSKRLGNQGIRLLPDRX 

II I I I I I I I I I M I I I 1 I : I I I : M I : M I I ! I I I I I I I I I M I I 
g008 AHERSFVIRPLAEILPDFILGKYGKWELSKRLGNQGIRLLPDRX 

130 140 150 160 



The following partial DNA sequence was identified in TV. gonorrhoeae <SEQ ID 55>: 

g009 . seq 

1 ATGCCCCGCG CTGCCGTAGC CTTTGAGCGT CATCATCACA AAAGCAAAGC 

51 CGAACAAAAT ACCCATCGCC GCGCCGACGC AG AG AT AG CC GAAGGCTTCG 

101 CGGTTGGAAA TCAGCACACG CAGGCGCGAA ACCAGTCCGT AATGGCGGTA 

151 CAGCTGCCGC TCGTCGCCTT TTCGGATAAA GTGGTTGTcg cGTTCCAAGC 

201 TGTTGTTCAG GCGGAAATAC AGGTTTTCGC TGATGGCGGC AAAACGTGGC 

251 AaaaGCCATA A 

This corresponds to the amino acid sequence <SEQ ID 56; ORF 009.ng>: 

g009 .pep 

1 MPRAAVAFER HHHKSKAEQN THRRADAEIA EGFAVGNQHT QARNQSVMAV 
51 QLPLVAFSDK VWAFQAWQ AEIQVFA DGG KTWQKP* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 57>: 

m009 . seq 

1 ATGCCCCGCG CTGCTGTAGC CTTTGAGCGT CATCATCACA AAAGCAAAGC 

51 CGAACAAAAT ACCCATCGCC GTGCCGACGC AGAGATAGCC GAAGGCTTCG 

101 CGGTTGGAAA TCAGCACACG CAGGCGCGCA AGCAGTCCGT AATGGCGGTA 

151 CAGCTGCCGC CGGTCGCCTT TTCGGATAAA GTGGTTGTCG CGTTCCAAGC 

201 TGTTGTTCAG GCGGAAATAC AGGTTTTCGC TGATGGCGGC AAAACGTGGC 

2 51 AAAAGCCATA A 

This corresponds to the amino acid sequence <SEQ ID 58; ORF 009>: 

m009 .pep 

1 MPRAAVAFER HHHKSKAEQN THRRADAEIA EGFAVGNQHT QARKQSVMAV 
51 QLPPVAFSDK VWAFQAWQ AEIQVFA DGG KTWQKP* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 009 shows 97.7% identity over a 86 aa overlap with a predicted ORF (ORF 009. ng) 
from N. gonorrhoeae: 

m009/g009 

10 20 30 40 50 60 

lt\009 . pep MPRAAVAFERHHHKSKAEQNTHRRADAEIAEGFAVGNQHTQARKQSVMAVQLPPVAFSDK 
I I I I I I I I I 1 I I I I I I I I I I I I I I I I M I i I I I I I I I I I I I I I : I I I I I I I I I I I I 1 I I 
g009 MPRAAVAFERH HHKSKAEQNTHRRADAE I AEG FAVGNQHTQARNQSVMAVQLPLVAFS DK 

10 20 30 40 50 60 

70 80 
m00 9 . pep VWAFQAWQAE IQV FADGGKTWQKPX 

I I I II I I I I I 1 I I If I I I I I I I I I I M 
g 0 0 9 VWAFQAWQAE I QV FADGGKT WQK PX 

70 80 
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The following partial DNA sequence was identified in N. meningitidis <SEQ ID 59>: 

a009 . seq 

1 ATGCCCCGCG CTGCTGTAGC CTTTGAGCGT CATCATCACA AAAGCAAAGC 

51 CGAACAAAAT ACCCATCGCC GTGCCGACGC AGAGATAGCC GAAGGCTTCG 

101 CGGTTGGAAA TCAGCACACG CAGGCGCGCA AGCAGTCCGT AATGGCGGTC 

151 CAGCTGCCGC TCGTCGCCTT TTCGGATAAA GTGGTTGTCG CGTTCCAAGC 

201 TGTTCTTCAG GCGGAAATAC AGGTTTTCGC TGATGGCGGC AAAACGTGGC 

251 AAAAGCCATA A 

This corresponds to the amino acid sequence <SEQ ID 60; ORF 009. a>: 

a009 .pep 

1 MPRAAVAFER HHHKSKAEQN THRRADAEIA EGFAVGNQHT QARKQSVMAV 
51 QLPLVAFSDK VWAFQAVLQ AEIQVFADGG KTWQKP* 



m009/a009 97.7% identity over a 86 aa overlap 



10 20 30 40 50 60 

mOO 9 . pep MPRAAVAFERHHHKSKAEQNTHRRADAEIAEGFAVGNQHTQARKQSVMAVQLPPVAFSDK 
M I I 1 I M I I It I I I I I I I I I I I II I I I I M I I II I I I I II I I 11 I i I i I I I I .1 II I M 
a009 MPRAAVAFERHHHKSKAEQNTHRRADAEIAEGFAVGNQHTQARKQSVMAVQLPLVAFSDK 

10 20 30 40 50 60 

70 80 
mOO 9 . pep WVAFQAVVQAE I QV FADGGKT WQKPX 

I I I I I I I I : I I I I I I I I II M I I I I I I 
aOO 9 VW A FQ AV LQAE I QV FA DGGKT W QK PX 

70 80 



The following partial DNA sequence was 

g01O . seq 

1 ATGGGTTTTC CTGTTCGCAA 

51 TGCAGGTTTA CGTGCAGCCC 

101 CCGTTTTGTC TAAAGTGTTC 

151 GGCGGTATTT CCGCCTCTCT 

2 01 GCACATGTAC GATACCGTGA 

2 51 CGATTGAGTT TATGTGTCGC 

301 CACATGGGTA TGCCTTTTGA 

351 TCCTTTCGGC GGACATACTG 

4 01 CATGTGCGGT TGCCGACCGT 

4 51 CAACAAAACG TCCGTGCCAA 

501 AGATTTGATT CGTGATGAAA 

551 AAATGGAAAC GGGCGAAGTT 

601 GCTACCGGTG GCGGCGGTCG 

651 GAATACCGGT GACGGTTTGG 

7 01 AAGATATGGA ATTCTGGCAA 

7 51 GTGTTGATTA CCG AAGGCGT 

801 cgacggcgaA cgcTTTATGG 

851 CTTCTCGCga cgtGGTTTCA 

901 cgcggctgTG GtaaAAAcaA 

951 cggtGCAGAA AAAATTATGG 

1001 TTCagtttgc cGGTATCGAT 

1051 ACTACCCACT ATATGATGGG 

1101 TGTTGTTCCG CAAGGCGACG 

1151 CCGCAGGTGA GTGCGCCTGT 

1201 ACGAACTCCC TGCTGGACTT 



identified in TV. gonorrhoeae <SEQ ID 61>: 



GTTTGATGCC 
TCCAATTATC 
CCGACCCGCT 
GGGTAATGTG 
AAGGTTCCGA 
GCTGCGCCTG 
CCGCGTTGAA 
CCGAACATGG 
ACCGGTCATG 
TACACAATTC 
ACGGCGATGT 
TATATTTTCC 
TATTTATGCT 
GCATTTGCGC 
TTCCACCCGA 
ACGCGGCGAG 
AAcgctatgc 
CgcgcGatgG 
agaCCacgtC 
AAAAACTGCC 
CCGATTAAAG 
CGGCATTCcg 
AGTACGAAGT 
GCTTCCGTAC 
GGTGGTGTTC 



GTGATTGTCG 
CAAATCCGGT 
CGCATACCGT 
CAGGAGGACC 
CTGGCTGGGC 
AAGCGGTGAT 
AG CGG C AAAA 
TAAACGTGCG 
CGATGTTGCA 
TTTGTGGAAT 
CGTCGGCGTA 
ACGCCAAGGC 
TCTTCTACCA 
CCGTGCGGGC 
CCGGCGTGGC 
GGCGGTATTC 
GCcgACCGta 
CGatggaAAt 
TTACTGAAAA 
GGGCATCCGC 
ACCCGATTcc 
aCCAATTATC 
ACCTGTAAAA 
ACGGTGCGAA 
cgcccaaccc 



GCGGTGGCGG 
TTGAATTGTG 
AGCGGCGCAG 
GTTGGGACTG 
GACCAAGATG 
TGAGTTGGAA 
TTTATCAGCG 
GTAGAACGTG 
TACTTTGTAC 
GGACGGCGCA 
ACCGCCATGG 
CGTGATGTTT 
ATGCTTATAT 
ATTCCGTTGG 
GGGTGCGGGC 
TGTTGAacgc 
aAagaCTTGG 
ctatgaaggt 
TCGACcAtAt 
GAGATTTCCA 
ggttgTGCCG 
ACGGTGAAGT 
GGCCTGTATG 
CCGTTTGGGT 
cccggtga 



This corresponds to the amino acid sequence <SEQ ID 62; ORF 010.ng>: 

g010 . pep 
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1 MGFPVRKFDA 

51 GGISASLGNV 

101 HMGMPFDRVE 

151 QQNVRANTQF 

2 01 ATGGGGRIYA 

251 VLITEGVRGE 

301 RGCGKNKDHV 

351 TTHYMMGGIP 

4 01 TNSLLDLVVF 



VIVGGGGAGL 
QEDRW DWHMY 
SGKIYQRPFG 
FVEWTAQDLI 
SSTNAYMNTG 
GGILLNADGE 
LLKIDHIGAE 
TNYHGEWVP 
RPTPR* 



RAALQLSKSG 
DTVKGSDWLG 
GHTAEHGKRA 
RDENGDWGV 
DGLGICARAG 
RFMERYAPTV 
KIMEKLPGIR 
QGDEYEVPVK 



LNCAVLSKVF 
DQDAIEFMCR 
VERACAVADR 
TAMEMETGEV 
IPLEDMEFWQ 
KDLASRDWS 
EISIQFAGID 
GLYAAGECAC 



PTRSHTVAAQ 
AAPEAVIELE 
TGHAMLHTLY 
YIFHAKAVMF 
FHPTGVAGAG 
RAMAMEIYEG 
PIKDPIPWP 
ASVHGANRLG 



The following partial DNA sequence was identified in N. meningitidis <SEQ ED 63>: 

mOlO.seq (PARTIAL) 

1 . . nTCCAATTAT CCAAATCCGG TCTGAATTGT GCCGTTTTGT CTAAAGTGTT 

51 CCCGACCCGT TCGCATACCG TAGCGGCGCA GGGCGGTATT TCCGCCTCTn 

101 TGGGTAATGT GCAGGAAGAC CGTTGGGACT GGCACATGTA CGATACCGTG 

151 AAAGGTTCCG ACTGGTTGGG CGACCAAGAT GCGATTGAGT TTATGTGCCG 

201 CGCCGCGCCT GAAGCCGTAA TTGAGTTGGA ACACATGGGT ATGCCTTTTG 

251 ACCGTGTGGA AAGCGGTAAA ATTTATCAGC GTCCTTTCGG CGGCCATACT 

301 GCCGAACACG GTAAACGCGC GGTAGAACGC GyCTGTGCGG TTGCCGACCG 

351 TACAGGTCAT GCGATGCTGC ATACTTTGTA CCAACAAAAC GTCCGTGCCA 

4 01 ' ATACGCAATT CTTTGTGGAA TGGACGGCAC AAGATTTGAT TCGTGATGAA 

4 51 AACGGCGATG TCGTCGGCGT AACCGCCATG GAAATGGAAA CCGGCGAAgT 

501 TTATATTTTC CACGCTAAAG CTGTGATGTT TGCTACCGGC GGCGGCGGTC 

551 GTATTTATGC GTCTTCTACC AATGCCTATA TGAATACCGG CGATGGTTTG 

601 GGTATTTGTG CGCGTGCAGG TATCCCGTTG GAAGACATGG AATTCTGGCA 

651 ATTCCAGCCG ACCGGCGTGG CGGGTGCGGG CGTGTTGATT ACCGAA.... 



This corresponds to the amino acid sequence <SEQ ID 64; ORF 01 0>: 

mOlO.pop (PARTIAL) 

1 . . XQLSKSGLNC AVLSKVFPTR SHTVAAQGGI SASXGNVQED RWDWHMYDTV 

51 KGSDWLGDQD AIEFMCRAAP EAVIELEHMG MPFDRVESGK IYQRPFGGHT 

101 AEHGKRAVER XCAVADRTGH AMLHTLYQQN VRANTQFFVE WTAQDLIRDE 

151 NGDWGVTAM EMETGEVYIF HAKAVMFATG GGGRI YASST NAYMNTGDGL 

201 GICARAGIPL EDMEFWQFQP TGVAGAGVLI TE . . . 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 65>: 

a010 . seq 

1 ATGGGCTTTC CTGTTCGCAA GTTTGATGCC GTGATTGTCG GCGGTGGTGG 

51 TGCAGGTTTA CGCGCANCCC TCCAATTATC CAAATCCGGT CTGAATTGTG 

101 CCGTTTTGTC TAAAGTGTTC CCGACCCGTT CGCATACCGT AGCGGCGCAG 

151 GGCGGTATTT CCGCCTCTCT GGGTAATGTG CAGGAAGACC GTTGGGACTG 

201 GCACATGTAC GAT AC C G T G A AAGGTTCCGA CTGGTTGGGC GACCAAGATG 

251 CGATTGAGTT TATGTGCCGC GCCGCGCCTG AAGCCGTAAT TGAGTTGGAA 

301 CACATGGGTA TGCCTTTTGA CCGTGTGGAA AGCGGTAAAA TTTATCAGCG 

351 TCCTTTCGGC GGCCATACTG CCGAACACGG TAAACGCGCG GTAGAACGCG 

4 01 CCTGTGCNGT TGCCGACCGT ACAGGTCATG CGATGCTGCA TACTTTGTAC 

4 51 CAACAAAATG TCCGTGCCAA TACGCAATTC TTTGTGGAAT GGACGGCACA 

501 AGATTTGATT CGTGATGAAA ACGGCGATGT CGTCGGCGTA ACCGCCATGG 

551 AAATGGAAAC CGGCGAAGTT TATATTTTCC ACGCTAAAGC TGTGATGTTT 

601 GCTACCGGCG GCGGCGGCCG TATTTATGCG TCTTCTACCA ATGCCTATAT 

651 GAATACCGGC GATGGTTTGG GTATTTGTGC GCGTGCAGGT ATCCCGTTGG 

701 AAGACATGGA ATTCTGGCAA TTCCACCCGA CCGGCGTGGC AGGTGCGGGC 

7 51 GTGTTGATTA CCGAAGGCGT ACGCGGCGAG GGCGGTATTC TGTTGAATGC 

801 CGACGGCGAA CGCTTTATGG AACGCTATGC GCCGACCGTA AAAGACTTGG 

851 CTTCTCGCGA CGTTGTTTCC CGCGCGATGG CGATGGAAAT CTACGAAGGT 

901 CGCGGCTGCG GTAAAAACAA AGACCATGTC TTACTGAAAA TCGACCATAT 

951 CGGCGCAGAA AAAATTATGG AAAAACTGCC GGGCATCCGC GAGATTTCCA 

1001 TTCAGTTCGC CGGTATCGAT CCGATTAAAG ACCCGATTCC CGTTGTGCCG 

1051 ACTACCCACT ATATGATGGG CGGTATTCCG ACCAACTACC ATGGCGAAGT 

1101 TGTCGTTCCT CAAGGCGACG AATACGAAGT GCCTGTAAAA GGTCTGTATG 

1151 CGGCAGGTGA GTGCGCCTGT GCTTCCGTAC ACGGTGCGAA CCGCTTGGGT 

1201 ACGAACTCCC TGCTGGACTT AGTGGTATTC GGTAAAGCTG CCGGCGACAG 

1251 C AT GAT T AAA TTCATCAAAG AGCAAAGCGA CTGGAAACCT TTGCCTGCTA 
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1301 ATGCCGGCGA ACTGACCCGC 

1351 GATGGTGAAA ACGTTGATGC 

14 01 ATTGCACGCC GGCGTGTTCC 

14 51 GAGAAGTCAT GGCGATTGCC 

1501 AAGAGCAAAG TGTGGAATAC 

1551 CCTAATTGAA GTGGCGAAAG 

1601 AATCACGCGG TGCGCACGCT 

1651 AACTGGATGA AACATACGCT 

17 01 CAAACCGGTG CACACCAAGC 

17 51 AGCGCGTTTA TTGA 

This corresponds to the amino acid sequence <SEQ ID 66; ORF 010. a>: 

a010 .pep 

1 MGFPVRKFDA VIVGGGGAGL RAXLQLSKSG LNCAVLSKVF PTRSHTVAAQ 

51 GGISASLGNV QEDRWDWHMY DTVKGSDWLG DQDAIEFMCR AAPEAVIELE 

101 HMGMPFDRVE SGKIYQRPFG GHTAEHGKRA VERACAVADR TGHAMLHTLY 

151 QQNVRANTQF FVEWTAQDLI RDENGDWGV TAMEMETGEV YIFHAKAVMF 

201 ATGGGGRIYA SSTNAYMNTG DGLGICARAG IPLEDMEFWQ FHPTGVAGAG 

251 VLITEGVRGE GGILLNADGE RFMERYAPTV KDLASRDVVS RAMAMEIYEG 

301 RGCGKNKDHV LLKIDHIGAE KIMEKLPGIR EISIQFAGID PIKDPIPWP 

351 TTHYMMGGIP TNYHGEVWP QGDEYEVPVK GLYAAGECAC ASVHGANRLG 

4 01 TNSLLDLVVF GKAAGDSMIK FIKEQSDWKP LPANAGELTR QRIERLDNQT 

4 51 DGENVDALRR ELQRSVQLHA GVFRTDEILS KGVREVMAIA ERVKRTEIKD 

501 KSKVWNTARI EALELDNLIE VAKATLVSAE ARKE S RG AHA SDDHPERDDE 

551 NWMKHTLYHS DANTLSYKPV HTKPLSVEYI KPAKRVY* 



CAACGTATCG 
ATTGCGCCGC 
GTACTGATGA 
GAGCGTGTGA 
CGCGCGTATC 
CGACTTTGGT 
TCAGACGACC 
GTACCATTCA 
CTTTGAGCGT 



AGCGTTTGGA 
GAACTGCAAC 
GATTCTGAGC 
AACGTACCGA 
GAGGCTTTGG 
GTCTGCCGAA 
ATCCTGAGCG 
GATGCCAATA 
GGAATACATC 



CAATCAAACT 
GCTCCGTACA 
AAAGGCGTTC 
AATCAAAGAC 
AATTGGATAA 
GCACGTAAAG 
CGATGATGAA 
CCTTGTCCTA 
AAACCGGCCA 



moio/aoio 98.7% identity over a 231 aa overlap 

10 20 30 

mOlO.pep XQLSKSGLNCAVLSKVFPTRSHTVAAQGGISASXGNV 

M I I I II I I I I M I I I I I I II I I I II I I I I I I ill 
a010 MGFPVRKFDAVIVGGGGAGLRAXLQLSKSGLNCAVLSKVFPTRSHTVAAQGGISASLGNV 

10 20 30 40 50 60 

40 50 60 70 80 90 

mOlO . pep QEDRWDWHMYDTVKGSDWLGDQDAIEFMCRAAPEAVIELEHMGMPFDRVESGKIYQRPFG 
I I I I I I I I I I I I I I I I I II I I I 1 I I I II I I I I I I I I I I I I I I I I II M I I I I I I I I I I I I 
a010 QEDRWDWHMYDTVKGSDWLGDQDAIEFMCRAAPEAVIELEHMGMPFDRVESGKIYQRPFG 

70 80 90 100 110 120 

100 110 120 130 140 150 

mOlO . pep GHTAEHGKRAVERXCAVADRTGHAMLHTLYQQNVRANTQFFVEWTAQDLIRDENGDWGV 

I I I I I I I I I I I I I II M I I I I I I I I M I I I I 1 I I! I I I I I I I I t I I I I I I I I I I I I I I I 
aOlO GHTAEHGKRAVERACAVADRTGHAMLHTLYQQNVRANTQFFVEWTAQDLIRDENGDWGV 

130 140 150 160 170 180 

160 170 180 190 200 210 

mOlO.pep TAMEMETGEVYIFHAKAVMFATGGGGRI YASSTNAYMNTGDGLGICARAGIPLEDMEFWQ 

I I I I I II I I I I I I I 11 I I I I I I I I I I I I I I I I I I I M I I I I M I I I I I I I I I t I I t I I I I 
aOlO TAMEMETGEVYIFHAKAVMFATGGGGRI Y AS S TN A YMNTG DGLGICARAG IPLEDMEFWQ 

190 200 210 220 230 240 

220 230 
mO 10 . pep FQPTGVAGAGVLITE 
I : I I I I M II 1 I II I 

aOlO FHPTGVAGAGVLITEGVRGEGGILLNADGERFMERYAPTVKDLASRDVVSRAMAMEIYEG 

250 260 270 280 290 300 



Computer analysis of this amino acid sequence gave the following results: 
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Homology with a predicted ORF from N. gonorrhoeae 

ORF 010 shows 98.7% identity over a 231 aa overlap with a predicted ORF (ORF OlO.ng) 
from N. gonorrhoeae: 

mOlO .pep/g010 .pep 

10 20 30 

mOlO . pep XQLSKSGLNCAVLSKVFPTRSHTVAAQGGISASXGNV 

I I I I I II M I M I I I I I I I I I 1 I I I I I M I I I Ml 
g010 MGFPVRKFDAVIVGGGGAGLRAALQLSKSGLNCAVLSKVFPTRSHTVAAQGGISASLGNV 

10 20 30 40 50 60 

40 50 60 70 80 90 

mOlO.pep QEDRWDWHMYDTVKGSDWLGDQDAIEFMCRAAPEAVIELEHMGMPFDRVESGKIYQRPFG 
I I I I M M II II I I I I M t i II I I I I I I I I I II I I I I I I I II I I I I I I M I I I M II I II 
gOlO QEDRWDWHMYDTVKGSDWLGDQDAIEFMCRAAPEAVIELEHMGMPFDRVESGKIYQRPFG 

70 80 90 100 110 120 

100 110 120 130 140 150 

mOlO . pep GHTAEHGKRAVERXCAVADRTGHAMLHTLYQQNVRANTQFFVEWTAQDLIRDENGDVVGV 

I I I 1 I I I I I I II I I M I I I II I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
gOlO GHTAEHGKRAVERACAVADRTGHAMLHTLYQQNVRANTQFFVEWTAQDLIRDENGDWGV 

130 140 150 160 170 180 

160 170 180 190 200 210 

mOlO.pep TAMEMETGEVYIFHAKAVMFATGGGGRIYASSTNAYMNTGDGLGICARAGIPLEDMEFWQ 

I I II I I M I I I I I I I I 1 I I I I I I II I I I I II II I I I I I II I I I I II I I I I I I II I I I I II 
gOlO TAMEMETGEVYIFHAKAVMFATGGGGRIYASSTNAYMNTGDGLGICARAGIPLEDMEFWQ 

190 200 210 220 230 240 

220 230 
mOlO . pep FQPTGVAGAGVLITE 
I : I I I M I I I I I M I 

gOlO FHPTGVAGAGVLITEGVRGEGGILLNADGERFMERYAPTVKDLASRDWSRAMAME.IYEG 

250 260 270 280 290 300 



The following partial DNA sequence was identified in TV. gonorrhoeae <SEQ ID 67>: 

g010-l . seq. . 

1 ATGGGTTTTC CTGTTCGCAA GTTTGATGCC GTGATTGTCG GCGGTGGCGG 

51 TGCAGGTTTA CGTGCAGCCC TCCAATTATC CAAATCCGGT TTGAATTGTG 

101 CCGTTTTGTC TAAAGTGTTC CCGACCCGCT CGCATACCGT AGCGGCGCAG 

151 GGCGGTATTT CCGCCTCTCT GGGTAATGTG CAGGAGGACC GTTGGGACTG 

201 GCACATGTAC GATACCGTGA AAGGTTCCGA CTGGCTGGGC GACCAAGATG 

251 CGATTGAGTT TATGTGTCGC GCTGCGCCTG AAGCGGTGAT TGAGTTGGAA 

301 CACATGGGTA TGCCTTTTGA CCGCGTTGAA AGCGGCAAAA TTTATCAGCG 

351 TCCTTTCGGC GGACATACTG CCGAACATGG TAAACGTGCG GTAGAACGTG 

4 01 CATGTGCGGT TGCCGACCGT ACCGGTCATG CGATGTTGCA TACTTTGTAC 

4 51 CAACAAAACG TCCGTGCCAA TACACAATTC TTTGTGGAAT GGACGGCGCA 

501 AGATTTGATT CGTGATGAAA ACGGCGATGT CGTCGGCGTA ACCGCCATGG 

551 AAATGGAAAC GGGCGAAGTT TATATTTTCC ACGCCAAGGC CGTGATGTTT 

601 GCTACCGGTG GCGGCGGTCG TATTTATGCT TCTTCTACCA ATGCTTATAT 

651 GAATACCGGT GACGGTTTGG GCATTTGCGC CCGTGCGGGC ATTCCGTTGG 

7 01 AAGATATGGA ATTCTGGCAA TTCCACCCGA CCGGCGTGGC GGGTGCGGGC 

7 51 GTGTTGATTA CCGAAGGCGT ACGCGGCGAG GGCGGTATTC TGTTGAACGC 

801 CGACGGCGAA CGCTTTATGG AACGCTATGC GCCGACCGTA AAAGACTTGG 

851 CTTCTCGCGA CGTGGTTTCA CGCGCGATGG CGATGGAAAT CTATGAAGGT 

901 CGCGGCTGTG GTAAAAACAA AGACCACGTC TTACTGAAAA TCGACCATAT 

951 CGGTGCAGAA AAAATTATGG AAAAACTGCC GGGCATCCGC GAGATTTCCA 

1001 TTCAGTTTGC CGGTATCGAT CCGATTAAAG ACCCGATTCC GGTTGTGCCG 

1051 ACTACCCACT ATATGATGGG CGGCATTCCG ACCAATTATC ACGGTGAAGT 

1101 TGTTGTTCCG CAAGGCGACG AGTACGAAGT ACCTGTAAAA GGCCTGTATG 

1151 CCGCAGGTGA GTGCGCCTGT GCTTCCGTAC ACGGTGCGAA CCGTTTGGGT 
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12 01 ACGAACTCCC TGCTGGACTT GGTGGTGTTC cgcccaaccc cccggtga 

This corresponds to the amino acid sequence <SEQ ID 68; ORF 010-l.ng>: 

gOlO-l .pep 

1 MGFPVRKFDA VIVGGGGAGL RAALQLSKSG LNCAVLSKVF PTRSHTVAAQ 

51 GGISASLGNV QEDRW DWHMY DTVKGSDWLG DQDAIEFMCR AAPEAVIELE 

101 HMGMPFDRVE SGKIYQRPFG GHTAEHGKRA VERACAVADR TGHAMLHTLY 

151 QQNVRANTQF FVEWTAQDLI RDENGDWGV TAMEMETGEV YIFHAKAVMF 

201 ATGGGGRIYA SSTNAYMNTG DGLGICARAG IPLEDMEFWQ FHPTGVAGAG 

2 51 VLITEGVRGE GGILLNADGE RFMERYAPTV KDLASRDWS RAMAMEIYEG 
301 RGCGKNKDHV LLKIDHIGAE KIMEKLPGIR EISIQFAGID PIKDPIPWP 

3 51 TTHYMMGGIP TNYHGEVWP QGDEYEVPVK GLYAAGECAC ASVHGANRLG 

4 01 TNSLLDLWF RPTPR* 

g010-l / P10444 

sp| P10444 |DHSA_ECOLI SUCCINATE DEHYDROGENASE FLAVOPROTEIN SUBUNIT 

gnl| PID|dl015210 (D90711) Succinate dehydrogenase, flavoprotein [Escherichia coli] gi | 1786942 
(AE000175) succinate dehydrogenase flavoprotein subunit [Escherichia coli] Length = 588 

Score = 1073 (495.6 bits). Expect = 6.7e-169, Sum P(2) = 6.7e-169 

Identities = 191/303 (63%) , Positives = 238/303 (78%) 

Query: 1 MGFP\^KFDAVIVXXXXXXXXXXXXXSKSGI^C^VLSKVFPTRSHTVAAO^GISASLGNV 6 0 

M PVR+FDAV++ S + SG CA+LSKVFPTRSHTV+AQGGI + +LGN 

Sbjct: 1 MKLPVREFDAVVIGAGGAGMRAALQISQSGQTCALLSKVFPTRSHTVSAQGGITVALGNT 60 

Query: 61 QEDRWDWHMYD T VKG S DWLGDQDA I E FMC RAA P E A V I E LEHMGM P FDR VESG K I YQR P FG 12 0 

ED W + WHMYDTVKG S D + +GDQDA I E +MC + PEA++ELEHMG+PF R++ G+IYQRPFG 
Sbjct: 61 HEDNWEWHhTYDTVKGSDYIGDQDAIEYMCKTGPEAILELEHMGLPFSRLDDGRIYQRPFG 120 

Query: 121 GHT AE HG KRAVERACA V ADRTGHAMIiHTL YQQNVRANTQ FFVEWTAQDL I RDENGDWGV 180 

G + G R A ADRTGHA + LHT L YQQN + + +T F EW A DL+++++G WG 

Sbjct: 121 GQS KN FGG EQ AART AAAADRTGHALLHT L YQQNLKNHTT I F S E WY ALD LVKNQDG A WG C 180 

Query: 181 TAMEMETGEVYIFHAKAVMFATGGGGR I YASSTNAYMNTGDGLG I CARAG IPLEDMEFWQ 24 0 

TA+ +ETGEV F A+A + ATGG GRIY S+TNA++NTGDG+G+ RAG+P++DME WQ 
Sbjct: 181 TALC I ETGEWYF KARATVLATGG AG R I YQS TTN AH I NTGDG VGMA I RAG VP VQDM EMWQ 24 0 

Query: 241 FHPTGVAGAGVLITEGVRGEGGILLNADGERFMERYAPTVKDLASRDVVSRAMAMEIYEG 300 

FH PTG + AGAGVL + TEG RGEGG LLN GERFMERYAP KDLA RDW+R++ +EI EG 
Sbjct: 241 FHPTGIAGAGVLVTEGCRGEGGYLLNKHGERFMERYAPNAKDLAGRDWARSIMIEIREG 3 00 



Query : 
Sbjct : 



3 01 RGC 3 03 

RGC 
301 RGC 303 



Score = 249 (115.0 bits), Expect = 6.7e-169, Sum P(2) = 6.7e-169 
Identities = 53/102 (51%), Positives = 62/102 (60%) 

Query: 309 HVLLK I DH I G AE K I ME KLPG I RE I S I QF AGXXXXXXXXXXXXTTHYMMGG I PTNYHG E W 368 

H LK+DH+G E + +LPGI E+S FA T HYMMGGIPT G+ + 

Sbjct: 310 HAKLKLDHLG KE VLE SRLPGILE LS RT F AHVD P VKE P I PVI PTCHYMMGG I PTKVTGQAL 36 9 

Query: 369 VPQGDEYEVPVKGLYAAGECACASVHGANRLGTNSLLDLWF 410 

+V V GL+A GE AC SVHGANRLG NSLLDLWF 
Sbjct: 370 TVNEKGEDWVPGLFAVGEIACVSVHGANRLGGNSLLDLWF 411 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 69>: 

m010-l . seq. . 

1 ATGGGTTTTC CTGTTCGCAA GTTTGATGCC GTGATTGTCG 

51 TGCAGGTTTA CGCGCAGCCC TCCAATTATC CAAATCCGGT 

101 CCGTTTTGTC TAAAGTGTTC CCGACCCGTT CGCATACCGT 

151 GGCGGTATTT CCGCCTCTCT GGGTAATGTG CAGGAAGACC 

2 01 GCACATGTAC GATACCGTGA AAGGTTCCGA CTGGTTGGGC 
251 CGATTGAGTT TATGTGCCGC GCCGCGCCTG AAGCCGTAAT 
301 CACATGGGTA TGCCTTTTGA CCGTGTGGAA AGCGGTAAAA 

3 51 TCCTTTCGGC GGCCATACTG CCGAACACGG TAAACGCGCG 

4 01 CCTGTGCGGT TGCCGACCGT ACAGGTCATG CGATGCTGCA 



GCGGTGGTGG 
CTGAATTGTG 
AGCGGCGCAg 
GTTGGGACTG 
GACCAAGATG 
TGAGTTGGAA 
TTTATCAGCG 
GTAGAACGCG 
TACTTTGTAC 
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451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 
1151 
1201 
1251 
1301 
1351 
1401 
1451 
1501 
1551 
1601 
1651 
1701 
1751 



CAACAAAACG 
AGATTTGATT 
AAATGGAAAC 
GCTACCGGCG 
GAATACCGGC 
AAGACATGGA 
GTGTTGATTA 
CGACGGCGAA 
CTTCTCGCGA 
CGCGGCTGCG 
CGGCGCAGAA 
TTCAGTTCGC 
ACTACCCACT 
TGTCGTTCCG 
CGGCAGGTGA 
ACCAACTCCC 
CAT GAT T AAA 
ATGCAGGTGA 
GATGGTGAAA 
ACTGCACGCC 
GAGAAGTCAT 
AAGAGCAAAG 
CCTGATTGAA 
AATCACGCGG 
AACTGGATGA 
CAAACCGGTG 
AGCGCGTTTA 



TCCGTGCCAA 
CGTGATGAAA 
CGGCGAAGTT 
GCGGCGGTCG 
GATGGTTTGG 
ATTCTGGCAA 
CCGAAGGCGT 
CGCTTTATGG 
CGTTGTTTCC 
GTAAAAACAA 
AAAATTATGG 
CGGTATCGAT 
AT AT GAT GG G 
CAAGGTGAAG 
GTGCGCTTGT 
TGTTGGACTT 
TTCATCAAAG 
GTTGACCCGC 
ACGTTGATGC 
GGCGTGTTCC 
GGCGATTGCC 
TGTGGAATAC 
GTGGCGAAAG 
TGCGCACGCT 
AACATACGCT 
CACACCAAGC 
TTGATGA 



TACGCAATTC 
ACGGCGATGT 
TATATTTTCC 
TATTTATGCG 
GTATTTGTGC 
TTCCACCCGA 
ACGCGGCGAG 
AACGCTATGC 
CGCGCGATGG 
AGACCATGTC 
AAAAACTGCC 
CCGATTAAAG 
CGGCATTCCG 
ATTACGAAGT 
GCTTCCGTAC 
GGTGGTATTC 
AGCAAAGCGA 
CAACGTATCG 
ATTGCGTCGC 
GTACTGATGA 
GAGCGTGTGA 
CGCGCGTATC 
CGACTTTGGT 
TCAGACGACC 
GTACCATTCA 
CTTTGAGCGT 



TTTGTGGAAT 
CGTCGGCGTA 
ACGCTAAAGC 
TCTTCTACCA 
GCGTGCAGGT 
CCGGCGTGGC 
GGCGGTATTC 
GCCGACCGTA 
CGATGGAAAT 
TTACTGAAAA 
GGGCATCCGC 
ACCCGATTCC 
ACCAATTACC 
GCCTGTAAAA 
ACGGTGCGAA 
GGTAAAGCTG 
CTGGAAACCT 
AGCGTTTGGA 
GAACTGCAAC 
GATTCTGAGC 
AACGTACCGA 
GAGGCTTTGG 
GTCTGCCGAA 
ATCCTGAGCG 
GATATCAATA 
GGAATACATC 



GGACGGCACA 
ACCGCCATGG 
TGTGATGTTT 
ATGCCTATAT 
ATCCCGTTGG 
GGGTGCGGGC 
TGTTGAATGC 
AAAGACTTGG 
CTACGAAGGT 
TCGACCATAT 
GAGATTTCCA 
CGTTGTGCCG 
ACGGCGAAGT 
GGTCTGTATG 
CCGCTTGGGT 
CCGGCGACAG 
TTGCCTGCTA 
CAACCAAACC 
GCTCTGTACA 
AAAGGCGTTC 
AATCAAAGAC 
AATTGGATAA 
GCACGTAAAG 
C G AT GAT G AA 
CCTTGTCCTA 
AAACCGGCCA 



This corresponds to the amino acid sequence <SEQ ID 70; ORF 010-1>: 

m010-l .pep . . 

MGFPVRKFDA VIVGGGGAGL 
GGISASLGNV QEDRWDWHMY 
HMGMPFDRVE SGKIYQRPFG 
QQNVRANTQF FVEWTAQDLI 
ATGGGGRIYA SSTNAYMNTG 
VLITEGVRGE GGILLNADGE 



1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 



LNCAVLSKVF PTRSHTVAAQ 
DQDAIEFMCR AAPEAVIELE 
VERACAVADR TGHAMLHTLY 
TAMEMETGEV YIFHAKAVMF 
IPLEDMEFWQ FHPTGVAGAG 
KDLASRDWS RAMAMEIYEG 



RAALQLSKSG 
DTVKGSDWLG 
GHTAEHGKRA 
RDENGDWGV 
DGLGICARAG 
RFMERYAPTV 

RGCGKNKDHV LLKIDHIGAE KIMEKLPGIR EISIQFAGID PIKDPIPWP 
TTHYMMGGIP TNYHGEVWP QGEDYEVPVK GLYAAGECAC ASVHGANRLG 
TNSLLDLWF GKAAGDSMIK FIKEQSDWKP LPANAGELTR QRIERLDNQT 
DGENVDALRR ELQRSVQLHA GVFRTDEILS KGVREVMAIA ERVKRTEIKD 
KSKVWNTARI EALELDNLIE VAKATLVSAE ARKESRGAHA SDDHPERDDE 
NWMKHTLYHS DTNTLSYKPV HTKPLSVEYI KPAKRVY* 



mOlO-1 / g010-l 99.5% identity in 410 aa overlap 

10 20 30 40 50 60 

m010-l . pep MGFPVRKFDA VTVGGGGAGLRAAliQL S KSGLNCAVL S KVFPTRS HTVAAQGG I SASLGNV 
I I 1 I I I I I I i I I I I I I I I I I I I I I I I I 1 I I I 1 I I 1 I I I I I I 1 I I I M I I I I I I I I I I I I t 
gO10-l MGFP VRKFD AVT VGGGGAGLRAAIjQL S KS GLNCAVL S KVFPTRS H TVAAQGG I S AS LGNV 

10 20 30 40 50 60 

70 60 90 100 110 120 

mO 1 0 - 1 . pep QEDRWDWHMYDTVKGSDWLGDQDAIEFMCRAAPEAVXELEHMGMPFDRVESGKIYQRPFG 
I I | I I I I t I I I I I M I I I I I 1 I I 1 I I I I I I I I I I I I I I I I I I I I I I t I I I 1 I I I I I I I I I 
gO 1 0 - 1 QEDRWDWHMYDTVKG SDWLGDQDAIEFMCRAAPEAVI ELE HMGMPFDRVE SGKI YQRPFG 

70 80 90 100 110 120 

130 140 150 160 170 180 

m010-l - pep GH TAEH GKRAVERAGAVJ^RTGHAMLHTLYQQNVRANTQFFVEVn?AQDL I RDENGDWGV 

I I I I I I I I I I I I I It I I I I I I I I t I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I ! I I I 
gO 10 - 1 GHT AE H GKRAVERACAVADRTGHAMLHTL YQQNVRANTQFFVEWTAQDL 1RDENGD WGV 

130 140 150 160 170 180 

190 200 210 220 230 240 

m010-l .pep T AMEMETGEVY I FHAKA VMF AT GGGGR I YAS S TNAYMNT GD GLG I CARAGI PLEDME FWQ 

I I f I I I I I I I I I I I I I I t I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I t I I I I I I I I I 
g010-l TAMEME TGEVY I FHAKAVMFATGGGGRI YAS S TNAYMNTGD GLG I CARAG I PLEDME FWQ 
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190 200 210 220 230 240 

250 260 270 280 290 300 

FH PT GVAGAGVL I TE GVRGE GG I LLNAD GERFMERY APTVKDLAS RD WS RAMAME I YE G 
I I I t | t I I I I I I I I I I I I t I I I I I I t I I I i II I I I I I I I I I t I I I I I I I I I I I I I I I M I 
FHPTGVAGAGVLITE GVRGE GGIIXHADGERFMERYAFTVKDIASRDWSRAMAMEIYEG 
250 260 270 280 290 300 

310 320 330 340 350 360 

RGCGKNKDHVLLKIDH I GAEKIMEKLPGIRE I S IQFAGIDPIKDPI PWPTTH YMMGGI P 

I I I I I I I I I I I I I I I I I I I I I I I I I I I t I I I I I I I I I I I I I I I I I I I I I I I M I I 

RGCGKNKDHVLLKIDH I GAEKIMEKLPGIRE I S IQFAGIDP I KDP I PWPTTH YMMGGI P 
310 320 330 340 350 360 

370 380 390 400 410 420 

TNYHGEVV\fPQGEDYEVPVKGLYAAGECACASVHGANRLGTNSL 

I I I I I I I I I I I I :: I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
TNYHGEVVVPQGDEYXVPVKGLYAAGECACASVHGANRLGTNSIiDLVVFRPTPRX 
370 380 390 400 410 

430 440 450 460 470 480 

F I KEQ SDWKPLP ANAGE L TRQR I E RLDNQTD GENVD ALRRE LQRS VQL HAGVFRTD E I L S 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 71>: 

a010-l . seq. . 

1 ATGGGCTTTC CTGTTCGCAA GTTTGATGCC GTGATTGTCG GCGGTGGTGG 

51 TGCAGGTTTA CGCGCANCCC TCCAATTATC CAAATCCGGT CTGAATTGTG 

101 CCGTTTTGTC TAAAGTGTTC CCGACCCGTT CGCATACCGT AGCGGCGCAG 

151 GGCGGTATTT CCGCCTCTCT GGGTAATGTG CAGGAAGACC GTTGGGACTG 

201 GCACATGTAC GATACCGTGA AAGGTTCCGA CTGGTTGGGC GACCAAGATG 

251 CGATTGAGTT TATGTGCCGC GCCGCGCCTG AAGCCGTAAT TGAGTTGGAA 

301 CACATGGGTA TGCCTTTTGA CCGTGTGGAA AGCGGTAAAA TTTATCAGCG 

351 TCCTTTCGGC GGCCATACTG CCGAACACGG TAAACGCGCG GTAGAACGCG 

4 01 CCTGTGCNGT TGCCGACCGT ACAGGTCATG CGATGCTGCA TACTTTGTAC 

4 51 CAACAAAATG TCCGTGCCAA TACGCAATTC TTTGTGGAAT GGACGGCACA 

501 AGATTTGATT CGTGATGAAA ACGGCGATGT CGTCGGCGTA ACCGCCATGG 

551 AAATGGAAAC CGGCGAAGTT TATATTTTCC ACGCTAAAGC TGTGATGTTT 

601 GCTACCGGCG GCGGCGGCCG TATTTATGCG TCTTCTACCA ATGCCTATAT 

651 GAATACCGGC GATGGTTTGG GTATTTGTGC GCGTGCAGGT ATCCCGTTGG 

7 01 AAGACATGGA ATTCTGGCAA TTCCACCCGA CCGGCGTGGC AGGTGCGGGC 

7 51 GTGTTGATTA CCGAAGGCGT ACGCGGCGAG GGCGGTATTC TGTTGAATGC 
801 CGACGGCGAA CGCTTTATGG AACGCTATGC GCCGACCGTA AAAGACTTGG 

8 51 CTTCTCGCGA CGTTGTTTCC CGCGCGATGG CGATGGAAAT CTACGAAGGT 
901 CGCGGCTGCG GTAAAAACAA AGACCATGTC TTACTGAAAA TCGACCATAT 
951 CGGCGCAGAA AAAATTATGG AAAAACTGCC GGGCATCCGC GAGATTTCCA 

1001 TTCAGTTCGC CGGTATCGAT CCGATTAAAG ACCCGATTCC CGTTGTGCCG 

1051 ACTACCCACT AT AT GAT G GG CGGTATTCCG ACCAACTACC ATGGCGAAGT 

1101 TGTCGTTCCT CAAGGCGACG AATACGAAGT GCCTGTAAAA GGTCTGTATG 

1151 CGGCAGGTGA GTGCGCCTGT GCTTCCGTAC ACGGTGCGAA CCGCTTGGGT 

1201 ACGAACTCCC TGCTGGACTT AGTGGTATTC GGTAAAGCTG CCGGCGACAG 

1251 CATGATTAAA TTCATCAAAG AGCAAAGCGA CTGGAAACCT TTGCCTGCTA 

1301 ATGCCGGCGA ACTGACCCGC CAACGTATCG AGCGTTTGGA CAATCAAACT 

1351 GATGGTGAAA ACGTTGATGC ATTGCGCCGC GAACTGCAAC GCTCCGTACA 

14 01 ATTGCACGCC GGCGTGTTCC GTACTGATGA GATTCTGAGC AAAGGCGTTC 

14 51 GAGAAGTCAT GGCGATTGCC GAGCGTGTGA AACGTACCGA AATCAAAGAC 

1501 AAGAGCAAAG TGTGGAATAC CGCGCGTATC GAGGCTTTGG AATTGGATAA 

1551 CCTAATTGAA GTGGCGAAAG CGACTTTGGT GTCTGCCGAA GCACGTAAAG 

1601 AATCACGCGG TGCGCACGCT TCAGACGACC ATCCTGAGCG CGATGATGAA 

1651 AACTGGATGA AACATACGCT GTACCATTCA GATGCCAATA CCTTGTCCTA 

17 01 CAAACCGGTG CACACCAAGC CTTTGAGCGT GGAATACATC AAACCGGCCA 

17 51 AGCGCGTTTA TTGA 

This corresponds to the amino acid sequence <SEQ ID 72; ORF 010-l.a>: 

a010-l .pep. . 

1 MGFPVRKFDA VIVGGGGAGL RAXLQLSKSG LNCAVLSKVF PTRSHTVAAQ 

51 GGISASLGNV QEDRWDWHMY DTVKGSDWLG DQDAIEFMCR AAPEAVIELE 

101 HMGMPFDRVE SGKIYQRPFG GHTAEHGKRA VERACAVADR TGHAMLHTLY 



m010-l . pep 
g010-l 

m010-l -pep 
g010-l 

m010-l .pep 
g010-l 

m010-l .pep 
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151 
201 
251 
301 
351 
401 
451 
501 
551 



QQNVRANTQF 
ATGGGGRIYA 
VLITEGVRGE 
RGCGKNKDHV 
TTHYMMGGIP 
TNSLLDLWF 
DGENVDALRR 
KSKVWNTARI 
NWMKHTLYHS 



FVEWTAQDLI 
SSTNAYMNTG 
GGILLNADGE 
LLKIDHIGAE 
TNYHGEVVVP 
GKAAGDSMIK 
ELQRSVQLHA 
EALELDNLIE 
DANTLSYKPV 



RDENGDWGV 
DGLGICARAG 
RFMERYAPTV 
KIMEKLPGIR 
QGDEYEVPVK 
FIKEQSDWKP 
GVFRTDEILS 
VAKATLVSAE 
HTKPLSVEYI 



m010-l / a010-l 99.3% identity in 587 aa overlap 



TAMEMETGEV 
IPLEDMEFWQ 
KDLASRDWS 
EISIQFAGID 
GLYAAGECAC 
LPANAGELTR 
KGVREVMAIA 
ARKES RGAHA 
KPAKRVY* 



YIFHAKAVMF 
FHPTGVAGAG 
RAMAMEI YEG 
PIKDPIPVVP 
ASVHGANRLG 
QRIERLDNQT 
ERVKRTEIKD 
SDDHPERDDE 



a01O-l .pep 



a010-l 



40 



50 



60 



10 20 30 

MGFPVRKFDAVI VGGGGAGLRAXLQLSKSGLNCAVLSKVFPTRSHTVAAQGG I SASLGNV 

lllllllllllllllllllll! IIMIIIIIIIIIIIIMIIIIIIIIimillllll 

MG F PVRKFDAVIVGGGGAGLRAALQLS KS GLNCA VLS KV F PTR SHTVAAQGG I S AS LGNV 
10 20 30 40 50 60 



70 80 90 100 110 120 

a010-l pep QEDRWDWHMYDTVKGSDWLGDQDAIEFMCRAAPEAVIELEHMGMPFDRVESGKIYQRPFG 

IIIMIIIIIMIIIIIIIIIIIIIIINIIMIIIIIIllllllllllllilllllMI 

m 01 0 - 1 QEDRWDWHMYDTVKGSDWIiGDQDAIEFMCRAAPEAVIELEHMGMPFDRVESGKIYQRPFG 

70 80 90 100 110 120 

130 140 150 160 170 180 

a010-l pep GHT AEHG KRAVE RACA VAJDRTGHAMLHT L YQQNVRANTQ F FVEWT AQD LI RD ENGDWGV 

IIIMMIIIIIIIIIIIIIIIIIMIIII1IIIIIIIMIIMIIIIIIIIIMMIII 

mO 1 0 - 1 GHT AEHG KRAVE RAGAVADRTGHAMLHT L YQQNVRANTQF FVE WT AQD L I RDENGDWGV 

130 140 150 160 170 180 



a010-l -pep 
m010-l 



190 200 210 220 230 240 

TAMEMETGEVY I FHAKAVMFATGGGGRI YAS STNAYMNTGDGLG I CARAG I PLEDMEFWQ 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIMIIIIMIIIII 

TAMEMETG EVYI FHAKAVMFATGGGGR I YAS STNAYMNTGDGliG I CARAG I PLEDMEFWQ 
190 200 210 220 230 240 



250 260 270 280 290 300 

a010-l pep FHPTGVAGAGVLITEGVKGEGGILI^ADGERFMERYAPTVXDIASRDWSRAMAMEIYEG 

IlilllllllMHIIIIIIIIIIIIIIIIIIMIIIIIIIIMIIIMIIIIIIIIIII 

m oiO - 1 FHPTGVAGAGVXjITEGVKGEGGILI^ADGERFMERYAPTVTOJlASRDVVSRAMAMEIYEG 

250 260 270 280 290 300 

310 320 330 340 350 360 

a010-l pep RGCGKNKSHVTaLKIDHIGAEKIMEKliPGIREISIQFAGIDPIKDPIPWPTTHYMMGGIP 

IIIIIIII11IMIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIMIIIIIII 

m 010-l RGCG KN KDHVLLK I DH I G AE K I ME KLPG I RE I S I QFAG I D P I KD P I P WPTTHYMMGG I P 

310 320 330 340 350 360 

370 380 390 400 410 420 

aOlO-l.pep TNYHGE WV PQGDEYEVPVKGLYAAGECACAS VHGANRLGTNSLLDLWFGKAAGDSM I K 

I MMI I II I I M- M II III IMIIII MM IMM IMIM MMMIII I I I I I M 

m0 10 - 1 TNYHGE VWPCCEDYEVPVKGLYAAGECACASVHGANRLGTNSLLDLVVFGKAAGDSM IK 

370 380 390 400 410 420 

430 440 450 460 470 480 

a010-l pep FIKEQSDWKPLPANAGELTRQRIERLDNQTDGENVDALRRELQRSVQLHAGVFRTDEILS 

I II Ml Ml I I II I I II II I II MM II IIMI I III Mil! II I II I I INI II I I I M 

m010 -1 FIKEQSDWKPLPANAGELTRQRIERLDNQTDGENVDALRRELQRSVQLHAGVFRTDEILS 

430 440 450 460 470 480 

490 500 510 520 530 540 

aOlO-l pep KG VREVMA I AER VKRT E I KDKS KVWNT AR I EAL E LDNLj I EVAKAT LVS AEARKE S RGAHA 

MUM Mill MINI MMIIIMM MIMMIMIMI MIIIMIIIIMI III I 

m OlO - 1 KGVREVMAIAERVKRTEIKD KSKVWNTARI EALELDNLI EVAKATLVSAEARKES RGAHA 

490 500 510 520 530 540 



550 560 570 580 

aOlO-l -pep SDDHPERDDENWMKHTLYHSDANTLSYKPVHTKPLSVEYIKPAKRVYX 

lllllllllllllllllllll Mi III Mill I MMI MMM III 
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mO 1 0 - 1 SDDHPERDDENWMKHTLYHSDINTLSYKPVHTKPLSVEYI KPAKRVYX 

550 560 570 580 

The following partial DNA sequence was identified in TV. gonorrhoeae <SEQ ID 73>: 

gOll . seq 

1 ATGAAGACAC ACCGCAAGAC CTGCTCTGCG GTGTGTTTTG CTTTTCAGAC 

51 GGCATCGAAA CCCGCCGTTT CCATCCGACA TCCCAGCGAG GACATCATGA 

101 GCCTGAAAAC CCGCCTTACC GAAGATATGA AAACCGCGAT GCGCGCCAAA 

151 GATCAAGTTT CCCTCGGCAC CATCCGCCTC ATCAATGCCG CCGTCAAACA 

201 GTTTGAAGTA GACGAACGCA CCGAAGCCGA CGATGCCAAA ATCACCGCCA 

2 51 TCCTGACCAA AATGGTCAAA CAGCGCAAAG ACGGCGCGAA AATCTACACT 

301 GAAGCCGGCC GTCAGGATTT GGCAGACAAA GAAAACGCCG AAATCGACGT 

351 GCTGCACCGC TACCTGCCGC AAATGCTCTC CGCCGGCGAA ATCCGCACCG 

4 01 CCGTCGAAGC AGCCGTTGCC GAAACCGGCG CGGCAGGTAT GGCGGATATG 

4 51 GGCAAAGTGA TGGTCGTATT GAAAAcccGC CTCGCCGGCA AAGccgATAT 

501 GGGCGAAGTC AACAAAATCT TGAAAAccGt aCTGACCGCC tga 

This corresponds to the amino acid sequence <SEQ ID 74; ORF 01 l.ng>: 

gOll.pep 

1 MKTHRKTCSA VCFAFQTASK PAVSIRHPSE DIMSLKTRLT EDMKTAMRAK 

51 DQVSLGTIRL INAAVKQFEV DERTEADDAK ITAILTKMVK QRKDGAKIYT 

101 EAGRQDLADK ENAEIDVLHR YLPQMLSAGE IRTAVEAAVA ETGAAGMADM 

151 GKVMWLKTR LAGKADMGEV NKILKTVLTA * 

The following partial DNA sequence was identified in K meningitidis <SEQ ED 75>: 

mO 1 1 . seq (partial ) 

1 ATGAGGACAC ACCGCAAGAC CTGCTCTGCG GTGTGTTTTG CTTTTCAGAC 

51 GGCATCGAAA CCCGCCGTTT CCATCCGACA TCCCAGCGAG GACATCATGA 

101 GCCTGAAAAT CCGCCTTACC GAAGACATGA AAACCGCGAT GCGCGCCAAA 

151 GACCAAGTTT CCCTCGGCAC CATCCGCCTC ATCAACGCCG CCGTCAAACA 

201 GTTTGAAGTG GACGAACGCA CCGAAGCCGA CGATGCCAAA ATCACCGCCA 

2 51 TCCTGACCAA AATGGTCAAA CAGCGAAAAG ACAGCGCGAA AATCTACACT 

301 GAAGCCGGCC GTCAGGATTT GGCAGACAAA GAAAACGCCG AAATCGAGGT 

351 ACTGCACCGC TACCTTCCCC AAATGCTTTC CGCCGGCGAA ATCCGTACCG 

4 01 AGGTCGAAGC TGCCGTTGCC GAAACCGGCG CGGCAGGTAT GGCGGATATG 

451 GGTAAAGTCA TGGGGCTGCT GAAAACCCGC CTCGCAGGTA AAGCCGA. . . 

This corresponds to the amino acid sequence <SEQ ID 76; ORF 01 1>: 

mOll.pep (partial) 

1 MRTHRKTCSA VCFAFQTASK PAVSIRHPSE DIMSLKIRLT EDMKTAMRAK 

51 DQVSLGTIRL INAAVKQFEV DERTEADDAK ITAILTKMVK QRKDSAKIYT 

101 EAGRQDLADK ENAEIEVLHR YLPQMLSAGE IRTEVEAAVA ETGAAGMADM 

151 GKVMGLLKTR LAGKA 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 01 1 shows 95.8% identity over a 165 aa overlap with a predicted ORF (ORF 01 l.ng) 
from TV. gonorrhoeae: 

mOll/gOll 



raOll -pep 
gOll 



mOll . pep 
gOll 



10 20 30 40 50 60 

MRTHRKTCSAVCFAFQTASKPAVSIRHPSEDIMSLKIRLTEDMKTAMRAKDQVSLGTIRL 

t : I I M I I I M I I I I I I I! I I I 1 I I M I I I I I I I I I I I I I I I I I I I 11 I I 1 I I I I I I I I 
MKTHRKTCSAVCFAFQTASKPAVSIRHPSEDIMSLKTRLTEDMKTAMRAKDQVSLGTIRL 
10 20 30 40 50 60 

70 80 90 100 110 120 

INAAVKQFEVDERTEADDAKITAILTKMVKQRKDSAKIYTEAGRQDLADKENAEIEVLHR 
I I I I I I I I I I I I t I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I II I : I I I I 
INAAVKQ FE VDERTEADDAKI T AI LTKMVKQRKDGAK I YTE AGRQDLADKENAE I DVLHR 
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70 80 90 100 110 120 

130 140 150 160 

mOl 1 . pep YLPQMLSAGEIRTEVEAAVAETGAAGMADMGKVMGLLKTRLAGKA 
I I I M I I I I I M I M I I M I I I I I I I I I I I I I i : I I I I I I ! I I 
gOll YLPQMLSAGEIRTAVEAAVAETGAAGMADMGKVMWLKTRLAGKADMGEVNKILKTVLTA 

130 140 150 160 170 180 

gOll X 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 77>: 

g012 . seq 

1 ATGCTCGCCC GTCGCTATTT TTTCAATATC CAACCCGGGG CGGTTTTCAC 

51 TGACAAACTG CTTGAACAAC TGATGCGTTT CCTCCAGTTC CTGCCGGAAT 

101 TTCTGTTTGC CCTTTTCCGT ATTTTCACCC ATAAAAGTAA CCGTGCGCTT 

151 AAATTCGCCC GCCGTCATCA CATCCACATC AATATCATGT TTTTTCAACa 

201 gGcggTGGAT ATTCGgcact tccgCcacca cacccaccga accgatgacc 

251 gcaaacggaG CGGAAACAAT TTTATCCGCc acacacgcca tcatatagcc 

301 gcCGCTTGCC GCGACCTTAT CGAcggcgac ggTCAGCGGA ATATTGCGTT 

351 CGCGCAAACG CCTAAGCTGC GAAGCCGCCA AACCGTAACC GTGAACCACG 

4 01 CCGCCCGGAC TTTCCAATCT GAGCAGAACC TCATCTTCAG GCTTGGCAAT 

4 51 CAAAAGCACC GCCGTAATCT CATGACGCAA GGATTCTACG GCGTGTGCAT 

501 ACAAATCGCC GTCAAAATCC AACACAAAAA GGCGGGATTT TTGCGTTTCG 

551 GCAGATTTCT CCCCGCCCTC CTTCAAACGC TTTTTCTCTG CTTTGGCTTC 

601 CGCCTTTTCC TTTTTCTTTT CTTTTTTTTC CTGATGTTTT GTCTCTTCCT 

651 CGCTTAA 

This corresponds to the amino acid sequence <SEQ ID 78; ORF 012.ng>: 

g012 .pep 

1 MLARRYFFNI QPGAVFTDKL LEQLMRFLQF LPEFLFALFR IFTHKSNRAL 

51 KFARRHHIHI NIMFFQQAVD IRHFRHHTHR TDDRKRSGNN FIRHTRHHIA 

101 AACRDLIDGD GQRNIAFAQT PKLRSRQTVT VNHAARTFQS EQNLIFRLGN 

151 QKHRRNLMTQ GFYGVCIQIA VKIQHKKAGF LRFGRFLPAL LQTLFLCFGF 

201 RLFLFLFFFF LMFCLFLA* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 79>: 

m012 . seq 

1 ATGCTCGCCC GTTGCCACTT CCTCAATATC CAATTGAGGG CGGTTCTCGC 

51 TGACAAACTG CTTGAACAAC TGATGCGTTT CCTCCAGTTC CTGTCGGAAT 

101 TTCTGTTTGC CCTTTTCCGT ATTTTCACCC ATAAAAGTAA CCGTGCGCTT 

151 AAATTCGCCC GCCGTCATCA CATCCACATC AATATCATGT TTTTTCAACA 

2 01 GGCGGTGGAT ATTCGGTACT TCCGCCACCA CACCCACCGA ACCGACAATC 

2 51 GCAAACGGAG CGGAAGCAAT TTTATCCGCC ACACACGCCA TCATATAACC 

301 GCCGCTCGCn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 

351 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 

4 01 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 

4 51 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 

501 nnnnnnnnnn nnnnnnnnnC AACACAAAAA GGCGTGATTT nTGCGTTTCG 

551 GCAGATTTCT CCCCACCCTC CTTCAAACGT TTTTCcTCTG CTTTGGCTTC 

601 CGCCTTTTCC TTTTTCTTTT CCTCTTTTTC CTGATGTTGT GCCTCTTCCC 

651 CGCTTAA 

This corresponds to the amino acid sequence <SEQ ID 80; ORF 012>: 

m012 .pep 

1 MLARCHFLNI QLRAVLADKL LEQLMRFLQF LSEFLFALFR IFTHKSNRAL 

51 KFARRHHIHI NIMFFQQAVD IRYFRHHTHR TDNRKRSGSN FIRHTRHHIT 

101 AARXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 

151 XXXXXXXXXX XXXXXXXXXX XXXQHKKA* F XRFGRFLPTL LQTFFLCFGF 

201 RLFLFLFLFF LMLCLFPA* 
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The following partial DNA sequence was identified in N. meningitidis <SEQ ED 81>: 

a012.seq 

1 ATGCTCGCCC GTTGCCACTT CCTCAATATC CAATTGAGGG CGGTTCTCGC 

51 TGACAAACTG CTTGAACAAC TGATGCGTTT CCTCCAGTTC CTGTCGGAAT 

101 TTCTGTTTGC CCTTTTCCGT ATTTTCACCC ATAAAAGTAA CCGTGCGCTT 

151 AAATTCGCCC GCCGTCATCA CATCCACATC AATATCATGT TTTTTCAACA 

201 GGCGGTGGAT ATTCGGTACT TCCGCTACAA CACCCACCGA ACCGACAATC 

2 51 GCAAACGGAG CGGAAACAAT TTTATCCGCC ACACACGCCA TCATATAACC 

301 ACCGCTCGCC GCCACCTTAT CGACGGCGAC GGTCAGCGGA ATATTGCGTT 

351 CGCGCAAACG CCTAAGCTGC GAAGCCGCCA AACCGTAACC GTGAACCACG 

4 01 CCGCCCGGAC TTTCCAATCT AAGCAGAACC TCATCTTCAG GCTTGGCAAT 

4 51 CAAAAGCACC GCCGTAATCT CATGACGCAA GGATTCTACG GCGTGTGCAT 

501 ACAAATCGCC GTCAAAATCC AACACAAAAA GGCGGGATTT TTGCGTTTCG 

551 GAAGATTTCT CCCCACCCTC CTTCAAACGC TTTTTCTCTG CTTTGGCTTC 

601 CGCCTTTTCC TTTTTCTTTT CCTCTTTTTC CTGATGTTTT GCCTCTTCCC 

651 CGCTTAA 



This corresponds to the amino acid sequence <SEQ ID 82; ORF 012.a>: 

a012 .pep 

1 MLARCHFLNI QLRAVLADKL LEQLMRFLQF LSEFLFALFR IFTHKSNRAL 

51 KFARRHHIHI NIMFFQQAVD IRYFRYNTHR TDNRKRSGNN FIRHTRHHIT 

101 TARRHLIDGD GQRNIAFAQT PKLRSRQTVT VNHAARTFQS KQNLIFRLGN 

151 QKHRRNLMTQ GFYGVCIQIA VKIQHKKAGF LRFGRFLPTL LQTLFLCFGF 

201 RLFLFLFLFF LMFCLFPA* 



mOi2/aoi2 64.2% identity over a 21 8 aa overlap 

10 20 30 40 50 60 

m012 .pep MLARCHFLNIQLRAVLADKLLEQLMRFLQFLSEFLFALFRIFTHKSNRALKFARRHHIHI 
I I I M I I M I I I I I I II I M I II I I I I i I I M I M I I M I I I 1 I M I I II I I I II I I I \ I 
a012 MLARCHFLNIQLRAVLADKLLEQLMRFLQFLSEFLFALFRIFTHKSNRALKFARRHHIHI 

10 20 30 40 50 60 



70 80 90 100 110 120 

m012 . pep NIMFFQQAVDIRYFRHHTHRTDNRKRSGSNFIRHTRHHITAARXXXXXXXXXXXXXXXXX 
I I I I I I I I I II 1 II I : : I I M I I I I I I I : I M I I I I I II I : I I : 
a012 NIMFFQQAVDIRYFRYNTHRTDNRKRSGNNFIRHTRHHITTARRHLIDGDGQRNIAFAQT 

70 80 90 100 110 120 



130 140 150 160 170 180 

IU012 . pep XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQHKKAXF 

: : : : Milt I 

a012 PKLRSRQTVTVNHAARTFQSKQNLIFRLGNQKHRRNLMTQGFYGVCIQIAVKIQHKKAGF 

130 140 150 160 170 180 



190 200 210 219 

ItlO 12 . pep XRFGRFLPTLLQTFFLCFGFRLFLFLFLFFLMLCLFPAX 
I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I : I i I I I I 
a 012 LR FGR FL PT LLQT L FLC FG FRL FL FL FL F FLM FC L F PAX 

190 200 210 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 012 shows 58.7% identity over a 218 aa overlap with a predicted ORF (ORF 012.ng) 
from TV. gonorrhoeae: 

m012/g012 

10 20 30 40 50 60 

m012 . pep MLARCHFLNIQLRAVLADKLLEQLMRFLQFLSEFLFALFRIFTHKSNRALKFARRHHIHI 
I I I I : I : I I I I I :: I I I I I I I I I I I I I I I I I II I I I I II I M I I I I I I I I I I J I I I 
g012 MLARRYFFNIQPGAVFTDKLLEQLMRFLQFLPEFLFALFRIFTHKSNRALKFARRHHIHI 
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10 20 30 40 50 60 

70 80 . 90 100 110 120 

m012 . pep NIMFFQQAVDIRYFRHHTHRT DNRKRSGSNFIRHTRHHITAARXXXXXXXXXXXXXXXXX 

t | I I I I 1 1 I I I I : I 11 I i I I II : 1 I I I I : I I I I 1 M M I : i I : 
g 012 NIMFFQQAVDIRHFRHHTHRTDDRKRSGNNFIRHTRHHIAAACRDLIDGDGQRNIAFAQT 

70 80 90 100 110 120 

130 140 150 160 170 180 

mO 1 2 . pep XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQHKKAX F 

: : : : I I I I I I 

g 012 PKLRSRQTVTVNHAARTFQSEQNLIFRLGNQKHRRNLMTQGFYGVCIQIAVKIQHKKAGF 

130 140 150 160 170 180 

190 200 210 219 

m0l2 . pep XRFGRFLPTLLQTFFLCFG FRLFLFLFLFFLMLCLFPAX 

I t I 1! I I : I I 1 I : I I I I ! I M I I I I I : I I 11 : II I II 
g 012 LRFGRFLPALLQTLFLCFGFRLFLFLFFFFLMFCLFLAX 

190 200 210 



The following partial DNA sequence was 

m012-l . seq 

1 ATGCTCGCCC GTTGCCACTT 

51 TGACAAACTG CTTGAACAAC 

101 TTCTGTTTGC CCTTTTCCGT 

151 AAATTCGCCC GCCGTCATCA 

201 • GGCGGTGGAT ATTCGGTACT 

251 GCAAACGGAG CGGAAGCAAT 

301 GCCGCTCGCC GCCACCTTAT 

351 CGCGCAAACG CyTAAGCTGC 

4 01 CCGCCCGGAC TTTCCAATCT 

4 51 CAAAAGCACC GCCGTAATCT 

501 ACAAATCGCC GTCAAAATCC 

551 GCAGATTTCT CCCCACCCTC 

601 CGCCTTTTCC TTTTTCTTTT 

651 CGCTTAA 



identified in N. meningitidis <SEQ ID 83>: 



CCTCAATATC 
TGATGCGTTT 
ATTTTCACCC 
CATCCACATC 
TCCGCCACCA 
TTTATCCGCC 
CGACGGCGAC 
GAAGCCGCCA 
GAGCAGAACC 
CAT G AC G C AA 
AACACAAAAA 
CTTCAAACGC 
CCTCTTTTTC 



CAATTGAGGG 
CCTCCAGTTC 
ATAAAAGTAA 
AATATCATGT 
CACCCACCGA 
ACACACGCCA 
GGTCAGCGGA 
AACCGTAACC 
TCATCTTCAG 
GGATTCTACG 
GGCGGGATTT 
TTTTTCTCTG 
CTGATGTTTT 



CGGTTCTCGC 
CTGTCGGAAT 
CCGTGCGCTT 
TTTTTCAACA 
ACCGACAATC 
TCATATAACC 
ATATTGCGTT 
GTGAACCACG 
GCTTGGCAAT 
GCGTGTGCAT 
TTGCGTTTCG 
CTTTGGCTTC 
GCCTCTTCCC 



This corresponds to the amino acid sequence <SEQ ID 84; ORF 012-1>: 

m012-l .pep 

1 MLARCHFLNI QLRAVLADKL LEQLMRFLQF LSEFLFALFR IFTHKSNRAL 

51 KFARRHHIHI NIMFFQQAVD IRYFRHHTHR TDNRKRSGSN FIRHTRHHIT 

101 AARRHLIDGD GQRNIAFAQT XKLRSRQTVT VNHAARTFQS EQNLIFRLGN 

151 QKHRRNLMTQ GFYGVCIQIA VKIQHKKAGF LRFGRFLPTL LQTLFLC FGF 
201 RLFLFLFLFF LMFCLFPA* 



m012-l/g012 



91.7% identity in 218 aa overlap 



10 20 30 40 50 60 

m012-l.pep MLARCHFLNIQLRAVLADKLLEQLMRFLQFLSEFLFALFRIFTHKSNRALKFARRHHIHI 
INI : I : II I I I :: II I I I II I I I I I I I M I M M I I M I M II i I M I I I I II I I 
g 012 MLARRYFFNIQPGAVFTDKLLEQLMRFLQFLPEFLFALFRIFTHKSNRALKFARRHHIHI 

10 20 30 40 50 60 

70 80 90 100 110 120 

m012-l - pep NIMFFQQAVDIRYFRHHTHRTDNRKRSGSNFIRHTRHHITAARRHLIDGDGQRNIAFAQT 
I II II I I I I I II : II I I II I I I : I I II I : M M I I I I I I : M I M I I I M I I I M M I 
g 012 NIMFFQQAVDIRHFRHHTHRTDDRKRSGNNFIRHTRHHIAAACRDLIDGDGQRNIAFAQT 

70 80 90 100 110 120 



130 140 150 160 170 180 

m012-l . pep XKLRSRQTVT VNHAART FQSEQNLIFRLGNQKHRRNLMTQGFYGVCIQIAVKIQHKKAGF 
M I II M M I I I M I I I I I I M I II I I M II I I I I I I I I I I II I II I I M I I I I I I I I I 
g012 PKLRSRQTVT VNHAART FQSEQNLIFRLGNQKHRRNLMTQGFYGVCIQIAVKIQHKKAGF 
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130 



140 



150 



160 



170 



180 



190 200 210 219 

m012-l .pep LRFGRFLPTLLQTLFLCFG FRLFLFLFLFFLM FCLFPAX 
M I I M I I : I I I I I I I II I II I I I I I I : I I I I I I I I II 
g012 LRFGRFLPALLQTLFLCFGFRLFLFLFFFFLMFCLFLAX 

190 200 210 



The following partial DNA sequence was 

a012-l . seq 

1 ATGCTCGCCC GTTGCCACTT 

51 TGACAAACTG CTTGAACAAC 

101 TTCTGTTTGC CCTTTTCCGT 

151 AAATTCGCCC GCCGTCATCA 

2 01 GGCGGTGGAT ATTCGGTACT 

251 GCAAACGGAG CGGAAACAAT 

301 ACCGCTCGCC GCCACCTTAT 

351 CGCGCAAACG CCTAAGCTGC 

4 01 CCGCCCGGAC TTTCCAATCT 

4 51 CAAAAGCACC GCCGTAATCT 

501 ACAAATCGCC GTCAAAATCC 

551 GAAGATTTCT CCCCACCCTC 

601 CGCCTTTTCC TTTTTCTTTT 

651 CGCTTAA 



identified in N. meningitidis <SEQ ID 85>: 



CCTCAATATC 
TGATGCGTTT 
ATTTTCACCC 
CATCCACATC 
TCCGCTACAA 
TTTATCCGCC 
CGACGGCGAC 
GAAGCCGCCA 
AAGCAGAACC 
CATGACGCAA 
AACACAAAAA 
CTTCAAACGC 
CCTCTTTTTC 



CAATTGAGGG 
CCTCCAGTTC 
ATAAAAGTAA 
AATATCATGT 
CACCCACCGA 
ACACACGCCA 
GGTCAGCGGA 
AACCGTAACC 
TCATCTTCAG 
GGATTCTACG 
GGCGGGATTT 
TTTTTCTCTG 
GTGATGTTTT 



CGGTTCTCGC 
CTGTCGGAAT 
CCGTGCGCTT 
TTTTTCAACA 
ACCGACAATC 
TCATATAACC 
ATATTGCGTT 
GTGAACCACG 
GCTTGGCAAT 
GCGTGTGCAT 
TTGCGTTTCG 
CTTTGGCTTC 
GCCTCTTCCC 



This corresponds to the amino acid sequence <SEQ ID 86; ORF 012-1 .a>: 

a012-l .pep 

1 MLARCHFLNI QLRAVLADKL LEQLMRFLQF LSEFLFALFR IFTHKSNRAL 
51 KFARRHHIHI NIMFFQQAVD IRYFRYNTHR TDNRKRSGNN FIRHTRHHIT 
101 TARRHLIDGD GQRNIAFAQT PKLRSRQTVT VNHAARTFQS KQNLIFRLGN 
151 QKHRRNLMTQ GFYGVCIQIA VKIQHKKAGF LRFGRFLPTL LQTLFLCFGF 
201 R LFLFLFLFF LMFCLFPA * 

a0l2-l/m012-l 97,2% identity in 218 aa overlap 

10 20 30 40 50 60 

a012-l . pep MLARCHFLNIQLRAVLADKLLEQLMRFLQFLSEFLFALFRIFTHKSNRALKFARRHHIHI 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I M I I I I I I I I I I I I I I I M I I I 
m012-l MLARCHFLNIQLRAVLADKLLEQLMRFLQFLSEFLFALFRIFTHKSNRALKFARRHHIHI 

10 20 30 40 50 60 

70 80 90 100 110 120 

a012-l .pep NIMFFQQAVDIRYFRYNTHRTDNRKRSGNNFIRHTRHHITTARRHLIDGDGQRNIAFAQT 
M I I II M I I I I I I I : : I I I I I I I M M : I I! I M I I! I I : I I I I I 1 I M I II I M I I M 
m012-l NIMFFQQAVDIRYFRHHTHRTDNRKRSGSNFIRHTRHHITAARRHLIDGDGQRNIAFAQT 

70 80 90 100 110 120 

130 140 150 160 170 180 

a012-l . pep PKLRSRQTVT VNHAART FQSKQNLIFRLGNQKHRRNLMTQG FYGVCIQIAVKIQHKKAGF 
I I I I I M I I II I I I I I I I I : I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I M I I II 
m012-l XKLRSRQTVTVNHAARTFQSEQNLIFRLGNQKHRRNLMTQG FYGVCIQIAVKIQHKKAGF 

130 140 150 160 170 180 



190 200 210 219 

a0l2-l . pep LRFGRFLPTLLQTLFLCFG FRLFLFLFLFFLM FCLFPAX 
I I I I I 1 I I I I I I I I I I I I I I I II I II I II I I I I I I M M 
m012-l LRFGRFLPTLLQTLFLCFGFRLFLFLFLFFLMFCLFPAX 

190 200 210 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 87>: 

g013 . seq 

1 aTgcctttga ccatgctgtg cagcaGGAcg tGCGGTTtgt tcataataca 

51 gtCcgaccGG AAAagcggAG GAAaCGCAGT GCCGCGCCCT TCCCCTTTCT 

101 TGCCGTGGCA GGCGATGCag tTgGATTCGT ACACTTTTTG CCCTTTtGtC 
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151 atgatGCTgt tgtcggCGGC AGAAGCgGCG GcgCAGAGGC AGCACAAGAT 

201 GAAGGCGGTC GGCAGTCGGG TTGTGTtcat tGgcgTTTCC cctaatgttt 

251 tgaaaccttg ttttttgatt Ttgcctttac ggggtgaaaa gtttttTtgg 

301 cccaaatccg gaatttag 

This corresponds to the amino acid sequence <SEQ ID 88; ORF 013.ng: 

g013 .pop 

1 MPLTMLCSRT CGLFIIQSDR KSGGNAVPRP SPFLPWQAMQ LDSYTFCPFV 
51 MMLLSAAEAA AQRQHKMKAV GSR VVFIGVS PNVLKPCFLI LPLRGEKFFW 
101 PKSGI* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 89>: 

m013 . seq 

1 ATGCCTTTGA CCATGCTGTG CAGCAGCACC TGCGGTTTTT T CAT G AT G AA 

51 GTCGGAGCGG TAGAGCGGCG GAAACATGGT TCCGCGGCCT TCGCCCTTTT 

101 TGCCGTGGCA GGCGACGCAG TTGGATTCGT ACACTTTTTG CCCTTTTGTC 

151 ATGATGCTGT TGTCGGCGGC AGAAGCGGCG GCGCAGAAGC AGCCCAAGAC 

201 GAGGGCGGTC GGCAGTCGGG TTGTGTTCAT TGGTGTTTCC TTCATGTTTG 

251 AAACCTTGTT GTTGATTTTG CGTAGCGGGT GAAAGATTTT TTTGCCGAAT 

301 CAGTAG 

This corresponds to the amino acid sequence <SEQ ID 90; ORF 01 3>: 

m013 .pep 

1 MPLTMLCSST CGFFMMKSER XSGGNMVPRP SPFLPWQATQ LDSYTFCPFV 
51 MMLLSAAEAA AQKQPKTRAV GSR VVFIGVS FMFETLLLI L RSGXKI FLPN 
101 Q* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 91>: 

a013 . seq 

1 ATGCCTTTGA CCATGCTGTG CAGCAGCACC TGCGGTTTTT T C AT GAT G AA 

51 GTCGGAGCGG TAGAGCGGCG GAAACATGGT TCCGCGGCCT TCGCCCTTTT 

101 TGCCGTGGCA GGCGACGCAG TTGGATTCGT ACACTTTTTG CCCTTTTGTC 

151 ATGATGCTGT TGTCGGCGGC AGAAGCGGCG GCGCAGAGGC AGCCCAAGAC 

201 GAGGGCGGTC GGCAGTCGGG TTGTGTTCAT TGGTGTTTCC TTAATGTTTG 

2 51 AAACCTTGTT GTTGATTTTG CGTAGCGGGT GAAAGATTTT CTTGCCGAAT 

301 CGGTAG 

This corresponds to the amino acid sequence <SEQ ID 92; ORF 013. a>: 

a013 . pep 

1 MPLTMLCSST CGFFMMKSER *SGGNMVPRP SPFLPWQATQ LDSYTFCPFV 
51 MMLLSAAEAA AQRQPKTRAV GSR VVFIGVS LMFETLLLIL RSG*KI FLPN 
101 R* 

mOi3/aoi3 97.0% identity over a 101 aa overlap 

10 20 30 40 50 60 

m013.pep MPLTMLCSSTCGFFMMKSERXSGGNMVPRPSPFLPWQATQLDSYT FCPFVMMLLSAAEAA 

M I II I I I I I ! I I M I I II I I I I I I I I I I I I 1 M I I I I 1 I I I I I I I I i I I I I I I M j I I I 
a013 M P LTM LC S ST CGFFMMKSERXSGGNMV PR PS PFLPWQATQLDSYT FCPFVMMLLSAAEAA 

10 20 30 40 50 60 

70 80 90 100 

m013.pep AQKQPKTRAVGSRWFIGVSFMFETLLLILRSGXKIFLPNQX 
j I : I I I I I I I 1 It I I I I I I I : I I I I I I I I I I I I I t I I I I I : I 
a0l3 AQRQPKTRAVGSRWFIGVSLMFETLLLILRSGXKIFLPNRX 

70 80 90 100 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 013 shows 73.3% identity over a 101 aa overlap with a predicted ORF (ORF 013.ng) 
from N. gonorrhoeae: 
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m013/g013 

10 20 30 40 50 60 

m013 . pep MPLTMLCSSTCGFFMMKSERXSGGNMVPRPSPFLPWQATQLDSYTFCPFVMMLLSAAEAA 
I I I M I I I I I I : I : : : I : I I I I I I I I i I I I I I I I I I M M I I I I I I I I I I I I I I I I 
g 013 MPLTMLCSRTCGLFI IQSDRKSGGNAVPRPS PFLPWQAMQLDS YTFCPFVMMLLSAAEAA 

10 20 30 40 50 60 



70 80 90 100 

m013 .pep AQKQPKTRAVGSRVVFIGVSF-MFETLLLILR-SGXKI FLPNQX 

I I : I I : I I I I I M I I I I i : : : : I II I I : I 1 : 
g 013 AQRQHKMKAVGSRVVFIGVSPNVLKPCFLILPLRGEKFFWPKSGIX 

70 80 90 100 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 93>: 

g015 . seq 

1 ATGCAGT AT C TGATTGTCAA ATACAGC CAT CAAATCTTCG TT AC CAT C AC 

51 CATTTTGGTA TTCAACATCC GTTTTTTCCT ACTTTGGAAA AAT C C AG AAA 

101 AGCCCTTGGT CGGCTTTTGG AAAGCACTGC CCCACCTCAA CGACACGATG 

151 CTGCTGTTTA CGGGATTGTG GCTGATGAAG ATTACCCATT TCTCCCCGTT 

201 CAACGCGCCT TGGCTCGGCA CAAAAATCCT GCTCCTGTTC GCCTACATCG 

251 CACTGGGCAT GGTAATGATG CGCGCCCGTC CGCGTTCGAC CAAGTTCTAC 

3 01 ACCGTTTACC TGCTCGCTAT GTGTTGCATC GCCTGCATCG TTTACCTTGC 

351 CAAAACCAAA GTCCTGCCAT TCTGA 



This corresponds to the amino acid sequence <SEQ ID 94; ORF 015.ng>: 

g015 .pep 

1 MQYLIVKYSH Q IFVTITILV FNIRFFLL WK NPEKPLVGFW KAL PHLNDTM 

51 LLFTGLWLMK ITHFSPFNAP WLGTK ILLLF AY I ALGMVMM RAR PRSTKFY 

101 TVYLLAMCCI AC I VYLAKTK VLPF* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 95>: 

m0l5.seq (partial) 

1 . . AAAATCAGAA AAGCCTTGGC GGGCTTTTGG AAGGCACTGC CCCACCTTAA 

51 CGACACCATG CTGCTGTTTA CGGGATTGTG GCTGATGAAA ATTACCCATT 

101 TCTCCCCGTT CAACGCGCCT TGGCTCGGTA CAAAAATCCT GCTTCTGCTC 

151 GCCTATATCG CATTGGGTAT GATGATGATG CGCGCCCGTC CGCGTTCGAC 

201 CAAGTTCTAC ACCGTTTACC TGCTCGCCAT GTGTTGCGTC GCCTGCATCG 

251 TTTACCTTGC CAAAACCAAA GTCCTGCCTT TCTGA 



This corresponds to the amino acid sequence <SEQ ID 96; ORF 015: 

mOlS.pep (partial) 

1 . . KIRKALAGFW KALPHLNDTM LLFTGLWLMK ITHFSPFNAP WLGT KILLLL 
51 AY I ALGMMMM RARPRSTKFY TVYLLAMCCV AC I VYLA KTK VLPF* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 97>: 

a015 . seq 

1 ATGCAGTATC TGATTGTCAA ATACAGC CAT CAAATCTTCG TTACCATCAC 

51 CATTTTGGTA TTCAACATCC GTGTTTTCNT ACTTTGGAAA AATCCAGAAA 

101 AGCCCTTGGC GGGCTTTTGG AAGGCACTGC CCCACCTTAA CGACACCATG 

151 CTGCTGTTTA CGGGATTGTG GCTGATGAAA ATTACCCATT TCTCCCCGTT 

201 CAACGCGCCT TGGCTCGGTA CAAAAATCCT GCTTCTGCTC GCCTATATCG 

2 51 CATTGGGTAT GATGATGATG CGCGCCCGTC CGCGTTCGAC CAAGTTCTAC 

301 ACCGTTTACC TGCTCGCCAT GTGTTGCCTC ACCTGCATCG TTTACCTTGC 

351 CAAAACCAAA GTCCTGCCTT TCTGA 



This corresponds to the amino acid sequence <SEQ ID 98; ORF 015.a>: 

a015 . pep 
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1 MQYLIVKYSH Q IFVTITILV FNIRVFXL WK NPEKPLAGFW KALPHLNDTM 
51 LLFTGLWLMK ITHFSPFNAP WLGTK ILLLL AY I ALGMMMM RA RPRSTKFY 
101 TVYLLAMCCL TCIVYLA KTK VLPF* 

mois/aois 96.7% identity over a 91 aa overlap 



10 20 30 

m015 . pep K I RKALAG FW KAL P H LNDTM LLFTGLWLMK I TH 

i 1) I II ! I I I I I i If I I I I I I I 11 I I I II 
a015 LIVKYSHQIFVTITILVFNIRVFXLWKNPEKPLAGFWKALPHLNDTMLLFTGLWLMKITH 
10 20 30 40 50 60 



40 50 60 70 80 90 

m015 .pep FSPFNAPWLGTKILLLLAYIALGiynylMMRARPRSTKFYTVYLIJUyiCCVACIVYLAKTKVLP 
M I I I I 11 I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I II I I :: I I M I I I i I I M 
a015 FSPFNAPWLGTKILLLLAYIALGMMMMRARPRSTKFYTVYLLAMCCLTCIVYLAKTKVLP 

70 80 90 100 110 120 



mOlS.pep FX 
I I 

a015 FX 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 015 shows 94.5% identity over a 91 aa overlap with a predicted ORF (ORF 015.ng) 
from N. gonorrhoeae: 

m015/g015 

10 20 30 

KIRKALAGFWKALPHLNDTMLLFTGLWLMKITH 

1 hllllllllllllMIIIIMIIIMI 
LIVKYSHQIFVTITILVFNIRFFLLWK^PEKPLVG™KALPHLNDTMLLFTGLWLJ4KITH 
10 20 30 40 50 60 



m015 . pep 
9015 



40 50 60 70 80 90 

m015 . pep FSPFNAPWLGTKILLLLAYIALGMMMMRARPRSTKFYTO 

MM MM II I IMIIMIMMMMMII III III MIMI IMM I IIMM I I II 

g015 FSPFNAPWLGTKILLLFAYIALGMVMMRARPRSTKFYT\TYLLAMCCIACIVYLAKTKVLP 
70 80 90 100 110 120 



m015.pep FX 
II 

g015 FX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 99>: 

g018 . seq 

1 atGCAGCAGG GGCagttggt tggacgcgtc gcccgcaata AAGATATGCG 

51 GAATgctggt CTGCATggtC AGCGGATCGG CAACGGGtac gccgcgcgcg 

101 tctttgTCGA TATTGATGTT TTCCAAACCG ATATtgTCAA CGTTCGGACG 

151 GCgACCTACG GCTGCCAACA TATATTCGGC AACAAATACG CCTTTTTCGC 

201 CATCCTGCTC CCAATGGACT tCtACATTGC CGTCTGCGTC GAGTTTGACC 

251 TCGGTTTTAG CAT C CAG AT G CAGTTTCAAT tCtTCTCCGA ACACGGCTTT 

301 CGCCTCGTCT GA 

This corresponds to the amino acid sequence <SEQ ID 100; ORF 018.ng>: 

g018 .pep 

1 MQGGQLVGRV ARNKDMRNAG LHGQRIGNGY AARVFVDIDV FQTDIVNVRT 

51 ATYGCQHIFG NKY AFFAILL PMDFYIAVCV EFDLGFSIQM QFQFFSEHGF 
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101 RLV* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 101>: 

m018 . seq 

1 ATGCAGCAGA GGCAGTTGGT TGGACGCATC GCCTGCGATG AAGATATGCG 

51 GAATACTGGT CTGCATGGTC AGCGGGTCGG CAACAGGTAC GCCGCGCGCA 

101 TCTTTTTCGA TATTGATATT TTCCAAACCG ATATTGTCAA CGTTCGGACG 

151 GCGGCCCACG GCTGCCAGCA TATATTCGGC AACAAATACG CCTTTTTCGC 

201 CATCCTGCTC CCAATGGACT TCTACATTGC CGTCTGCATC GAGTTTGACC 

251 TCGGTTTTAG CATCCAGATG CAGTTTCAAT TCTTCGCCGA ACACGGCGTT 

301 CGCCTCGTCT GA 



This corresponds to the amino acid sequence <SEQ ID 102; ORF 01 8>: 

m0l8 . pep 

1 MQQRQLVGRI ACDEDMRNTG LHGQRVGNRY AARIFFDIDI FQTDIVNVRT 

51 AAHGCQHIFG NKY AFFAILL PMDFYIAVCI EFDLGFSIQM QFQFFAEHGV 

101 RLV* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 103>: 

a018 . seq 

1 ATGCAGCAGG GGCAGTTGGT TGGACGCGTC GCCCGCAATA AAGATATGCG 

51 GAATACTGGT CTGCATAGTC AGCGGATCGG CAACGGGTAC GCCGCGCGCA 

101 TCTTTTTCGA TATTGATGTT TTCCAAACCG ATATTGTCAA CGTTCGGACG 

151 GCGGCCTACG GCTGCCAGCA TATATTCGGC AACAAATACG CCTTTTTCGC 

201 CATCCTGCTC CCAATGGACT TCTACATTGC CGTCTGCGTC GAGTTTGGCC 

251 TCGGTTTTAG CATCCAAATG CAGTTTCAAT TCTTCACCGA ACACGGCTTT 

301 CGCCTCGTCT GA 

This corresponds to the amino acid sequence <SEQ ID 104; ORF 018.a>: 

a018 . pep 

1 MQQGQLVGRV ARNKDMRNTG LHSQRIGNGY AARIFFDIDV FQTDIVNVRT 

51 AAYGCQHIFG NKY AFFAILL PMDFYIAVCV EFGLGFSIQM QFQFFTEHGF 

101 RLV* 



moi8/aOi8 86.4% identity over a 103 aa overlap 



10 20 30 40 50 60 

m018 . pep MQQRQLVGRIACDEDMRNTGLHGQRVGNRYAARIFFDIDI FQTDIVNVRT AAHGCQHIFG 

Ml | I I M : I : : I M I I I It : I I : I I I I I I M M I I : t I I M I I I I I ! I : I I I I! II 
a018 MQQGQLVGRVARNKDMRNTGLHSQRIGNGYAARIFFDIDVFQTDIVNVRTAAYGCQHIFG 

10 20 30 40 50 60 

70 80 90 100 

m018 .pep NKYAFFAILLPMDFYIAVCIEFDLGFSIQMQFQFFAEHGVRLVX 
I I I I I I I I I I I I II I I I I I : It I I I I I I M I I I I : I M I I I I 
a0l8 NKYAFFAILLPMDFYIAVCVEFGLGFSIQMQFQFFTEHGFRLVX 

70 80 90 100 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 018 shows 84.5% identity over a 103 aa overlap with a predicted ORF (ORF 018.ng) 
from N. gonorrhoeae: 

m0l8/g018 

10 20 30 40 50 60 

m018 .pep MQQRQLVGRI ACDEDMRNTGLHGQRVGNRYAARI FFDI DI FQTDI VNVRTAAHGCQHI FG 

Ml I I I II : I : : I II I : I I I I M : I I I II I : I I I I : II I I I II I I 1 I : : I I I I I I I 
g018 MQQGQLVGRV ARNKDMRNAGLHGQRIGNGYAARVFVD I DV FQTDIVNVRT AT YGCQH I FG 

10 20 30 40 50 60 
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70 80 90 100 

m018 . pep NKYAFFAILLPMDFYIAVCIEFDLGFSIQMQFQFFAEHGVRLVX 
I I I I 1 I I I M I I I I I I M I : 1 I I I I I I I i I I I I I I : I I I llll 
g018 NKYAFFAILLPMDFYIAVCVEFDLGFSIQMQ FQFFSEHGFRLVX 

70 80 90 100 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ED 105>: 

g019.seq (partial) 

1 . .ctgctggcgg ccctggtgct tgccgcgtgt tcttcgACAA ACAcacTGCC 

51 AGCCGGCAAG ACCCCGGCAG ACAATATAGA AActgcCgAC CTTTCGGCAA 

101 GCGTTCCCAC CCgcCCTGCC GAACCGGAAG GAAAAACGCT GGCAGATTAC 

151 GGCGGCTACC CGTCCGCACT GGATGCAGTG AAACAGAACA ACGATGCGGC 

2 01 AGCCGCCGCC TATTTGGAAA Acgcaggaga cagCGcgatg gcGGAAAatg 

2 51 tccgcaagga gtgGCTGa 

This corresponds to the amino acid sequence <SEQ ID 106; ORF 019.ng>; 

g019.pep (partial) 

1 . .LLAALVLAAC SSTNTLPAGK TPADNIETAD LSASVPTRPA EPEGKTLADY 
51 GGYPSALDAV KQNNDAAAAA YLENAGDSAM AENVRKEWL* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 107>: 



m019 . seq 


(partial ) 










1 


ATGTACCTAC 


CCTCTATGAA 


GCATTCCCTG 


CCGCTGCTGG 


CGGCCCTGGT 


51 


GCTTGCCGCG 


TGTTCTTCGA 


CAAACACACT 


GCCAGCCGGC 


AAGACCCCGG 


101 


CAGACAATAT 


AGAAACTGCC 


GACCTTTCGG 


CAAGCGTTCC 


CACCCGCCCT 


151 


GCCGAACCCG 


AAAGAAAAAC 


GCTGGCAGAT 


TACGGCGGCT 


ACCCGTCCGC 


201 


ACTGGATGCA 


GTGAAACAGA 


AAAACGATGC 


CGCCGTCGCC 


GCCTATTTGG 


251 


AAAACGCCGG 


CGACAGCGCG 


ATGGCGGAAA 


ATGTCCGCAA 


CGAGTGGCTG 


301 


AAGTCTTTGG 


GCGCACGCAG 


ACAGTGGACG 


CTGTTTGCAC 


AGGAATACGC 


351 


CAAACTCGAA 


CCGGCAGGGC 


GCGCCCAAGA 


AGTCGAATGC 


TACGCCGATT 


401 


CGAGCCGCAA 


CGACTATACG 


CGTGCCGCTG 


AACTGGTCAA 


AAATACGGGC 


451 


AAACTGCCTT 


CGGGCTGCAC 


CAAACTGTTG 


GAACAGGCAG 


CCGCATCCGG 


501 


CTTGTTGGAC 


GGCAACGACG 


CCTGGAGGCG 


CGTGCGCGGA 


CTGCTGGCCG 


551 


GCCGCCAAAC 


CACAGACGCA 


CGCAACCTTG 


CCGCCGCATT 


GGGCAGCCCG 


601 


TTTGACGGCG 


GTACACAAGG 


TTCGCGCGAA 


TATGCCCTGT 


T G AACGT CAT 


651 


CGGCAAAGAA 


GCACGCAAAT 


CGCCGAATGC 


CGCCGCCCTG 


CTGTCCGAAA 


701 


TGGAAAGCGG 


TTTAAGCCTC 


GAACAACGCA 


GTTTCGCGTG 


GGGCGTATTG 


751 


GGGCATTATC 


AGTCGCAAAA 


CCTCAATGTG 


CCTGCCGCCT 


TGGACTATTA 


801 


CGGCAAGGTT 


GCCGACCGCC 


GCCAACTGAC 


CGACGACCAA 


ATCGAGTGGT 


851 


ACGCCCGCGC 


CGCCTTGCGC 


GCCCGACGTT 


GGGACGAGCT 


GGCCTCCGTT 


901 


AT CTCGC AT A 


TGCCCGAAAA 


ACTGCAAAAA 


AGCCCGACCT 


GGCTCTACTG 


951 


GCTGGCACGC 


AGCCGCGCCG 


CAACGGGCAA 


CACGCAAGAG 


GCGGAAAAAC 


1001 


TTTACAAACA 


GGCGGCAGCG 


ACGGGCAGGA 


ATTTTTATGC 


GGTGCTGGCA 


1051 


GGGGAAGAAT 


TGGGTCGGAA 


AATCGATACG 


CGCAACAATG 


TGCCCGATGC 


1101 


CGGCAAAAAC 


AGCGTCCGCC 


GCATGGCGGA 


AGACGGTGCA 


GTCAAACGCG 


1151 


CACTGGTACT 


GTTCCAAAAC 


AGCCAATCTG 


CCGGTGATGC 


AAAAATGCGC 


1201 


CGTCAGGCTC 


AGGCGGAATG 


GCGTTTTGCC 


ACACGCGGCT 


TTGACGAAGA 


1251 


CAAGCTGCTG 


ACCGCCGCGC 


AAACCGCGTT 


CG AC CACGGT 


TTTTACGATA 


1301 


TGGCGGTCAA 


CAGCGCGGAA 


CGCACCGACC 


GCAAACTCAA 


CT AC AC CTTG 


1351 


CGCTATATTT 


CGCCGTTTAA 


AGACACGGTA 


ATCCGCCACG 


CGCAAAATGT 


1401 


TAATGTCGAT 


CCGGCTTGGG 


TTTATGGGCT 


GATTCGTCAG 


GAAAGCCGCT 


1451 


TCGTTATAGG 


CGCGCAATCC 


CGCGTAGGCG 


CGCAGGGGCT 


GATGCAGGTT 


1501 


ATGCCTGCCA 


CCGCGCGCGA 


AATCGCCGGC 


AAAATCGGTA 


TGGATGCCGC 


1551 


ACAACTTTAC 


ACCGCCGACG 


GG. . . 







This corresponds to the amino acid sequence <SEQ ID 108; ORF 019>: 

m019 . pep (partial ) 

1 MYLPSMKHSL PLLAALVLAA CSSTNTLPAG KTPADNIETA DLSASVPTRP 
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51 AEPERKTLAD YGGYPSALDA VKQKNDAAVA AYLENAGDSA MAENVRNEWL 

101 KSLGARRQWT LFAQEYAKLE PAGRAQEVEC YADSSRNDYT RAAELVKNTG 

151 KLPSGCTKLL EQAAASGLLD GNDAWRRVRG LLAGRQTTDA RNLAAALGS P 

201 FDGGTQGSRE YALLNVIGKE ARKSPNAAAL LSEMESGLSL EQRS FAWGVL 

2 51 GHYQSQNLNV PAALDYYGKV ADRRQLTDDQ IEWYARAALR ARRWDELASV 

3 01 ISHMPEKLQK SPTWLYWLAR SRAATGNTQE AEKLYKQAAA TGRNFYAVLA 

3 51 GEELGRKIDT RNNVPDAGKN SVRRMAEDGA VKRALVLFQN SQSAGDAKMR 

4 01 RQAQAEWRFA TRGFDEDKLL TAAQTAFDHG FYDMAVNSAE RTDRKLNYTL 
4 51 RYISPFKDTV IRHAQNVNVD PAWYGLIRQ ESRFVIGAQS RVGAQGLMQV 
501 M PAT ARE I AG KIGMDAAQLY TADG . . . 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 109>: 

a019 . seq 

1 ATGTACCCAC CCTCTCTGAA GCATTCCCTG CCGCTGCTGG TGGNCCTGGT 

51 GCTTGCCGCG TGTTCTTNGA CAAACACACT GTCAGCCGAC AAGACCCCGG 

101 CAGACAATAT AGAAACTGCC GACCTTTCGG CAAGCGTTCC CACCNGCCCT 

151 GCCGAACCCG AANGAAAAAC GTNGGCAGAT TACGGCGGCT ACCCGTCCGC 

201 ACTGGATGCA GTGAAACAGA AAAACGATGC CGCCGTCGCC GCCTATTTGG 

251 AAAACGCCGG CGACAGCGCG ATGGCGGAAA ATGTCCGCAA CGAGTGGCTG 

301 AAGTCTTTGG GCGCGCGCAG ACAGTGGACG CTGTNTGCAC ANGAATATGC 

351 NAAACTCGAA CCGGCANGGC GCGCCCAAGA AGTCGAATGC TACGCCGATT 

4 01 CGAGCCGCAA CGACTATACG CGTGCCGCCG AACTGGTCAA AAATACGGGC 

4 51 AAACTGCCTT CGGGCTGCAC CAAACTGTTG GAACAGGCAG CCGCATCCGG 

501 CTTGTTGGAC GGCAACGACG CCTGGAGGCG CGTGCGCGGA CTGCTGGCCG 

551 GCCGCCAAAC CACAGACGCA CGCAACCTTG CCGCCGCATT GGGCAGCCCG 

601 TTTGACGGCG GTACACAAGG TTCGCGCGAA TATGCCCTGT TGAACGTCAT 

651 CGGCAAAGAA GCACGCAAAT CGCCGAATGC CGCCGCCCTG CTGTCCGAAA 

701 . TGGAAAGCGG TTTAAGCCTC GAACAACGCA GTTTCGCGTG GGGCGTATTG 

7 51 GGGCATTATC AGTCGCAAAA CCTCAATGTG CCTGCCGCCT TGGACTATTA 

801 NGGCAAGGTT GCCGACCGCC GCCAACTGAC CGACGACCAA ATCGAGTGGT 

851 ACGCCCGCGC CGCNNTNNGC NNNCGNNGTT NGNANGANNT GGCNNCCGNN 

901 ANCNCGNNNN TGCNNGANAA ACNNNNNNAN AGNCNNANNT NGNTNNANTG 

951 NNTGGCACGC AGCCGCGCCG CNACGGGCAA CACGCAANAN G C GG AN AAA C 

1001 TNTACAAACA GGCGGCAGCA NCGGGCANGA ATTTTTATGC NGTGCTGNCN 

1051 GGGGAAGAGT TGGGGCGCAN AATCGATACG CGCAACAATG TGCCCGATGC 

1101 CGGCAAAANC AGCGTCCTCC GTATGGCGGA AGACGGCGCG ATTAAGCGCG 

1151 CGCTGGTGCT GTTCCGAAAC AGCCGAACCG CCGGCGATGC GAAAATGCGC 

1201 CGTCNGGCTC AGGCGGAATG GCGTTTCGCC ACACGCGGCT TCGATGAAGA 

1251 CAAGCTGCTG ACCGCCGCGC AAACCGCGTT CGACCACGGT TTTTACGATA 

1301 TGGCGGTCAA CAGCGCGGAA CGCACCGACC GCAAACTCAA CTACACCTTG 

1351 CGCTACATTT CGNNNNNTNA NGACACGGTA ATCCGCCACG CGCAAAATGT 

14 01 TAATGTCGAT CCGGCGTGGG TTTACGGGCT GATTCGTCAG GAAAGCCGCT 

14 51 TCGTTATGGG CGCGCAATCC CGCGTAGGCG CGCAGGGGCT GATGCAGGTT 

1501 ATGCCTGCCA CCGCGCGCGA AATCGCCGGC AAAATCGGTA TGGATGCCGC 

1551 ACAACTTTAC ACCGCCGACG GCAATATCCG TATGGGGACG TGGTATATGG 

1601 CGGACACCAA ACGCCGCCTG CAAAACAACG AAGTCCTCGC CACCGCAGGC 

1651 TATAACGCCG GTCCCGG CAG GGCGCGCCGA TGGCAGGCGG ACACGCCCCT 

1701 CGAAGGCGCG GTATATGCCG AAACCATCCC GTTTTCCGAA ACGCGCGACT 

17 51 ATGTCAAAAA AGTGATGGCC AATGCCGCCT ACTACGCCTC CCTCTTCGGC 

1801 GCGCCGCACA TCCCGCTCAA ACAGCGTATG GGCATTGTCC CCGCCCGCTG 

1851 A 

This corresponds to the amino acid sequence <SEQ ID 110; ORF 019.a>: 

a019.pep 

1 MYPPSLKHSL PLLVXLVLAA CSXTNT LSAD KTPADNIETA DLSASVPTXP 

51 AEPEXKTXAD YGGYPSALDA VKQKNDAAVA AYLENAGDSA MAENVRNEWL 

101 KSLGARRQWT LXAXEYAKLE PAXRAQEVEC YADSSRNDYT RAAELVKNTG 

151 KLPSGCTKLL EQAAASGLLD GNDAWRRVRG LLAGRQTTDA RNLAAALGSP 

201 FDGGTQGSRE YALLNVIGKE ARKSPNAAAL LSEMESGLSL EQRS FAWGVL 

251 GHYQSQNLNV PAALDYXGKV ADRRQLTDDQ IEWYARAAXX XRXXXXXAXX 

301 XXXXXXKXXX XXXXXXXXAR SRAATGNTQX AXKLYKQAAA XGXNFYAVLX 

351 GEELGRXIDT RNNVPDAGKX SVLRMAEDGA IKRALVLFRN SRTAGDAKMR 

4 01 RXAQAEWRFA TRGFDEDKLL TAAQTAFDHG FYDMAVNSAE RTDRKLNYTL 
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4 51 RYISXXXDTV IRHAQNVNVD PAWVYGLIRQ ESRFVMGAQS RVGAQGLMQV 

501 MPATAREIAG KIGMDAAQLY TADGNIRMGT WYMADTKRRL QNNEVLATAG 

551 YNAGPGRARR WQADTPLEGA VYAETI PFSE TRDYVKKVMA NAAYYASLFG 

601 APHIPLKQRM GIVPAR+ 



moi9/aoi9 88.9% identity over a 524 aa overlap 

10 20 30 40 50 60 

m019 . pep MYLPSMKHSLPLLAALVLAACSSTNTLPAGKT PADNIETADLSASVPTRPAEPERKTLAD 

j I I I : I I ! I I I I : I I I I I 1 I I I I I I I I I 1 I I II I I I I I I I I I I III!! II II 
a019 MYPPSLKHSLPLLVXLVLAACSXTNTLSADKTPADNIETADLSASVPTXPAE PEXKTXAD 

10 20 30 40 50 60 



70 80 90 100 110 120 

mOl 9 . pep YGGYPSALDAVKQKNDAAVAAYLENAGDSAMAENVRNEWLKSLGARRQWTLFAQEYAKLE 
I M I i I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I f I I I I I I 
a019 YGGYPSALDAVKQKNDAAVAAYLENAGDSAMAENVRNEWLKSLGARRQWTLXAXEYAKLE 

70 80 90 100 110 120 



130 140 150 160 170 180 

m019 . pep PAGRAQEVECYADSSRNDYTRAAELVKNTGKLPSGCTKLLEQAAASGLLDGNDAWRRVRG 
II I I I M I I I I M I I M I I I I I I M I I I M I i I I M I I I M I I II I I I I I I I I I I I I I I 
a019 PAXRAQEVECYADSSRNDYTRAAELVKNTGKLPSGCTKLLEQAAASGLLDGNDAWRRVRG 

130 140 150 160 170 180 



190 200 210 220 230 240 

m019.pep LLAGRQTTDARNLAAALGSPFDGGTQGSREYALLNVIGKEARKSPNAAALLSEMESGLSL 
I II I I I II I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I J I I I I I I I I M I I I I 
a019 LLAGRQTTDARNLAAALGSPFDGGTQGSREYALLNVIGKEARKSPNAAALLSEMESGLSL 

190 200 210 220 230 240 



250 260 270 280 290 300 

m019 . pep EQRSFAWGVLGHYQSQNLNVPAALDYYGKVADRRQLTDDQIEWYARAALRARRWDELASV 

I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I II I I 
a019 EQRSFAWGVLGHYQSQNLNVPAALDYXGKVADRRQLTDDQIEWYARAAXXXRXXXXXAXX 

250 260 270 280 290 300 



310 320 330 340 350 360 

m019.pep ISHMPEKLQKSPTWLYWLARSRAATGNTQEAEKLYKQAAATGRNFYAVLAGEELGRKIDT 
I : I I I I t I I I I I I 1 I I I I I I I I : I I I I I I I I I I I I I III 

a019 XXXXXXKXXXXXXXXXXXARSRAATGNTQXAXKLYKQAAAXGXNFYAVLXGEELGRXIDT 

310 320 330 340 350 360 



370 380 390 400 410 420 

mOl 9 . pep RNN V PDAGKN S VRRMAE DG AVKRAL VL FQN S QS AG DAKMRRQAQAEWR FATRG FDE DKLL 

! I I I I I II I II i I I I I I I : I I I I I I I : I I :: I I M I I I I I I I I I I I I II M I I I I I I 
a019 RNNVPDAGKXSVLRMAEDGAIKRALVLFRNSRTAGDAKMRRXAQAEWRFATRGFDEDKLL 

370 380 390 400 410 420 



430 440 450 4 60 470 480 

m019.pep TAAQTAFDHGFYDMAVNSAERTDRKLNYTLRYISPFKDTVIRHAQNVNVDPAWVYGLIRQ 
I I I I I I I I I I I I I I I M I I I I I I I I I I I II I I I I I I I I I I I I I II I I I I 1 I I I I I I I 
a019 TAAQTAFDHGFYDMAVNSAERTDRKLNYTLRYISXXXDTVIRHAQNVNVDPAWVYGLIRQ 

430 440 450 4 60 470 480 



490 500 510 520 

m019 . pep E S R FV I G AQS RV G AQG LMQVM PAT ARE I AGK I GM D AAQL YT A DG 

M I I I : I I I I I I I I I I I I I I I I I I I I I I I M I I M I I I I I I I I I 
a019 E SR FVMG AQS RVG AQG LMQVM PAT ARE I AGK I GMDAAQ L YT ADGN I RMGT W YMADT KRRL 

490 500 510 520 ■ 530 540 



a019 



QNNEVLATAGYNAGPGRARRWQADTPLEGAVYAETIPFSETRDYVKKVMANAAYYASLFG 
550 560 570 580 590 600 
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Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 019 shows 95.5% identity over a 89 aa overlap with a predicted ORF (ORF 019.ng) 
from N. gonorrhoeae: 

g019/m019 

10 20 30 40 49 

g019 . pep LLAALVLAACSSTNTLPAGKTPADNIETADLSASVPTRPAEPEGKTLAD 

MllllllMIIIIIIIIIIIMMMIIilllllllllllll Mill 

m019 MYLPSMKHSLPLLAALVLAACSSTNTLPAGKTPADNI ETADLS ASVPTRPAEPERKTLAD 

10 20 30 40 50 60 

50 60 70 80 89 

g019 - pep YGGYPSALDAVKQNNDAAAAAYLENAGDSAMAENVRKEWL 

IIIIIIMIIIIhlllhllMIIMMIMMIhlll 

m0 1 9 YGGYPSALDAVKQKNDAAVAAYLENAGDSAMAENVRNEWLKSLGARRQWTLFAQEYAKLE 

70 80 90 100 110 120 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1 1 1>: 

g023 . seq 

1 ATGGTAGAAC GTAAATTGAC CGGTGCCCAT TACGGTTTGC GCGATTGGGT 

51 AATGCAGCGT GCGACTGCGG TTATTATGTT GATTTATACC GTTGCACTTT 

101 TAGTGGTTCT ATTTGCCCTG C CTAAAGAAT ATCCGGCATG GCAGGCATTT 

151 TTTAGTCAAG CTTGGGTAAA AGTATTTACC CAAGTGAGCT TTATCGCCGT 

2 01 ATTCTTGCAC GCTTGGGTGG GTATCCGCGA TTTGTGGATG GACTATATCA 

2 51 AACCCTTCGG CGTGCGTTTG TTTTTGCAGG TTGCCACCAT TGtCtGGCTG 

3 01 GTCGGCTGCC TCGTGTATTC AGTTAAAGTG ATTTGGGGGT AA 

This corresponds to the amino acid sequence <SEQ ID 1 12; ORF 023. ng>: 

g023 . pep 

1 MVERKLTGAH YGLRDWVMQR ATA V IMLIYT VALLWLFAL P KEYPAWQAF 

51 FSQAWVICVFT QVSFIAVFLH AWVGIRDLWM DYIKPFGVR L FLQVATIVWL 

101 VGCLVYSVKV IWG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 13>: 

m023 - seq 

1 ATGGTAGAAC GTAAATTGAC CGGTGCCCAT TACGGTTTGC GCGATTGGGT 

51 GATG CAACGT GCGACTGCGG TTATTATGTT GATTTATACC GTTGCACTTT 

101 TAGTGGTTCT ATTTTCCCTG C CTAAAGAAT ATTCGGCATG GCAGGCATTT 

151 TTTAG T C AAA CTTGGGTAAA AGTATTTACC CAAGTGAGCT TCATCGCCGT 

2 01 ATTCTTGCAC GCTTGGGTGG GTATCCGCGA TTTGTGGATG GACTATATCA 
251 AACCCTTCGG CGTGCGTTTG TTTTTGCAGG TTGCCACCAT CGTTTGGCTG 

3 01 GTCGGCTGTC TCGTGTATTC AGTTAAAGTG ATTTGGGGGT AA 

This corresponds to the amino acid sequence <SEQ ED 1 14; ORF 023>: 

m023 -pep 

1 MVERKLTGAH YGLRDWVMQR ATA VI MLIYT VALLWLFSL P KEYSAWQAF 

51 FSQTWVKVFT QVSFIAVFLH AWVGIRDLWM DYIKPFGV RL FLQVATIVWL 

101 VGCLVY SVKV IWG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 15>: 

a023 . seq 

1 ATGGTAGAAC GTAAATTGAC CGGTGCCCAT TACGGTTTGC GGGATTGGGC 

51 GATGCAACGT GCGACCGCGG TTATTATGTT GATTTATACC GTTGCACTTT 

101 TAGTGGTTCT ATTTGCTCTG CCTAAAGAAT ATTCGGCATG GCAGGCATTT 

151 TTTAGTCAAA CTTGGGTAAA AGTATTTACC CAAGTGAGCT TCATCGCCGT 
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201 ATTCTTGCAC GCTTGGGTGG GTATCCGCGA TTTGTGGATG GACTATATNA 
251 AACCCTTCGG CGTGCGTTTG TTTTTGCAGG TTGCCACCAT CGTCTGGCTG 
301 GTCGGCTGCT TGGTGTATTC AATTAAAGTA ATTTGGGGGT AA 

This corresponds to the amino acid sequence <SEQ ID 1 16; ORF 023. a>: 

a023 .pep 

1 MVERKLTGAH YGLRDWAMQR ATA V IMLIYT VALLWLFAL P KEYSAWQAF 

51 FSQTWVKV FT QVSFIAVFLH AWVG1 RDLWM DYXKPFGVR L FLQVATIVWL 

101 VGCLVY SIKV IWG + 

m023/a023 96.5% identity over a 1 13 aa overlap 



10 20 30 40 50 60 

m02 3 . pep MVERKLTGAH YGLRDWVMQRATAVIML I YTVALLWLFSLPKEYSAWQAFFSQTWVKV FT 

I I I I M I M I I I I II I : I I I I I M I I I I I 11 1 I I I I I I : I I I I I I I M I I I I I I I I I I I I 
a 0 2 3 MVERKLTGAH YGLRDWAMQRAT AV IML I YT V ALL WL FA L PKEYSAWQAF FSQTWVKV FT 

10 20 30 40 50 60 



70 80 90 100 110 

m023 . pep QVSFIAVFLHAWVGIRDLWMDYIKPFGVRLFLQVATIVWLVGCLVYSVKVIWGX 
I I I I I I I I II I ! I I I I I II I I I I ! I I I I I I I M I I II I ! I I II M 1 : ! I I ! t 1 
a 02 3 QVSFIAVFLHAWVGIRDLWMDYXKPFGVRLFLQVATIVWLVGCLVYSIKVIWGX 

70 80 90 100 110 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from 7V. gonorrhoeae 

ORF 023 shows 97.3% identity over a 1 13 aa overlap with a predicted ORF (ORF 023 .ng) 
from N. gonorrhoeae: 

g023/m023 



10 20 30 40 50 60 

g023 . pep MVERKLTGAHYGLRDVA7MQRATAVIMLIYTVALLWLFALPKEYPAWQAFFSQAWVKVFT 

MIIMIIIIMIIIIIIIIMIIMIIIMIIIIIIhlllll IMMMhIMIII 

m0 2 3 MVERKLTGAHYGLRDWVMQRATAV I ML I YTVALLWLFS LPKEYSAWQAFFSQTWVKVFT 

10 20 30 40 50 60 



70 80 90 100 110 

g023 . pep QV S F I AV FLHAWVG I RDLWMD Y I KP F GVRL FLQ V AT I VWLVGC L V Y S V KV I WGX 

I I III li MMM I I li I III I I I I II ! II II II Ml I II I II M 1 I IIMMi 

m023 QVSFIAVFLHAWVGIRDLWMDYIKPFGVRLFLQVATIVWLVGCLVYSVKVIWGX 

70 80 90 100 110 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1 17>: 

g025 . seq 

1 ATGTTGAAAC AAAcgACACT TTTGGCAGCT TGTACCGCCG TTGCCGCTCT 

51 GTTGGGCGGT TGcgCCACCC AACAGCCTGC TccTGTCATT GCAGGCAATT 

101 CAGGTATGCA GACCGTATCG TCTGCGCCGG TTTACAATCC TTATGGCGCA 

151 ACGCCGTACA ATGCCGCTCC TGCCGCCAac gatgcGCCgT ATGTGCCGCC 

201 CGTGCAAact gcgccggttT ATTCGCCTCC TGCTTATGTT CCGCcgtCTG 

251 CACCTGCCGT TTCGGgtaca tatgtTCCTT CTTACGCACC CgtcgACATC 

301 aacgCGGCGa cgCataCTAT TGTGCGTGGC GACACgGtgt a c a AC ATTTc 

351 caaAcgCtac CATATCTCTC AAGACGATTT CCGTGCGTGG AACGGCATGA 

4 01 CCGACAATAC GTTGAGCATC GGTCAGATTG TTAAAGTCAA ACCGGCaggA 

4 51 TATGCCGCAC CGAAAACCGC AGCCGTAGAA AGCAGGCCCG CCGTACCGGC 

501 TGCCGCGCAA ACCCCTGTGA AACCCGCCGC gcaACCGCCC GTTCAGTCCG 

551 CGCCGCAACC TGCCGCGCCC GCTGCGGAAA ATAAAGCGGT TCCCGCCCCC 

601 GCGCCCGCCC CGCAATCTCC TGCCGCTTCG CCTTCCGGCA CGCGTTCGGT 

651 CGGCGG CATT GTTTGGCAGC GTCCGACCCA AGGTAAAGTG GTTGCCGATT 
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7 01 TCGGCGGCGG CAACAAGGGT GTCGATATTG CCGGCAATGC CGGACAACCC 
751 GTTTTGGCGG CGGCTGACGG CAAAGTGGTT TATGCCGGTT CAGGTTTGAG 

8 01 GGGATACGGA AACTTGGTCA T CAT C C AG C A CAATTCCTCT TTCCTGACCG 
8 51 CGTACGGGCA CAACCAAAAA TTGCTGGTCG GCGAAGGTCA GCAGGTCAAA 
901 CGCGGTCAGC AGGTTGCTTT GATGGGTAAT ACCGATGCTT CCAGAACGCA 
951 GCTTCATTTC GAGGTGCGTC AAAACGGCAA ACCGGTTAAC CCGAACAGCT 

1001 AT AT CG CG T T CTGA 



This corresponds to the amino acid sequence <SEQ ID 1 18; ORF 025. ng>: 

g025 -pep 

1 MLKQTTLLAA CTAVAALLGG CATQQPAPVI AGNSGMQTVS SAPVYNPYGA 

51 TPYNAAPAAN DAPYVPPVQT APVYSPPAYV PPSAPAVSGT YVPSYAPVDI 

101 NAATHTIVRG DTVYNISKRY HISQDDFRAW NGMTDNTLS I GQIVKVKPAG 

151 YAAPKTAAVE SRPAVPAAAQ TPVKPAAQPP VQSAPQPAAP AAENKAVPAP 

2 01 APAPQSPAAS PSGTRSVGGI VWQRPTQGKV VADFGGGNKG VDIAGNAGQP 

251 VLiAAADGKW YAGSGLRGYG NLVIIQHNSS FLTAYGHNQK LLVGEGQQVK 

301 RGQQVALMGN TD AS RTQ liH F EVRQNGKPVN PNSYIAF* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 19>: 



. seq 


(partial ) 










1 


. . GTGCCGCCGG 


TGCAAAGCGC 


G CCGGTTT AT 


ACGCCTCCTG 


CTTATGTTCC 


51 


GCCGTCTGCA 


CCTGCCGTTT 


CGGGTACATA 


CGTTCCTTCT 


TACGCACCCG 


101 


TCGACATCAA 


CGCGGCGACG 


CATACTATTG 


TGCGCGGCGA 


CACGGTGTAC 


151 


AACATTTCCA 


AACGCTACCA 


TATCTCTCAA 


GACGATTTCC 


GTGCGTGGAA 


201 


CGGCATGACC 


GACAATACGT 


TGAGCATCGG 


TCAGATTGTT 


AAAGTCAAAC 


251 


CGGCAGGATA 


TGCCGCACCG 


AAAGCCGCAG 


CCGTAAAAAG 


CAGGCCCGCC 


301 


GTACCGGCTG 


CCGCGCAACC 


GCCCGTACAG 


TCCGCACCCG 


TCGACATTAA 


351 


CGCGGCGACG 


CATACTATTG 


TGCGCGGCGA 


CACGGTGTAC 


AACATTTCCA 


401 


AACGCTACCA 


TATCTCTCAA 


GACGATTTCC 


GTGCGTGGAA 


CGGCATGACC 


451 


GACAATATGT 


TGAGCATCGG 


TCAGATTGTT 


AAAGTCAAAC 


CGGCAGGATA 


501 


TGCCGCACCG 


AAAACCGCAG 


C CGTAG AAAG 


CAGGCCCGCC 


GTACCGGCTG 


551 


CCGTGCAAAC 


CCCTGTGAAA 


CCCGCCGCGC 


AACCGCCTGT 


GCAGTCCGCG 


601 


CCGCAACCTG 


CCGCGCCCGC 


TGCGGAAAAT 


AAAGCGGTTC 


CCGCGCCCGC 


651 


CCCGCAATCT 


CCTGCCGCTT 


CGCCTTCCGG 


CACGCGTTCG 


GTCGGCGGCA 


701 


TTGTTTGGCA 


GCGTCCGACG 


CAAGGTAAAG 


TGGTTGCCGA 


TTTCGGCGGC 


751 


AACAACAAGG 


GTGTCGATAT 


TGCCGGTAAT 


GCGGGACAGC 


CCGTTTTGGC 


801 


GGCGGCTGAC 


GGCAAAGTGG 


TTTATGCCGG 


TTCAGGTTTG 


AGGGGATACG 


851 


GAAACTTGGT 


CATC AT C C AG 


CATAATTCTT 


CTTTCCTGAC 


CGCATACGGG 


901 


CAC AAC C AAA 


AATTGCTGGT 


CGGCGAGGGG 


CAGCAGGTCA 


AACGCGGTCA 


951 


GCAGGTTGCT 


TTGATGGGCA 


ATACCGATGC 


TTCCAGAACG 


CAGCTTCATT 


1001 


TCGAGGTGCG 


TCAAAACGGC 


AAACCGGTTA 


ACCCGAACAG 


CTATATCGCG 


1051 


TTCTGA 











This corresponds to the amino acid sequence <SEQ ID 1 10; ORF 025>: 

m025.pep (partial) 

1 . .VPPVQSAPVY TPPAYVPPSA PAVSGTYVPS YAPVDINAAT HTIVRGDTVY 

51 NISKRYHISQ DDFRAWNGMT DNTLSIGQIV KVKPAGYAAP KAAAVKSRPA 

101 VPAAAQPPVQ SAPVDINAAT HTIVRGDTVY NISKRYHISQ DDFRAWNGMT 

151 DNMLSIGQIV KVKPAGYAAP KTAAVESRPA VPAAVQTPVK PAAQPPVQSA 

201 PQPAAPAAEN KAVPAPAPQS PAASPSGTRS VGGIWQRPT QGKWADFGG 

251 NNKGVDIAGN AGQPVLAAAD GKWYAGSGL RGYGNLVIIQ HNSS FLTAYG 

3 01 HNQKLLVGEG QQVKRGQQVA LMGNTDASRT QLHFEVRQNG KPVNPNSYIA 

351 F* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 1 1>: 

a025 . seq 

1 ATGTTGACAC CAACAACACT TTAGGTAGCT TGTACCGCCC TTGCCGCTCA 

51 GTTGGGCGGA TGCCCCACCC AACACCCTTC TCCTGTCATT GCAGGCAATT 

101 CAGGTATGCA GACCGTACCG TCTGCGCCGG TTTACAATCC TTATGGCGCA 
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151 ACGCCGTACA ATGCCGCTCC TGCCGCCAAC GATGCGCCGT ATGTGCCGCC 

201 GGTGCAAAGC GCGCCGGTTT ATANGCCTCC TGCTTATGTT CCGCCGTCTG 

2 51 CACCTGCCGT TTCGGGTACA TACGTTCCTT CTTACGCANC CGTCGACATC 
301 AACGCGGCGA CCCATACTAT TGTGCGCGGC GACACCGTGT ACAAGATTTC 

3 51 CAAATGCTAC CATATCTCTC AAGACGATTT CCGTGCGTGG AACGGCATGA 

4 01 CCGACAATAC GTTGAGCATC GGTCAGATTG TTAAAGTCAA ACCGGCAGGA 
4 51 TATGCCGCAC CGAAAGCCGC AGCCGTAAAA AGCAGGCCCG CCGTACCGGC 
501 TGCCGCGCAA CCGCTCGTAC AGTCCGCACC CGTCGACATC AACGCGGCGA 
551 CGCATACTAT TGTGCGCGGC GACACGGTGT ACAACATTTC CAAACGCTAC 
601 CATATCTCTC AAGACGATTT CCGTGCGTGG AACGGCATGA CCGACAATAC 
651 GTTGAGCATC GGTCAGATTG TTAAAGTCAA ACCGGCAGGA TATGCCGCAC 
7 01 CGAAAGCCGC AGCCGTAAAA AGCAGGCCCG CCGTACCGGC TGCCGTGCAA 
7 51 ACCCCTGTGA AACCCGCCGC GCAACCGCCT GTGCAGTCCG CGCCGCAACC 
801 TGCCGCGCCC GCTGCGGAAA ATAAAGCGGT TCCCGCGCCC GCCCCGCAAT 
851 CTCCTGCCGC TTCGCCTTCC GGCACGCGTT CGGTCGGCGG CATTGTTTGG 
901 CAGCGTCCGA CGCAAGGTAA AGTGGTTGCC GATTTCGGCG GCAACAACAA 
951 GGGTGTCGAT AT T G C AG G AA ATGCGGGACA GCCCGTTTTG GCGGCGGCTG 

1001 ACGGCAAAGT GGTTTATGCA GGTTCCGGTT TGAGGGGATA CGGCAATTTG 

1051 GTCATCATCC AGCATAATTC TTCCTTCCTG ACCGCATACG GGCACAACCA 

1101 AAAATTGCTG GTCGGCGAAG GCCAGCAGGT CAAACGCGGG CAGCAGGTCG 

1151 CTTTGATGGG CAATACCGAG GCTTCTAGAA CGCAGCTTCA TTTCGAGGTG 

1201 CGGCAAAACG GCAAACCGGT TAATCCGAAC AGCTATATCG CGTTCTGA 

This corresponds to the amino acid sequence <SEQ ID 1 12; ORF 025.a>: 

a025 . pep 

1 MLTPTTL*VA CTALAAQLG G CPTQHPSPVI AGNSGMQTVP SAPVYNPYGA 

51 TPYNAAPAAN DAPYVPPVQS APVYXPPAYV PPSAPAVSGT YVPSYAXVDI 

101 NAATHT I VRG DTVYKISKCY HISQDDFRAW NGMTDNTLSI GQIVKVKPAG 

151 YAAPKAAAVK SRPAVPAAAQ PLVQSAPVDI NAATHT I VRG DTVYNISKRY 

201 HISQDDFRAW NGMTDNTLSI GQIVKVKPAG YAAPKAAAVK SRPAVPAAVQ 

251 TPVKPAAQPP VQSAPQPAAP AAENKAVPAP APQSPAASPS GTRSVGGIVW 

301 QRPTQGKWA DFGGNNKGVD IAGNAGQPVL AAADGKVVYA GSGLRGYGNL 

351 VIIQHNSSFL TAYGHNQKLL VGEGQQVKRG QQVALMGNTE ASRTQLHFEV 

4 01 RQNGKPVNPN SYIAF* 

m025/a025 97.4% identity over a 351 aa overlap 

10 20 30 

m025 . pep VPPVQSAPVYTPPAYVPPSAPAVSGTYVPS 

I I I I I I I I i I : I I I It i I I I I I I I ! II I I I 
a02 5 GMQTVPSAPVYNPYGATPYNAAPAANDAPYVPPVQSAPVYXPPAYVPPSAPAVSGTYVPS 
40 50 60 70 80 90 

40 50 60 70 80 90 

m025.pep YAPVDINAATHTIVRGDTVYNISKRYHISQDDFRAWNGMTDNTLSIGQIVKVKPAGYAAP 
II I I I I I I I I M II I I I I I : I II I M I I I II M I I I I I ! I I I I M II I I 1 I I I I I I I I 
a025 YAXVDINAATHTIVRGDTVYKISKCYHISQDDFRAWNGMTDNTLSIGQIVKVKPAGYAAP 
100 110 120 130 140 150 

100 110 120 130 140 150 

m025 . pep KAAAVKSRPAVPAAAQPPVQSAPVDINAATHTIVRGDTVYNISKRYHISQDDFRAWNGMT 

I I I I I I I I f I I I I I I I I I M I I I I I I I I II I II II I I I I I I I I I I M I I M I II I I I I I 
a025 KAAAVKSRPAVPAAAQPLVQSAPVDINAATHTIVRGDTVYNISKRYHISQDDFRAWNGMT 

160 170 180 190 200 210 

160 170 180 190 200 210 

m02 5 . pep DNMLSIGQIVKVKPAGYAAPKTAAVESRPAVPAAVQTPVKPAAQPPVQSAPQPAAPAAEN 

II I I I M I I I I I I I I f I I I I : I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
a025 DNTLSIGQIVKVKPAGYAAPKAAAVKSRPAV PAAVQT PVKPAAQPPVQSAPQPAAPAAEN 

220 230 240 250 260 270 

220 230 240 250 260 270 

m025 . pep KAVPAPAPQSPAASPSGTRSVGGIVWQRPTQGKWADFGGNNKGVDIAGNAGQPVLAAAD 
I I I I I I I I I I I I I I II I M I I I I I I M I I I I I I I M I I I I I II I I I I I I I I I II I I I I I I 
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a0 25 KAVPAPAPQSPAASPSGTRSVGGIVWQRPTQGKVVADFGGNNKGVDIAGNAGQPVLAAAD 
280 290 300 310 320 330 

280 290 300 310 320 330 

m025 . pep GKWYAGSGLRGYGNLVIIQHNSSFLTAYGHNQKLLVGEGQQVKRGQQVALMGNTDASRT 
| M f I M I I I I I I I II II t II 1 I I I I I i I I i I I I I ! I M I I I I M I I I M I I I I I : I II I 
a02 5 GKVVYAGSGLRGYGNLVIIQHNSSFLTAYGHNQKLLVGEGQQVKRGQQVALMGNTEASRT 
340 350 360 370 380 390 

340 350 
m02 5.pep QLHFEVRQNGKPVN PNSYIAFX 

II I I I I I I M M t II I I I I I I I 
a025 QLHFEVRQNGKPVNPNSYIAFX 
400 410 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 025 shows 75.6% identity over a 353 aa overlap with a predicted ORF (ORF 025.ng) 
from N. gonorrhoeae: 

m025/g025 

10 20 30 

m025 pep VPPVQSAPVYTPPAYVPPSAPAVSGTYVPS 

llllhllthMIIIIIIMIIIIIMII 

g 025 GMQTVSSAPVYNPYGATPYNAAPAANDAPYVPPVQTAPVYSPPAYVPPSAPAVSGTYVPS 
40 50 60 70 80 90 

40 50 60 70 80 90 

m0 2 5 pep YAP VD I NAATHT I VRGDTVYN I S KRYH I SQDDFRAWNGMTDNTLS I GQ I VKVKP AG YAAP 

MIIIMIIIIIIIIMIIIIIIIIIIMIMIMIIIIIIIIMMMIMIIMIIII 

g 025 Y AP VD I NAATHT I VRGDTVYN I S KRYH I S QDD FRAWNGMTDNTLS I GQ I VKVKP AG YAAP 

100 110 120 130 140 150 

100 110 120 130 140 150 

m025 . pep KAAAVKS RP AV PAAAQPPVQSAP VI) I NAATHT I VRGDTVYN IS KRYH IS QDD FRAWNGMT 

I 

g025 K 



160 170 180 190 200 210 

m025 pep DNMLS I GQ I VKVKP AGYAAPKTAAVESRPAVPAAVQTPVKPAAQPPVQSAPQPAAPAAEN 

llllllllllllhlllllllllllllllllllllllll 

g025 TAAVESRPAVPAAAQTPVKPAAQPPVQSAPQPAAPAAEN 

160 170 180 190 

220 230 240 250 260 

m025 pep KAVPAPAP- -QSPAASPSGTRSVGGIVWQRPTQGKWADFGGNNKGVDIAGNAGQPVLAA 

MMIIII MINI I II IIIMM I MMIinMI I! IMM!! 1 I IIMI 1 MM 

g02 5 KAVPAPAPAPQSPAASPSGTRSVGGIVWQRPTQGKWADFGGGNKGVDIAGNAGQPVLAA 
200 210 220 230 240 250 

270 280 290 300 310 320 

mO 2 5 - pep ADGKWYAGSGLRGYGNLVI IQHNSSFLTAYGHNQKLLVGEGQQVKRGQQVALMGNTDAS 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I M 1 M M 1 1 1 1 i 1 1 1 1 It I i 1 1 1 1 1 1 1 ! I M 1 1 1 1 1 1 I 1 1 M 

g 0 2 5 ADGKWYAGSGLRG YGNLV 1 1 QHNS S FLTAYGHNQKLLVGEGQQVKRGQQVALMGNTDAS 

260 270 280 2 90 3 00 310 



ni0 2 5 - pep 



330 340 350 

RTQLH FEVRQNGKP VN PNSYI AFX 
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g 0 2 5 RTQLHFEVRQNGKPVNPNS Y I AFX 

320 330 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1 13- 

g031 . seq 

1 ATGGTGTCCC TCCGCTTCAG ATTCGGCAAC CACTTTAAAC GCCGACATTC 

51 TGACAATTTC CTTTTCCGCC AGCCAAATAT CATGCGTATC TTTCGGTTCG 

101 GGCTTGTTGG GCATGGCAAC CTTCAACAGC CGCGCCATCA CAGGAATCGT 

151 CGTTCCCTGA ATCAGCAGCG ACAGCACCAC CACGGCAAAC GCCACATCAA 

2 01 ACAGCAGGTG CGAATTGGGA ACGCCCATCA CCAGCGGCAT CATCGCCAGC 

251 GAAATCGGTA CGGCTCCTCG CAAGCCCAAC CAACTGATAT ACGCCTTTTC 

301 ACGCAGGCTG TAATTGAATT TCCACAAACC GCCGAACACT GCCAGCGGAC 

351 GCGCGACCAG CATCAGGAAC GCCGCAATCG CCAAGGCTTC CGCCGCCCTG 

4 01 TCCAACACGC CGGCGGGAGA AACCAGCAGA CCGAGCATGA CGAACAAAGT 

4 51 TGCCTGCGCC AGCCAAGCCA AACCGTCCAT C AC ACG C AAA ACGTGTTCCG 

501 TcgcACGGTT GCGCTGGTTA CCGACAATGA TGCCGGCAAG GTAAACCGCC 

551 AAAAAGCCGC TGCCGCCTAT GGTATTGGTA AACGCAAACA CAAGCAGCCC 

601 GCCCGACACA ATCATCAGCG CGTACAGACC TTCCGtacac acctccaatt 

651 cccaatcaac gtcatagctg tctcccgtgt taaaatgttc ttcacttcag 

701 aatccccccc ttcttcccag cccgaaacct tcatgtgtta naccctgggg 

751 tgccccaacg gatttagtaa cctcccaatg actctgcttg tcgccccctt 

801 cgcccgcttt ctccttccgg gaaaacttgt tgtccccgtc ttacattaa 



This corresponds to the amino acid sequence <SEQ ID 1 14; ORF 031.ng>: 

g031 .pep 

1 MVSLRFRFGN HFKRRHSDNF LFRQPNIMRI FRFGLVGHGN LQQPRHHRNR 

51 RSLNQQRQHH HGKRHIKQQV RIGNAHHQRH HRQRNRYGSS QAQPTDIRLF 

101 TQAVIEFPQT AEHCQRTRDQ HQERRNRQGF RRPVQHAGGR NQQTEHDEQS 

151 CLRQPSQTVH HTQNVFRRTV ALVTDNDAGK VNRQKAAAAY GIGKRKHKQP 

201 ARHNHQRVQT FRTHLQFPIN VIAVSRVKMF FTSESPPSSQ PETFMCXTLG 

251 CPNGFSNLPM TLLVAPFARF LLPGKLWPV LH* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ED 1 15>: 

m031.seq (partial) 

1 ... CGCCTGAAGC ACGGTGTCGG ACTGCATTTC TATTCGGCTA TACGCCTTTT 

51 CACGCAGGCT GTAATTGAAT TTCCACAAAC CGCCGAACAC TGCCGACGGA 

101 CGCGCGACCA GCATCAGGAA CGCCGCAATC GCCAAgGCTT CCGCCGCCCT 

151 GTCCAACACG TTGGCAGGAG AAACCAGCAG CAAAGGCATT CCCAAACGTG 

201 CGGACAAAGT GGTCGAAACC ACGCTCAGAA ACAACAGTGC GCCACCCGGC 

251 AG. . . . 

This corresponds to the amino acid sequence <SEQ ID 1 16; ORF 03 1>: 

mO 3 1 . pep (partial ) 

1 . . . RLKHGVGLHF YSAIRLFTQA VIEFPQTAEH CRRTRDQHQE RRNRQGFRRP 
51 VQHVGRRNQQ QRHSQTCGQS GRNHAQKQQC ATRQ. . . . 



The following partial DNA sequence was identified in Af meningitidis <SEQ ID 1 17>: 

a031 . seq 

1 ATACGCCTTT TCACGCAGGC TGTAATTGAA TTTCCACAAA CCGCCGAACA 

51 CTGCCGGCGG ACGCGCGACC AGCATCAGGA ACGCCGCAAT CGCCAAGGCT 

101 TCCGCCGCCC CGTCCAACAC GTTGGCAGGA GAAACCAGCA GCAAAGGCAT 

151 TCCCAAACGT GCGGACAAAG TGGTCGAAAC CACGCTCAGA AACAACAGTG 

2 01 CGCCACCCGG CAG 

This corresponds to the amino acid sequence <SEQ ID 1 18; ORF 031.a>: 

a031.pep (partial) 

1 IRLFTQAVIE FPQTAEHCRR TRDQHQERRN RQGFRRPVQH VGRRNQQQRH 
51 SQTCGQSGRN HAQKQQCATR Q 
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m03i/a03i 100.0% identity over a 71 aa overlap 



10 20 30 40 50 60 

m031 .pep RLKHGVGLHFYSAIRLFTQAVIEFPQTAEHCRRTRDQHQERRNRQGFRRPVQHVGRRNQQ 

j I E I I 1 I I 1 I I I I M I I I M 1 I I I I I t II ! I II I I I I I I I I I I I I I 1 
a031 IRLFTQAVIEFPQTAEHCRRTRDQHQERRNRQGFRRPVQHVGRRNQQ 

10 20 30 40 

70 80 
m031 . pep QRHSQTCGQSGRNHAQKQQCATRQ 
I I I M ! I I I I I I I f I I I I I I I I I I 
a031 QRHSQTCGQSGRNHAQKQQCATRQ 
50 60 70 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 031 shows 60.0% identity over a 85 aa overlap with a predicted ORF (ORF 031.ng) 
from N. gonorrhoeae: 

m031/g031 

10 20 30 

m031 . pep RLKHGVGLHFYSAIRLFTQAVIEFPQTAEH 

I : : I : : I I I I I I I I I I M I I I It 

g 031 NQQRQHHHGKRHIKQQVRIGNAHHQRHHRQRNRYGSSQAQPTDIRLFTQAVIEFPQTAEH 
60 70 80 90 100 110 

40 50 60 70 80 

m031 . pep CRRT R DQHQERRNRQG FRR PVQHVGRRNQQQRH S - QT CGQ SGRNHAQKQQC ATRQ 

I : I I I M I I I 1 I I I I I I I I I I I 1 : I Mil : I : I : I : : : : : : I : I : 
g 031 CQRTRDQHQERRNRQGFRRPVQHAGGRNQQTEHDEQSCLRQPSQTVHHTQNVFRRTVALV 
120 130 140 150 160 170 

g 031 TDNDAGKVNRQKAAAAYGIGKRKHKQPARHNHQRVQT FRTHLQFPINVIAVSRVKMFFTS 

180 190 200 210 220 230 



The following partial DNA sequence was 

g032 . seq 

1 ATGCGGCGAA ACGTGCCTGC 

51 GGCGTTTTTG GATTTGGCGT 

101 AACAGGGCTT TGCCGTCCGA 

151 CAAGGCTTCC ACGCTTTTGC 

201 CTTTGCCGGT AACGTGTACC 

251 TTCAAGCCGT GTATCTCGCT 

301 GAACAGCGCG TGGTCGCGCA 

351 AATCCAACAT CCGGTTCAGC 

4 01 TCGGGCTGCT TCGGCGGTTT 

4 51 CCCGCGTTTG ACCAGCCAGG 

501 GCGCCAGCGT CCGACCGTCC 

551 GccgTAAAAT CGCGCCGCGA 

601 AGACGGCATT TGTGCCAACA 

651 ATGCCGCAAC CGTGTGCTCC 

7 01 TAAGCGTCAA AATacgccgC 

7 51 CAAATATCAA AAAACCAGTG 



identified in N. gonorrhoeae <SEQ ID 1 19>: 



CGTCGCCGTA 
TGGCTCAGGC 
TGCCGTCTGA 
CGGTCAGCGG 
CACGCTTCGT 
CACGCGCAAA 
CCGCCAGCGG 
CATTCCTGCG 
GATGTCGGCG 
CGCAATACTG 
AAACCGCGCT 
CAAGTCCTGC 
GTGCAAACAG 
GACTTGCCCT 
AAACccgTCC 
A 



TTGCGCCGCC 
GCGTGCCGTT 
CGCAGCGGCA 
AACCTGACGC 
CCAAATATAC 
CCGCTGCCGT 
GTCGCCGCCG 
GCAGGGCTTT 
GTCGGGTCGG 
CCGCCACGCC 
GCGACAGCCG 
GGCACGCcgc 
TTCTTCCAAA 
CGCGCACGAT 
AAAACCATAA 



CACGATTCGA 
CCTGCCGGTA 
GATAGTTTTT 
TGCTTGCGCC 
ATCATCTGCA 
ACATCAGCTC 
TCCACGGGCA 
GGGTACGCGC 
CGCGCATCAG 
GACAGCTTGC 
CCGCAACGCC 
ctgcaTCTTC 
TCGCGCCGGT 
GTTTTCCAAA 
CCGTCCCACA 



This corresponds to the amino acid sequence <SEQ ID 1 10; ORF 032.ng>: 

g032 .pep 

1 MRRNVPAVAV LRRPRFEAFL DLALAQARAV PAGKQGFAVR CRLTQRQIVF 
51 QGFHAFAGQR NLTLLAPFAG NVYPR FVQIY IICIQAVYLA HA QTAAVHQL 
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101 EQRVVAHRQR VAAVHGQIQH PVQPFLRQGF GYALGLLRRF DVGGRVGAHQ 

151 PAFDQPGAIL PPRRQLARQR PTVQTALRQP PQRRRKIAPR QVLRHAACIF 

201 RRHLCQQCKQ FFQIAPVCRN RVLRLALAHD VFQISVKIRR KPVQNHNRPT 

251 QISKNQ* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ED 1 1 1>: 

m032 . seq (partial) 

1 ATGCGGCGAA ACGTGCmTGC mGTCGCCGTT kTGCGCCGCC CATTGCGCCA 

51 AACGTTTTTG GATTTGGCGT TGGCTCAGGC GCGTGCCGTT CCTGCCGGTA 

101 AACAGGGCTT TGCCGTCCGA TGCCGTCTGA CGCAGCGGCA GATAGTTTTT 

151 CAGGGCTTCC ACGCTTTTGC CGACCAGCGG CACCTGCCGC TgTT . GCGCC 

201 CTTTGCCGAT AAcGTGTACC CACGCyTCGT CCAAATAGAC ATCATCTGCA 

251 TTCAAGCCGT GTATCTCGCT CACGCGCAAA CCGCTGCCGT ACATCAGTTC 

301 GAACAGGGCG TGGTCGCGCA CCGCCAGCGG GTCGCCGCCG TCCACGGGCA 

351 AATCCAGCAT CCGGTTCAGC CATTCCTGCG GCAGGGCTTT GGGTACGCGC 

4 01 TCGGGCTGCT TCGGCGGTTT GATGTCGGCG GTCGGGTCGG CGTGCATCAG 

4 51 GCCGCGCTTT ACCAGCCAAA CGCAATACTG CCGCCAAGAC GAAAGCTTGC 

501 GAGCCAGCGT CCGTTCCCCC AAACCGCG . . . 

This corresponds to the amino acid sequence <SEQ ID 1 12; ORF 032>: 

m032 . pep (partial > 

1 MRRNVXAVAV XRRPLRQTFL DLALAQARAV PAGKQGFAVR CRLTQRQIVF 

51 QGFHAFADQR HLPLXAPFAD NVYPR XVQID I1CIQAVYLA HA QTAAVHQF 

101 EQGWAHRQR VAAVHGQIQH PVQPFLRQGF GYALGLLRRF DVGGRVGVHQ 

151 AALYQPNAIL PPRRKLASQR PFPQTA. . . 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 13>: 

a032 . seq 

1 ATGCGGCGAA ACGTGCCTGC CGTCGCCGTT TTGCGCCGCC CATTGCGCCA 

51 AACGTTTTTG GATTTGGCGT TGGCTCAGGC GCGTGCCGTT CCTGCCGGTA 

101 AACAGGGCTT TGCCGTCCGA TGCCGTCTGA CGCAGCGGCA GATAGTTTTT 

151 CAGGGCTTCC ACGCTTTTGC CGGTCAGCGG AACCTGCCGC TGCTTGCGTC 

201 CTTTGCCGGT AACGTGTACC CACGCCTCGT CCAAATATAC ATCATCTGCA 

251 TTCAAGCCGT GTATCTCGCT CACGCGCAAA CCGCTGCCGT ACATCAGTTC 

301 GAACAGCGCG TGAT CGCGCA CCGCCAGCGG GTCGCCGCCG TCCACGGGCA 

351 AATCCAGCAT CCGGTTCAGC CATTCCTGCG GCAGGGCTTT GGGTACGCGC 

4 01 TCGGGCTGCT TCGGCGGTTT GATGTCGGCG GTCGGGTCGG TATGCAGCAG 

4 51 ACCGCGTTTG ACCAGCCAGG CGCAATACTG CCGCCAAGAC GACAGCTTGC 

501 GCGCCAGCGT CCGCGCATTC AAACCGCGCT GCGACAGCCG CCGCAACGCC 

551 GCCGTAAAAT CGCGCTGCGA CAAGCCCTGC GGCACGCCGC CTGCATCTTC 

601 AGACGGCATT TGTGCCAACA GCGCAAACAG TTCTTCCAAA TCGCGCCGGT 

651 ATGCCGCCAC CGTGTGCTCC GACTTGCCCT CGCGCACGAT GTTTTCCAAA 

7 01 TAAGCGTCAA AATGCGCCGC AAACCCGTCC AAAAC CAT AA CCGCCCCACA 

7 51 CAAATATCAA AAAAACAGTG A 

This corresponds to the amino acid sequence <SEQ ID 1 14; ORF 032. a>: 

a032 .pep 

1 MRRNV PAVAV LRRPLRQTFL DLALAQARAV PAGKQGFAVR CRLTQRQIVF 

51 QGFHAFAGQR NLPLLASFAG NVYPR LVQIY IICIQAVYLA HA QTAAVHQF 

101 EQRVIAHRQR VAAVHGQIQH PVQPFLRQGF GYALGLLRRF DVGGRVGMQQ 

151 TAFDQPGAIL PPRRQLARQR PRIQTALRQP PQRRRKIALR QALRHAACIF 

201 RRHLCQQRKQ FFQIAPVCRH RVLRLALAHD VFQISVKMRR KPVQNHNRPT 

251 QISKKQ* 

m032/a032 88.1% identity over a 176 aa overlap 

10 20 30 40 50 60 

m032 . pep MRRNVXAVAVXRRPLRQTFL DLALAQARAV PAG KQGFAVRCRLTQRQ I VFQGFHAFADQR 

1 I I I I MM M I M M I M M M I M M M M M M M M M M M M M M M I M 
a032 MRRNVPAVAVLRRPLRQTFLDLALAQARAVPAGKQGFAVRCRLTQRQIVFQGFHAFAGQR 

10 20 30 40 50 60 
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70 80 90 100 110 120 

m032 . pep HLPLXAPFADNVYPRXVQIDIICIQAVYLAHAQTAAVHQFEQGWAHRQRVAAVHGQIQH 
: I I I I i I Mill Ml I I I I I I I I I ! I I I I I M I I I II I : I I I II I I I I II I I I I 
a03 2 NLPLLASFAGNVYPRLVQIYIICIQAVYLAHAQTAAVHQFEQRVIAHRQRVAAVHGQIQH 

70 80 90 100 110 120 

130 140 150 160 170 

m032 . pep PVQPFLRQGFGYALGLLRRFDVGGRVGVHQAALYQPNAILPPRRKLASQRPFPQTA 
M I I I I M M I I M M I I 1 M I I I I I I : : I : I : ! I : M M I II : I I ill III 
a032 PVQPFLRQGFGYALGLLRRFDVGGRVGMQQTAFDQPGAILPPRRQLARQRPRIQTALRQP 

130 140 150 160 170 180 

a032 PQRRRKIALRQALRHAACIFRRHLCQQRKQFFQIAPVCRHRVLRLALAHDVFQISVKMRR 

190 200 210 220 230 240 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 032 shows 86.4% identity over a 176 aa overlap with a predicted ORF (ORF 032.ng) 
from N. gonorrhoeae: 

m032/g032 

10 20 30 40 50 60 

MRRNVXAVAVXRRPLRQTFLDLALAQARAVPAGKQGFAVRCRLTQRQIVFQGFHAFADQR 
1 I I I I I I I I III :: I I 1 I I I I II I I I I I I I M I 1 I M I I II I I I I I I M I I I I II 
MRRNVPAVAVLRRPRFEAFLDLAIAQARAVPAGKQGFAVRCRLTQRQIVFQGFHAFAGQR 
10 20 30 40 50 60 

70 80 90 100 110 120 

HLPLXAPFADNVYPRXVQIDIICIQAVYLAHAQTAAVHQFEQGVVAHRQRVAAVHGQIQH 
: I | III! I I I I I III I I I I I I I I I I I I M I II I I : t I 1 I I I I I I I I M I I I I I I 
N LTL LAP FAGN VY PR FVQ I Y 1 1 C I QAVYLAHAQT AAVHQLEQR WAHRQRVAAVHGQ I QH 
70 80 90 100 110 120 

130 140 150 160 170 

PVQPFLRQGFGYALGLLRRFDVGGRVGVHQAALYQPNAILPPRRKLASQRPFPQTA 
I I | I I I I I II I M I I II II I I II I M I : I I I : 11:1111111:11 ill III 
PVQPFLRQGFGYALGLLRRFDVGGRVGAHQPAFDQPGAILPPRRQLARQRPTVQTALRQP 
130 140 150 160 170 180 

PQRRRKIAPRQVLRHAACIFRRHLCQQCKQFFQIAPVCRNRVLRLALAHDVFQISVKIRR 
190 200 210 220 230 240 



m032 . pep 
g032 

m032 . pep 
g032 

m032 . pep 
g032 

g032 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1 15>: 

g033.seq 

1 ATGGCGGCGG CGGACAAACT CTTGGGCGGC GACCGCCGCA GCGTCGCCAT 

51 CATCGGAGAC GGCGCGATGA CGGCGGGGCA GGCGTTTGAA GCCTTGAATT 

101 GCGCGGGCGA TATGGATGTG GATTTGCTGG TCGTCCTCAA CGACAACGAA 

151 ATGTCGATTT CCCCCAACGT CGGCGCGTTG CCCAAATATC TTGCCAGCAA 

201 CGTCGTGCGC GATATGCACG GACTGTTGAG TACCGTCAAA GCGCAAAcgg 

251 G CAAGGT ATT AGACAAAATA CCCGGCGCGA TGGagtTTGC CCAAAAAGTC 

3 01 GAACAcaaaA TCAAAACCCT TGCCGAAGAA GCCGAACACG CCAAACAGTC 

3 51 GCTGTCGCTG TTTGAAAATT TCGGCTTCCG CTACACCGGC CCCGTGGACG 

4 01 GACACAACGT CGAGAATCTG GTGGACGTAT TGAAAGACTT GCGCAGCCGC 
451 AAAGGCCCTC AGTTGCTGCA CGTCATCACC AAAAAGGGCA ACGGCTACAA 

5 01 ACTCGCCGAA AACGACCCcg tcaAATACCA CGCCGTCGCc aACCTGCcta 
551 AAGAAGGCGG GGCGCAAATg ccGTCTGAAA AAGAACCCAA GCCCGCCgCc 

6 01 aaaccgACCT ATACCCAAGT ATTCGGCAAA TGGCTGTGCG ACCGGGCGGC 
6 51 GGCAGATTCC CGACTGGTTG CGATTACCCC CGCCATGCGC GAGGGCAGCG 
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7 01 GACTGGTGGA GTTTGAACAA CGATTCCCCG ACCGCTATTT CGATGTCGGC 

7 51 ATCGCCGAGC AGCACGCCGT tacCTTTGCC GGCGGTTTGG CGTGCGAAGG 

8 01 CATGAAGCCC GTCGTGGCGA TTTATTCCAC CTTTTTACAA CGCGCCTACG 
851 ACCAACTGGT GCACGACATC GCCCTGCAAA ACCTGCCCGT TTTGTTTGCC 
901 GTCGACCGTG CGGGCATCGT CGGCGCGGAC GGTCCGACCC ATGCCGGCTT 
951 GTACGATTTG AGCTTCTTGC GCTGTGTGCC GAACATGATT GTTGCCGCGC 

10 01 CG AG CGATG A AAACGAATGC CGCCTGCTGC TTTCGACCTG CTATCAGGCG 

1051 GATGCGCCCG CCGCCGTCCG CTATCCGCGC GGCACGGGTA CGGGCGCGCC 

1101 GGTTTCAGAC GG C ATGG AAA CCGTGGAAAT CGGCAAGGGC ATTATCCGCC 

1151 GCGAAGGTGA GAAAACCGCC TTcatTGCCT TCGGCAGTAT GGTCGCCACC 

12 01 GCATTGGCGG TTGCCGAAAA ACTGAACGCC ACCGTCGCCG ATATGCGCTt 

12 51 cgtcaaacCG ATAGACGAAG AGTTGATTGT CCGCCTTGCC CGAAGCCAcg 

13 01 accGCATCGT TACCCTTGAA GAAAACGCCG AACAGGG CGG CGCAGGCGGC 

13 51 GCGGTCTTGG AAGTGTTGGC GAAACACGGC ATCTGCAAAC CCGTTTTGCT 

14 01 TTTGGGCGTT GCCGATACCG TAACCGAACA CGGCGATCCG AAAAAACTTT 
14 51 TGGACGATTT GGGTTTGAGT GCCGAAGCGG TGGAACGCCG GGTGCGCGAG 
1501 TGGCTGCCGG ACCGTGATGC GGCAAATTAA 



This corresponds to the amino acid sequence <SEQ ID 116; ORF 033.ng>: 

g033 .pep 

1 MAAADKLLGG DRRSVAI I GD GAMTAGQAFE ALN C AGDMD V DLLWLNDNE 

51 MSISPNVGAL PKYLASNWR DMHGLLSTVK AQTGKVLDKI PGAME FAQKV 

101 EHKIKTLAEE AEHAKQSLSL FENFGFRYTG PVDGHNVENL VDVLKDLRSR 

151 KGPQLLHVIT KKGNGYKLAE NDPVKYHAVA NLPKEGGAQM PSEKEPKPAA 

201 KPTYTQVFGK WLCDRAAADS RLVAITPAMR EGSGLVEFEQ RFPDRYFDVG 

251 I AEQHAVTFA GGLACEGMKP WAIYSTFLQ RAYDQLVHDI ALQNLPVLFA 

301 VDRAG I VGAD GPTHAGLYDL SFLRCVPNMI VAAPSDENEC RLLLSTCYQA 

351 DAPAAVRYPR GTGTGAPVSD GMETVEIGKG 1 1 RREGE KTA FIAFGSMVAT 

4 01 AIAVA EKLNA TVADMRFVKP IDEELIVRLA RSHDRIVTLE ENAEQGGAGG 

4 51 AVLEVLAKHG ICKPVLLLGV ADTVTEHGDP KKLLDDLGLS AEAVERRVRE 

501 WLPDRDAAN* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ED 1 17>: 

m033 . seq 

1 ATGGCGGCGG CAGACAAACT CTTGGGCAGC GACCGCCGCA GCGTCGCCAT 

51 CATCGGCGAC GGCGCGATGA CGGCGGGGCA GGCGTTTGAA GCCTTGAATT 

101 GCGCaG.CGA TATGGATGTr GATTTGCTrG TCGTCCTCAA CGACAACGAA 

151 ATGTCGATTT CCCCCAACGT CGGCGCGCTG CCGAAATACC TTGCCAGCAA 

201 CGTCGTGCGC GATATGCACG GCCTGTTGAG T AC CGT C AAA GCGCAAACGG 

251 GCAAGGTATT AGACAAAATA CCCGGCGCGA TGGAGTTTGC CCAAAAAGTC 

3 01 GAACACAAAA TCAAAACCCT TGCCGAAGAA GCCGAACACG CCAAACAGTC 
351 GCTGTCTTTG TTTGAAAACT TCGGCTTCCG CTACACCGGC CCCGTGGACG 

4 01 GACACAACGT CGAAAATCTG GTGGACGTAT TGAAAGACTT GCGCAGCCGC 
4 51 AAAGGCCCTC AGTTGCTGCA CGTCATCACC AAAAAGGGCA ACGGCTACAA 
501 ACTCGCCGAA AACGACCCCG TCAAATACCA CGCCGTCGCC AACCTGCCTA 
551 AAGAAAGCGC GGCGCAAATG CCGTCTGAAA AAGAACCCAA GCCCGCCGCC 
6 01 AAACCGACCT AT AC C C AAGT GTTCGGCAAA TGGCTGTGCG ACCGGGCGGC 
6 51 GGCAGATTCC CGACTGGTTG CGATTACCCC CGCCATGCGC GAGGGCAGCG 
701 GCTTGGTTGA GTTTGAACAA CGATTCCCCG ACCGCTATTT CGATGTCGGC 
751 ATCGCCGAGC AGCACGCCGT TACCTTTGCC GGCGGTTTGG CTTGCGAAGG 
801 GATGAAGCCC GTCGTGGCGA TTTATTCCAC CTTTTTACAA CGCGCCTACG 
851 ACCAACTGGT GCACGACATC GCCCTGCAAA ACCTACCCGT TTTGTTTGCC 
901 GTCGACCGCG CGGGCATCGT CGGCGCGGAC GGCCCGACCC ATGCCGGTCT 
951 GTACGATTTG AGCTTTTTGC GCTGCGTGCC GAACATGATT GTCGCCGCGC 

1001 CGAGCGATGA AAACGAATGC CGCCTGTTGC TTTCGACCTG CTATCAGGCA 

1051 GACGCGCCCG CCGCCGTCCG CTATCCGCGC GGCACGGGTA CGGGCGCGCC 

1101 GGTTTCAGAC GGCATGGAAA CCGTGGAAAT CGGCAAGGGC ATTATCCGCC 

1151 GCGAAGGTGA GAAAACCGCA TTCATTGCCT TCGGCAGTAT GGTCGCCCCC 

1201 GCATTGGCGG TTGCCGAAAA ACTGAACGCC ACCGTCGCCG ATATGCGCTT 

12 51 CGTCAAACCG ATAGACGAAG AGTTGATTGT CCGCCTTGCC CGAAGCCACG 
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13 01 ACCGCATCGT TACCCTTGAA 

13 51 GCGGTGCTGG AAGTATTGGC 

14 01 TTTGGGCGTT GCCGATACCG 
14 51 TAGACGATTT GGGCTTGAGT 
1501 TGGCTGTCGG AT CGGGATGC 



GAAAACGCCG AACAGGGCGG CGCAGGCGGC 

GAAACACGGC ATCTGCAAAC CCGTTTTGCT 

TAACCGGACA CGGGGATCCG AAAAAACTTT 

GCCGAAGCGG TGGAACGGCG TGTGCGCGCG 
GGCAAATTAA 



This corresponds to the amino acid sequence <SEQ ID 1 18; ORF 033>: 

m03 3 . pep 

1 MAAADKLLGS DRRSVAI IGD GAMTAGQAFE ALNCAXDMDV DLLWLNDN E 

51 MSISPNVGAL PKYLASNWR DMHGLLSTVK AQTGKVLDKI PGAMEFAQKV 

101 EHKIKTLAEE AEHAKQSLSL FENFGFRYTG PVDGHNVENL VDVLKDLRSR 

151 KGPQLLHVIT KKGNGYKLAE NDPVKYHAVA NLPKESAAQM PSEKEPKPAA 

201 KPTYTQVFGK WLCDRAAADS RLVAITPAMR EGSGLVEFEQ RFPDRYFDVG 

2 51 IAEQHAVTFA GGLACEGMKP WAIYSTFLQ RAYDQLVHD I ALQNLPVLFA 

301 VD RAG I VG AD GPTHAGLYDL SFLRCVPNMI VAAPSDENEC RLLLSTCYQA 

351 DAPAAVRYPR GTGTGAPVSD GMETVEIGKG IIRREGEK TA FIAFGSMVAP 

401 ALAVAEKLNA TVADMRFVKP IDEELIVRLA RSHDRIVTLE ENAEQGGAGG 

4 51 AVLEVLAKHG ICKPVLLLGV ADTVTGHGDP KKLLDDLGLS AEAVERRVRA 

501 WLSDRDAAN* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ED 1 19>: 

a033 . seq 

1 ATGGCGGCGG CGGACAAACA GTTGGGCAGC GACCGCCGCA GCGTCGCCAT 

51 CATCGGCGAC GGCGCGATGA CGGCGGGTCA GGCGTTTGAA GCCTTGAACT 

101 GCGCGGGCGA TATGGATGTG GATTTGCTGG TCGTCCTCAA CGACAACGAA 

151 ATGTCGATTT CCCCCAACGT CGGTGCGTTG CCCAAATACC TTGCCAGCAA 

201 CGTCGTGCGC GATATGCACG GACTGTTGAG TACCGTCAAA GCGCAAACGG 

251 GCAAGGTATT AGACAAAATA CCCGGCGCGA TGGAGTTTGC CCAAAAAGTC 

301 GAACATAAAA TCAAAACCCT TGCCGAAGAA GCCGAACACG CCAAACAGTC 

351 ACTGTCTTTG TTTGAAAACT TCGGCTTCCG CTATACCGGC CCCGTGGACG 

4 01 GACACAACGT CGAAAATCTG GTCGATGTAT TGGAAGACCT GCGCGGACGC 

4 51 AAAGGCCCGC AGCTTCTGCA CGTCATCACC AAAAAGGGCA ACGGCTACAA 

501 ACTCGCCGAA AACGATCCCG TCAAATACCA CGCCGTCGCC AACCTGCCTA 

551 AAGAAAGCGC GGCGCAAATG CCGTCTGAAA AAGAACCCAA GCCCGCCGCC 

601 AAACCGACCT ATACCCAAGT GTTCGGCAAA TGGCTGTGCG ACCGGGCGGC 

651 GGCAGATTCC CGACTGGTTG CGATTACCCC CGCCATGCGC GAGGGCAGCG 

701 GCTTGGTTGA GTTTGAACAA CGATTCCCCG ACCGCTATTT CGATGTCGGC 

7 51 ATCGCCGAGC AGCACGCCGT TACCTTTGCC GGCGGTTTGG CTTGCGAAGG 

801 GATGAAGCCC GTCGTGGCGA TTTATTCCAC CTTTTTACAA CGCGCCTACG 

851 ACCAACTGGT GCACGACATC GCCCTGCAAA ACCTGCCCGT TTTGTTTGCC 

901 GTCGACCGCG CGGGCATCGT CGGCGCGGAC GGCCCGACCC ATGCCGGTTT 

951 GTACGATTTA AGCTTTTTGC GCTGCATTCC GAATATGATT GTCGCCGCGC 

1001 CGAGCGATGA AAATGAATGC CGCCTGCTGC TTTCGACCTG CTATCAGGCA 

1051 GACGCGCCCG CCGCCGTCCG CTATCCGCGC GGCACGGGTA CGGGCGTGCC 

1101 GGTTTCAGAC GGCATGGAAA CCGTGGAAAT CGGCAAGGGC ATTATCCGCC 

1151 GCGAAGGTGA GAAAACCGCA TTCATTGCCT TCGGCAGTAT GGTCGCCCCT 

1201 GCATTGGCGG TCGCCGGAAA ACTGAACGCC ACCGTCGCCG ATATGCGCTT 

1251 CGTCAAACCG ATAGACGAAG AGTTGATTGT CCGCCTTGCC CGAAGCCACG 

1301 ACCGCATCGT TACCCTTGAA GAAAACGCCG AACAGGGCGG CGCAGGCAGC 

1351 GCGGTGCTGG AAGTGTTGGC GAAACACGGC ATCTGCAAAC CCGTCTTGCT 

14 01 TTTGGGCGTT GCCGATACCG TAACCGGACA CGGCGATCCG AAAAAACTTT 

14 51 TAGACGATTT GGGCTTGAGT GCCGAAGCGG TGGAACGGCG TGTGCGCGCG 

1501 TGGCTGTCGG AT CGGGATGC GGCAAATTAA 

This corresponds to the amino acid sequence <SEQ ID 120; ORF 033.a>: 

a033 . pep 

1 MAAADKQLGS DRRSVAI IGD GAMTAGQAFE ALNCAGDMDV DLLWLNDNE 

51 MSISPNVGAL PKYLASNWR DMHGLLSTVK AQTGKVLDKI PGAMEFAQKV 

101 EHKIKTLAEE AEHAKQSLSL FENFGFRYTG PVDGHNVENL VDVLEDLRGR 

151 KGPQLLHVIT KKGNGYKLAE NDPVKYHAVA NLPKESAAQM PSEKEPKPAA 

201 KPTYTQVFGK WLCDRAAADS RLVAITPAMR EGSGLVEFEQ RFPDRYFDVG 

2 51 IAEQHAVTFA GGLACEGMKP WAIYSTFLQ RAYDQLVHD I ALQNLPVLFA 
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301 V DRAG I VG AD GPTHAGLYDL SFLRCIPNMI VAAPSDENEC RLLLSTCYQA 

351 DAPAAVRYPR GTGTGVPVSD GMETVEIGKG IIRREGEK TA FIAFGSMVAP 

4 01 ALAVAGKLNA TVADMRFVKP IDEELIVRLA RSHDRIVTLE ENAEQGGAGS 

4 51 AVLEVLAKHG ICKPVLLLGV ADTVTGHGDP KKLLDDLGLS AEAVERRVRA 

501 WLSDRDAAN* 

m033/a033 98.4% identity over a 509 aa overlap 

10 20 30 40 50 60 

m033 . pep MAAADKLLGSDRRSVAIIGDGAMTAGQAFEALNCAXDMDVDLLWLNDNEMSISPNVGAL 
MINI I ! I I I I I I I I I I I I I I I I I I II I I I M 1 I I I I I II M I I I I I I I I M I I I I I 
a0 3 3 MAAADKQLGS DRRSVAI IGDGAMTAGQAFEALNCAGDMDVDLLWLN DNEMS I S PNVGAL 

10 20 30 40 50 60 

70 80 90 100 110 120 

m033 .pep PKYLASNVVRDMHGLLSTVKAQTGKVLDKIPGAMEFAQKVEHKIKTLAEEAEHAKQSLSL 
I I I I I I I I I I I II I I M I M II I I I 11 I I I I I I I M II I I I I I I I I I I I I I I I I I M I I I 
a033 PK Y LAS NVVRDMHGLLSTVKAQTGKVLDK I PGAME FAQKVEHKIKTLAEEAEHAKQSLSL 

70 80 90 100 110 120 

130 140 150 160 170 180 

m033 . pep FENFGFRYTGPVDGHNVENLVDVLKDLRSRKGPQLLHVITKKGNGYKLAENDPVKYHAVA 
I M I I I M I II I II I I I I I I ! II I : I I I : II M I I I I I I I I I I II I M I I I I II I I I I ! I 
a033 FENFGFRYTGPVDGHNVENLVDVLEDLRGRKGPQLLHVITKKGNGYKLAENDPVKYHAVA 

130 140 150 160 170 180 

190 200 210 220 230 240 

m033 . pep NLPKESAAQMPSEKEPKPAAKPTYTQVFGKWLCDRAAADSRLVAITPAMREGSGLVEFEQ 

I I I I I I I I I I I I I I I I I I I I I I I I I I I M M I I I I I I I I I I I I I I I I M I I I I I I I I I I I 
a 033 NLPKESAAQMPSEKEPKPAAKPTYTQVFGKWLCDRAAADSRLVAITPAMREGSGLVE FEQ 

190 200 210 220 230 240 

250 260 270 280 290 300 

m033 .pep RFPDRYFDVGIAEQHAVTFAGGLACEGMKPWAIYSTFLQRAYDQLVHDIALQNLPVLFA 

II I I I I I I M I I I I I I M I I 1 I I M I I I I I I I I I I I I I 1 1 I I I M I I I I M 1 I 1 i I I I M 
a033 RFPDRYFDVGIAEQHAVTFAGGLACEGMKPWAIYSTFLQRAYDQLVHDIALQNLPVLFA 

250 260 270 280 290 300 

310 320 330 340 350 360 

m033 . pep VDRAGIVGADGPTHAGLYDLSFLRCVPNMIVAAPSDENECRLLLSTCYQADAPAAVRYPR 
I I I I 1 I I I I I I I I I I I I I I I I I I I I : I ! I I I II I I I I I I I I M I I I I I I 1 I II I t 1 I I I I 
a033 VDRAGIVGADGPTHAGLYDLSFLRCIPNMIVAAPSDENECRLLLSTCYQADAPAAVRYPR 

310 320 330 340 350 360 

370 380 390 400 410 420 

m033 .pep GTGTGAPVSDGMETVEIGKGIIRREGEKTAFIAFGSMVAPALAVAEKLNATVADMRFVKP 
I I I I I : I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I M I I I II I I I I I I I I I I I I I II I 
a033 GTGTGVPVSDGMETVEIGKGIIRREGEKTAFIAFGSMVAPALAVAGKLNATVADMRFVKP 

370 380 390 400 410 420 

430 440 450 460 470 480 

m033 . pep IDEELIVRLARSHDRIVTLEENAEQGGAGGAVLEVLAKHGICKPVLLLGVADTVTGHGDP 
I M I I I I M I I I M M I I I I I I I I I I I I I : I II I I I I I I I I I II I i I I I I I I I I I I 1 I I I 
a 033 IDEELIVRLARSHDRIVTLEENAEQGGAGSAVLEVLAKHGICKPVLLLGVADTVTGHGDP 

430 440 450 460 470 480 

490 500 510 

m033 . pep KKLLDDLGLSAEAVERRVRAWLSDRDAANX 
I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 
a033 KKLLDDLGLSAEAVERRVRAWLSDRDAANX 

490 500 510 
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Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 033 shows 98.4% identity over a 509 aa overlap with a predicted ORF (ORF 033.ng) 
from N. gonorrhoeae: 



m033/g033 






m033 . 


pep 


MAAADKLLGSDRRSVAI IGDGAMTAGQAFEALNCAXDMDVDLLWLNDNEMS I S PNVGAL 

1 I | | I I I | I i : I I I 1 1 1 1 1 1 1 t 1 1 1 1 1 1 1 1 1 1 i 1 II 1 1 II 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 
MAAADKLLGGDRRSVAI I GDGAMTAGQAFE ALNCAGDMDVDLLWLNDNEMS I S PNVGAL 


60 


g03 3 




60 


m03 3 


f c P 


PKYIJ^ijNVVRDMHGLLSTVKAQTGKVLDKI PGAMEFAQKVEHKI KTLAEEAEHAKQSLSL 

1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 L 1 1 1 1 1 i 1 1 1 1 1 i 1 1 1 1 1 1 E 1 1 1 1 1 1 1 1 1 1 E 1 1 1 1 1 1 1 1 1 1 E 1 1 1 

PKYLASNWRDMHGLLSTVKAQTGKVLDKI PGAMEFAQKVEHKI KTLAEEAEHAKQSLSL 


120 


g03 3 




120 


mm 1 

111 v J o ■ 


pep 


FENFGFRYTGPVDGHNVENLVDVLKDLRSRKGPQLLHVITKKGNGYKLAENDPVKYHAVA 

IIIIIIIIIIIMIMMIMIIIIIIIIMIIIIIMIIIIIIIIIMIIIMIIIIII 

FENFGFRYTGPVDGHNVENLVDVLKDLRSRKGPQLLHVITKKGNGYKIjAENDPVKYHAVA 


180 






180 


mm 1 

111 U J J . 


• P e P 


NLPKESAAQMPSEKEPKPAAKPTYTQVFGKWLCDRAAADSRLVAITPAMREGSGLVEFEQ 

M IIMMMM 1 IMMIMil nMMMMI MM I MMI 1 MM MMM MIM 

NLPKEGGAQMPSEKEPKPAAKPTYTQVFGKWLCDRAAADSRLVAITPAMREGSGLVEFEQ 


240 


g03 3 




240 


m03 3 
9033 


■ f e F 


R F PD R Y FD VG I AE QHAVT F AGG LAC E GM KP W A I Y S T FLQRA YDQL VHD I ALQN L P VL FA 

1 1 MIMMM 1 1 IMM II MMMMMMI II Ml II MMI 1 MM MMM Mill 

R F PDR YFDVG I AEQHAVT F AGGLACEGMKP WAI YS T FLQ RA YDQL VHD I ALQNL PVLF A 


300 
300 


m033 


.pep 


VDRAGIVGADGPTHAGLYDLSFLRCVPNMIVAAPSDENECRLLLSTCYQADAPAAVRYPR 

1 1 IMIMIII 1 i Mill IMM .MMI ill M 1 II MMMMIMI 1 IIIM Mill 

VDRAGIVGADGPTHAGLYDLSFLRCVPNMIVAAPSDENECRLLLSTCYQADAPAAVRYPR 


360 


g033 




360 


m033 


.pep 


GTGTGAPVSDGMETVE I GKG 1 1 RREGE KTAFI AFGSMVAPALAVAE KLNATVADMRFVKP 

I 1 IMIMIIM 1 IMM M MMMMI Ml M 1 II M 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

GTGTGAPVSDGMETVE I GKG 1 1 RREGEKTAF I AFGSMVATALAVAE KLNATVADMRFVKP 


420 


g033 




420 


m033 


•pep 


IDEELIVRLARSHDRIVTLEENAEQGGAGGAVLEVLAKHGICKPVLLLGVADTVTGHGDP 

: Ml: MM Ml 1 MMI MMMIIIMM 1 MMI MIMIIM i I MUM MM 

IDEELIVRLARSHDRIVTLEENAEQGGAGGAVLEVLAKHGICKPVLLLGVADTVTEHGDP 


480 


g033 




480 


m033 


•pep 


KKLLDDLGLSAEAVERRVRAWLSDRDAANX 510 

M Mill Ml 1 M MIM ; II IMIIII 

KKLLDDLGL S AEAVE RR VREW L PDRD AANX 510 




g033 







The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 121>: 

g034 . seq 

1 ATGAGCCGTT TATGGTTTTT TGCCGTAAAA AACATTATAA TCCGCCTTAT 

51 TTACCTATTG CCCAAGGAGA C AC AAATGG C ACTCGTATCC ATGCGCCAAC 

101 TGCTTGAC C A CGCCGCCGAA AACAGCTACG GCCTGCCCGC GTTCAACGTC 

151 AACAACCTCG AACAAATGCG CGCCATTATG GAAGCCGCCG ACCAAGTCAA 

201 CGCGCCCGTC ATCGTACAGG CGAGCGCAGG TGCGCGCAAA TACGcggGCG 

251 CGCCGTTTTT GCGCCACCTG ATTCTGGCGG CAGTCGAAGA ATTTCCGCAC 

301 ATCCCCGTCG TGATGCACCA AGACCACGGC GCATCGCCCG ACGTgtgCCA 

351 ACGCTCCATC CAACTGGGCT TCTCCTCCGT GATGATGGAC GGCTCTTTGC 

4 01 TCGAAGACGG CAAAACCCCT TCTTCTTACG AATACAACGT CAACGCCACC 
451 CGTACCGTCG TCAACTTCTC CCACGCCTGC GGCGTGTCCG TCGAAGGCGA 

5 01 AATCGGCGTA TTGGGCAACC TCGAAACCGG CGAAGCAGGC GAAGAAGACG 
5 51 GAGTGGGCGC GGCAGGCAAA CTCTCACACG ACCAAATGCT CACCAGCGTT 
601 GAAGATGCCG TGCGTTTCGT TAAAGATACC GGCGTTGACG CATTGGCGAT 
651 TGCCGTCGGC ACCAGCCACG GCGCATACAA ATTCACCCGT CCGCCCACAG 
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701 GCGACGTATT GCGTATCGAC C G CAT C AAGG AAATCCACCA AGCCCTGCCC 

751 AATACACACA TCGTGATGCA CGgctCCAGC TCCGTTCCGC AAGAatgGCT 

801 G AAAG T CAT C AACGAATACG GCGGCAATAT CGGCGAAACC TACGGCGTGC 

8 51 CGG TTGAAG A AATCGTCGAA GGCATCAAAC ACGGCGTGCG CAAAGTCAAC 

901 ATCGATACCG ACCTGCGCCT CGCTTCCACC GGCGCGGTAC GCCGCTACCT 

951 TGCCGAAAAC CCGTCCGACT TTGATCCGCG CAAATACTTG GGGAAAACCA 

10 01 TTGAAGCGAT GAAGCAAATC TGCCTCGACC GTTATCTTGC GTTCGGTTGC 

10 51 GAAGGTCAGG CAGGCAAAAT CAAACCTGTT TCGTTGGAAA AAATGGCAAG 

1101 CCGTTATGCC AAGGGCGAAT TGAAC CAAAT CGTCAAATAA 



This corresponds to the amino acid sequence <SEQ ID 122; ORF 034.ng>: 

g034 -pep 

1 MSRLWFFAVK NIIIRLIYLL PKETQMALVS MRQLLDHAA E NSYGLPAFNV 

51 NNLEQMRAIM EAADQVNAPV I VQ AS AGARK YAGAPFLRHL I LAAVEEFPH 

101 I P WMHQDHG ASPDVCQRSI QLGFSSVMMD GSLLEDGKTP SSYEYNVNAT 

151 RTWNFSHAC GVSVEGEIGV LGNLETGEAG EEDGVGAAGK LSHDQMLTSV 

201 EDAVRFVKDT GVDALAI AVG TSHGAYKFTR P PTGDVLR I D RIKEIHQALP 

251 NTHIVMHGSS SVPQEWLKVI NEYGGNIGET YGVPVEEIVE G I KHGVRKVN 

301 IDTDLRLAST GAVRRYLAEN PSDFDPRKYL GKTIEAMKQI CLDRYLAFGC 

351 EGQAGKIKPV SLEKMASRYA KGELNQIVK* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 123>: 

zn034.seq (partial) 

1 ATGAGCTGTT TATGGTTTTT TGCTGTAAAA AACATTATAA TCCGCCTTAT 

51 TTACCTATTG CCCAAGGAGA CACAAATGGC ACT CGT AT C C ATGCGCCAAC 

101 TGCTTGATCA TGCTGCCGAA wACAGCTACG GCyTGCCGGC GTTCAACGTC 

151 AACAACCTCG wACAGATGCG CGCCATCATG GAGGCTGCAG ACCAAGTCGA 

201 CGCCCCCGTC ATCGTACAGG CGAGTGCCGG TGCGCGCAAA TATGCGGGTG 

251 CGCCGTTTTT ACGCCACCTG ATTTTGGCGG CTGTCGAAGT ATTTCCACAC 

301 ATCCCCGTCG TCATGCACCA AGACCACGGC GCATCACCCG ACGTGTGCCA 

351 ACGCTCCATC CAACTGGGCT TCTCCTCTGT AATGATGGAC GGCTCGCTGA 

4 01 TGGAAGACGG CAAAACCCCT TCTTCTTACG AATACAACGT CAACGCCACA 

4 51 CGTACCGTGG TTAACTTCTC CCACGCTTGC GGCGTATCCG TTGAAGGCGA 

501 AATCGGCGTA TTGGGCAACC T CGAAAC CGG CGATGCAGGC GAAGAAG AC G 

551 GTGTAGGCGC AGTGGGCAAA CTTTCCCACG ACCAAATGCT GACCAGCGTC 

601 GAAGATGCCG TATGTTTCGT TAAAGATACC GGCGTTGACG C AT TGGCT AT 

6 51 TGCCGTCGGC ACCAGCCACG G CG CAT AC AA ATTCACCCGT CCGCCCACAG 

701 GCGATGTATT ACGTATCGAC CGGAT CAAAG AAATCCACCA AGCCCTGCCC 

751 AATACACACA TCGTGATGCA C. . . 



This corresponds to the amino acid sequence <SEQ ID 124; ORF 034>: 

m034.pep (partial) 

1 MSCLWFFAVK NIIIRLIYLL PKETQMALVS MRQLLDHA AE XSYGLPAFNV 

51 NNLXQMRAIM EAADQVDAPV I VQ AS AGARK YAGAPFLRHL ILAAVEVFPH 

101 I P WMHQDHG ASPDVCQRSI QLGFSSVMMD GSLMEDGKTP SSYEYNVNAT 

151 RTWNFSHAC GVSVEGEIGV LGNLETGDAG EEDGVGAVGK LSHDQMLTSV 

2 01 EDAVCFVKDT GVDALAI AVG TSHGAYKFTR P PTGDVLR ID RIKEIHQALP 

251 NTHIVMH. . . 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 125>: 

a034 . seq 

1 ATGAGCCGTT TATGGTTTTT TGCCGCAAAA AACATTATAA TCCGCCTTAT 

51 TTACCTATTG CCCAAGGAGA CACAAATGGC ACTCGTATCC ATGCGCCAAC 

101 TGCTTGATCA TGCTGCCGAA AACAGCTACG GCCTGCCCGC GTTCAACGTC 

151 AACAACCTCG AACAAATGCG CGCCATTATG GAAGCCGCCG ACCAAGTCAA 

201 CGCGCCCGTC ATCGTACAGG CGAGCGCAGG TGCGCGCAAA TACGCGGGCG 

2 51 CGCCGTTTTT GCGCCACCTG ATTTTGGCGG CTGTCGAAGA ATTTCCGCAC 

301 ATCCCCGTCG TGATGCACCA AGACCACGGC GCATCGCCCG ACGTGTGCCA 

351 ACGCTCCATC CAACTGGGCT TTTCCTCCGT GATGATGGAC GGCTCGCTGA 

4 01 TGGAAGACGG CAAAACCCCT TCTTCTTATG AATACAACGT CAACGCCACC 
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451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 



CGTACCGTGG 
AATCGGCGTA 
GTGTAGGCGC 
GAAGATGCCG 
TGCCGTCGGC 
GCGACGTGTT 
AATACACACA 
GAAAGTCATC 
CGGTTGAAGA 
ATCGATACCG 
TGCCGAAAAC 
TTGAAGCGAT 
GAAGGTCAGG 
CCGTTATGCC 



TTAATTTCTC 
TTGGGCAACC 
AGTGGGCAAA 
TGCGTTTCGT 
ACCAGCCACG 
GCGTATCGAC 
TCGTGATGCA 
AACGAATACG 
AATCGTCGAA 
ACTTGCGCCT 
CCGTCCGACT 
GAAGCAAATC 
CAGGCAAAAT 
AAGGGCGAAT 



CCACGCCTGC 
TCGAAACTGG 
CTTTCCCACG 
TAAAGATACC 
GCGCGTACAA 
CGCATCAAAG 
CGGCTCCAGC 
GCGGCAATAT 
GGCATCAAAC 
TGCTTCCACC 
TCGATCCGCG 
TGCCTCGACC 
CAAACCGGTT 
TGAACCAAAT 



GGCGTATCCG 
CGAAGCCGGC 
ACCAAATGCT 
GGCGTTGACG 
ATTCACCCGT 
AAATCCACCA 
TCCGTTCCGC 
CGGCGAAACC 
ACGGCGTGCG 
GGCGCGGTAC 
CAAATATTTG 
GCTACCTCGC 
TCCTTGGAAA 
CGTCAAATAA 



TTGAAGGCGA 
GAAGAAGACG 
CACCAGCGTC 
CATTGGCGAT 
CCGCCCACAG 
AGCCCTGCCC 
AAGAATGGCT 
TACGGCGTGC 
TAAAGTCAAC 
GCCGCTACCT 
AGCAAAACCA 
GTTCGGTTGC 
AAATGGCAAA 



This corresponds to the amino acid sequence <SEQ ID 126; ORF 034. a>: 

a034 . pep 

1 MSRLWFFAAK Nil IRLIYLL PKETQMALVS MRQLLDHA AE NSYGLPAFNV 

51 NNLEQMRAIM EAADQVNAPV IVQASAGARK YAGAPFLRHL ILAAVEEFPH 

101 IPWMHQDHG ASPDVCQRSI QLGFSSVMMD GSLMEDGKTP SSYEYNVNAT 

151 RTWNFSHAC GVSVEGEIGV LGNLETGEAG EEDGVGAVGK LSHDQMLTSV 

201 EDAVRFVKDT GVDALAIAVG TSHGAYKFTR PPTGDVLRID RIKEIHQALP 

251 NTHIVMHGSS SVPQEWLKVI NEYGGNIGET YGVPVEEIVE GIKHGVRKVN 

301 IDTDLRLAST GAVRRYLAEN PSDFDPRKYL SKTIEAMKQI CLDRYLAFGC 

351 EGQAGKIKPV SLEKMANRYA KGELNQIVK* 



m034/a034 96.9% identity over a 257 aa overlap 

10 20 30 40 50 60 

m0 34 .pep MSCLWFFAVKNII IRLIYLLPKETQMALVSMRQLLDHAAEXSYGLPAFNVNNLXQMRAIM 

I I I I I M : I I M I I I I I M i M I t I I I I 1 I M I I I f I I I I I I I M I II II I I I I I I I 
a03 4 MSRLWFFAAKNI I IRLI YLLPKETQMALVSMRQLLDHAAENSYGLPAFNVNNLEQMRAIM 

10 20 30 40 50 60 



70 80 90 100 110 120 

m034 . pep EAADQVDAPVIVQASAGARKYAGAPFLRHLILAAVEVFPHIPWMHQDHGASPDVCQRSI 
j I 1 I I I : I I 1 I I I i I I I ! I I I ! I M I I I I M I I I I I I I I I I I I I I I I II I I I I I M I I I 
a034 EAADQVNAPVI VQAS AGARKYAGAPFLRHLI LAAVEE FPH I PWMHQDHGAS PDVCQRS I 

70 80 90 100 110 120 



130 140 150 160 170 180 

m034 .pep QLGFSSVMMDGSLMEDGKTPSSYEYNVNATRTVVNFSHACGVSVEGEIGVLGNLETGDAG 
! I I I I M I I I I I I I I I 1 I I M M I I t I I I I I I 1! I I I II I I I I M I I I I II I M I I I : I I 
a034 QLGFSSVMMDGSLMEDGKTPSSYEYNVNATRTWNFSHACGVSVEGEIGVLGNLETGEAG 

130 140 150 160 170 180 



190 200 210 220 230 240 

m034 . pep EEDGVGAVGKLSHDQMLTSVEDAVCFVKDTGVDALAIAVGTSHGAYKFTRPPTGDVLRID 
1 M I i I I I I I M I I I I 1 I ! It I M I I I I I I t I I II I M I I I I I I M I I I I I i I I I I I I I 
a03 4 EEDGVGAVGKLSHDQMLTSVEDAVRFVKDTGVDALAIAVGTSHGAYKFTRPPTGDVLRID 

190 200 210 220 230 240 



250 

m034 . pep RIKEIHQALPNTHIVMH 
I I I I I M I I I I I I I I I I 

a034 RIKEIHQALPNTHIVMHGSSSVPQEWLKVINEYGGNIGETYGVPVEEIVEGIKHGVRKVN 

250 260 270 280 290 300 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 034 shows 96.5% identity over a 257 aa overlap with a predicted ORF (ORF 034.ng) 
from N. gonorrhoeae: 
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m034/g034 






m03 4 . 


.pep 


MSCLWFFAVKNI I IRLI YLLPKETQMALVSMRQLLDHAAEXSYGLPAFNVNNLXQMRAIM 

1 1 M 1 M 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i ! 1 1 1 1 1 1 1 IIIMIMMM MINI 

MSRLWFFAVKNI I IRLI YLLPKETQMALVSMRQLLDHAAENSYGLPAFNVNNLEQMRAIM 


60 


g034 




60 


m034 , 


•pep 


EAADQ VD AP V I VQ AS AGARK Y AG AP F LRH L I LAAVE V F PH I P WMHQDHGAS PDVCQRS I 

1 1 1 1 II : 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II I 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 II 1 M 1 1 M 1 1 1 1 1 
E AAD Q VN AP V I VQ AS AGAR K Y AG A P F LRH L I LAAVEE FPH I P WMHQDHGAS PDVCQRS I 


120 


g034 




120 


m034 , 


.pep 


QLGFS S VMMDGS LMEDGKTPS S YE YNVNATRTWNFSHACGVS VEGE I GVLGNLETGDAG 

IIIIIIIIIMIhlMIIIIIIIIMIIIIIIIIIIIIIMIMIIIIIIMIIIhll 

QLGFSSVMMDGSLLEDGKTPSSYEYNVNATRTVVNFSHACGVSVEGEIGVLGNLETGEAG 


180 


g034 




180 


m034 


.pep 


EEDGVGAVGKLSHDQMLTSVEDAVCFVKDTGVDALiAIAVGTSHGAYKFTRPPTGDVLRID 

llllllhllMliilllllllll IIIIIIIIIIIIIMIIIIIIMIIIIIIIIIIII 

EEDGVGAAGKLSHDQMLTSVEDAVRFVKDTGVDALAIAVGTSHGAYKFTRPPTGDVLRID 


240 


g034 




240 


m034 


.pep 


R I KE I HQ AL PNTH I VMH 

IIIIIIIIMIIIIIII 

RIKEIHQALPNTHIVMHGSSSVPQEWLKVINEYGGNIGETYGVPVEEIVEGIKHGVRKVN 


257 


g034 




300 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 127>: 

g036 . seq 

1 ATGCTGAAGC CGTGTTTGGT ATACAGTGCC TGTGCGGCGG cgttgcCTGC 

51 GCGGACTTCG AG C AG C AG G C GTTGCGTGCC TTCGGGCAGA TGTGCGTACC 

101 AATATTCGAG CAGGGCGGAC GCAACGCCCC GTCGGCGGCA TTCGGGCGCG 

151 GTGGCAATCA GGTGCAGTTC GGATTCGTCG GGCAGGTTCT GCCAAACGAT 

2 01 AAAGGCGGCA ATCCTGCCGT CTTTTTCCGC AAGGAAAACC TGTTCGGACG 

2 51 GCGAAACAAG CGCGGACTCA AATTGGCGTT GCGTCCACGC GGACGGGTTG 

301 CAGACGGTAT CGAGCGCGGC CAGTGCGGCG CAGTCGGACG GTGAGGCTGG 

351 GCGGATGTTC ATGTTCGTGC CTTCCGTTCC GCCTGTTCTT TGGCAGTCAG 

4 01 GGCGATTTTG TTGCGGACGT AGAGCAGTTC GGCGTGTGCC GCGCCAGTTG 

4 51 CGGGATAGCC GCCGCCGAGG GCGAGCGCGA GAAAATCGGC GGCGGTCGGC 

501 ATATCGGGTT TGCCTGAGAA GGGCGGACGG TTTTCCAGTG CGAACGCACT 

551 GCCGATGCCG TCTGAAAAGA CGTACCCCTC GGGGAGGGCA ATGTCTGCCG 

601 CCCTACCGAC TTGATAATCG CTCAAACGGC GGCGGTTCAG CGTGTCGAAC 

651 CACGCATAAA ACACTTCGCC CATACGCGCG TCCGCAGCGG CGAGTATGCA 

701 GCTTTGCGGC GGCGGCAGCG AGGCGGCGGC ATCGAGCGTG GGGATGCCGA 

7 51 TTAAAGGCGT GTCGAACGGC GTTGCCAAAC CTTGCGCCAC GCCGATGCCG 

8 01 ATACGCAGTC CGGTAA 



This corresponds to the amino acid sequence <SEQ ID 128; ORF 036.ng>: 

g036 .pep 

1 MLKPCLVYSA CAAAL PARTS SSRRCVPSGR CAYQYSSRAD ATPRRRHSGA 

51 VAIRCSSDSS GRFCQTIKAA ILPSFSARKT CSDGETSADS NWRCVHADGL 

101 QTVSSAASAA QSDGEAGRMF MFVPSVPPVL WQSGRFCCGR RAVRRVPRQL 

151 RDSRRRGRAR ENRRRSAYRV CLRRADGFPV RTHCRCRLKR RTPRGGQCLP 

201 PYRLDNRSNG GGSACRTTHK TLRPYARPQR RVCSFAAAAA RRRHRAWGCR 

2 51 LKACRTALPN LAPRRCRYAV R* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 129>: 

m036 . seq 

1 ATGCTGAAGC CGTGCGCCGT GTACAGTGCC TGTGCGGCGG TGTTGCCTGC 

51 ACGGACTTCG AGCAGCAGGC GTTGCGTGTC TTCGGGCAGA TGTGTGAACC 

101 AATATTCGAG CAGGGCGGAC GCAATTCCTT GGCGGCGGCA TTCGGGCGCG 

151 GTGGCAATCA GGTGCAGTTC GGATTCGTCG GGCAGGTTCT GCCAAACGAT 

201 AAAGGCGGCA ATCCCg . CGT CTTTTTCCGC AAGGAAAACC TGTTCGGACG 

2 51 GCGAAACCAG TGCGGACTCA AATTGGCGTT GCGTCCATGC GGACGGGTTG 

301 CAGACGGCAT CGAGTGCGGC CAGCTCCTCA CAATCGGCAC AAACGGCACG 
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351 GCGGATGTTC ACGGGCGCGC TCTCCGTTCG GCCTGTTCTT TGGCAGTCAG 

4 01 GGCGATTTTG TTGCGGACGT AGAGCAAACC GGCGTGTGCG GCATGGACGG 

451 CAGGATAACC GCCCTTGGCT GCCAATGCGA GAAAGTCGGC GGCAGTCGGC 

501 ATATCCGGTC TGCCTGAGAA CGGCGGAGCT TCTTCCAGCG CGAACGCGCT 

551 GCCTATGCCG TCTGAAAAGG CGCATCCCTC CGGCAGCCGG ATGTCTGCCG 

601 CCCGCCCGAC CTGATAATCG CTCAAACGGT GGCAGTTCAG CGTATCGAAC 

651 CATGCATAAA ACACTTCGCC CATACGAGCG TCCGTAGCGG CAAGGATGCA 

701 GCTTTGCGGC GGCGGCAGCG AGGCGGCGGC ATCGAGCGAG GGTACGCCGA 

7 51 TTAAGGGGGT ATCAAACGGC GTTGCCAAAC CCTGAGCTAC ACCGATGCCG 

801 ATACGCAGTC CGGTAA 



This corresponds to the amino acid sequence <SEQ ID 130; ORF 036>: 

m036 .pep 

1 MLKPCAVYSA CAAVLPARTS SSRRCVSSGR CVNQYSSRAD A I PWRRHSGA 

51 VAIRCSSDSS GRFCQTIKAA IPXSFSARKT CSDGETSADS NWRCVHADGL 

101 QTASSAASSS QSAQTARRMF TGALSVRPVL WQSGRFCCGR RANRRVRHGR 

151 QDNRPWLPMR ESRRQSAYPV CLRTAELLPA RTRCLCRLKR RIPPAAGCLP 

201 PARPDNRSNG GSSAYRTMHK TLRPYERP*R QGCSFAAAAA RRRHRARVRR 

251 LRGYQTALPN PELHRCRYAV R* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 131>: 

a03 6 . seq 

1 ATGCTGAAGC CGTGCGCCGT GTACAGTGCC TGTGCGGCGG TGTTGCCTGC 

51 ACGGACTTCG AGCAGCAGGC GTTGCGTGTC TTCGGGCAGA TGTGTGAACC 

101 AATATTCGAG CAGGGCGGAC GCAATTCCTT GGCGGCGGCA TTCGGGCGCG 

151 GTGGCAATCA GGTGCAGTTC GGATTCGTCG GGCAGGTTCT GCCAAACGAT 

201 AAAGGCGGCA ATCCCGCCGT CTTTTTCCGC AAGGAAAACC TGTTCGGACG 

251 GCGAAACCAG TGCGGACTCA AATTGGCGTT GCGTCCACGC GGACGGGTTG 

301 CAGACGGCAT CGAGCGCGGC GAGTGCGGCG CAATCGGCAT AAACGGCGCG 

351 GCGGATGTTC ACAGGCGCGC CCTCCGTTCC GCCTGTTCTT TGGCAGTCAA 

4 01 GGCGATTTTG TTGCGGACGT AGAGCAGCTC GGCGTGTGCC GCAGCGACGG 

4 51 CGGGAAAACC GCCTTCAGCC GCCAGATTGA GGAAGTCGGC GGCGGTCGGC 

501 ATATCGGGTT TGCCTGAGAA GGGCGGACGG TTTTCCAGCG CGAACGCATT 

551 GCCGATGCCG TCTGAAAAGG CGCATCCTTC CGGCAGCCGG ATGTCTGCCG 

601 CCCGACCGAC CTGATAATCG CTCAAACGGC GGCGGTTCAG CGTGTCGAAC 

651 CATGCATAAA ACACTTCGCC CATACGTGCG TCCGCAGCGG CAAGGATGCA 

701 GCTTTGCGGC GGCGGCAGCG AGGCGGCGGC ATCGAGCGAG GGTACGCCGA 

7 51 TTAAAGGAGT ATCAAACGGC GTTGCCAAAC CTTGCGCCAC GCCGATGCCG 

801 ATACGCAGTC CCGTAA 

This corresponds to the amino acid sequence <SEQ ID 132; ORF 036. a>: 

a036 . pep 

1 MLKPCAVYSA CAAVLPARTS SSRRCVSSGR CVNQYSSRAD AI PWRRHSGA 

51 VAIRCSSDSS GRFCQTIKAA IPPSFSARKT CSDGETSADS NWRCVHADGL 

101 QTASSAASAA QSA*TARRMF TGAPSVPPVL WQSRRFCCGR RAARRVPQRR 

151 RENRLQPPD* GSRRRSAYRV CLRRADGFPA RTHCRCRLKR RILPAAGCLP 

201 PDRPDNRSNG GGSACRTMHK TLRPYVRPQR QGCSFAAAAA RRRHRARVRR 

251 LKEYQTALPN LAPRRCRYAV P* 

m036/a036 85.6% identity over a 270 aa overlap 



10 20 30 40 50 60 

m036.pep MLKPCAVYSACAAVLPARTSSSRRCVSSGRCVNQYSSRADAIPWRRHSGAVAIRCSSDSS 
I I I 1 I I I I! I M I I M I I I I I M I I I I I I I 1 M I I I M I M I I I M I I I I I I I I I I I I I I 
a036 MLKPCAVYSACAAVL PARTS SSRRCVSSGRCVNQYSSRADAIPWRRHSGAVAIRCSSDSS 

10 20 30 40 50 60 



70 80 90 100 110 120 

m036.pep GRFCQTIKAAIPXSFSARKTCSDGETSADSNWRCVHADGLQTASSAASSSQSAQTARRMF 
I II I I I 1 I I I I I I I I I II I I II I I I I I I I I I I I M M I I I I I I I I I I : : I I I M M I I 
a036 GRFCQTIKAAIPPSFSARKTCSDGETSADSNWRCVHADGLQTASSAASAAQSAXTARRMF 

70 80 90 100 110 120 
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130 140 150 160 170 180 

m03 6 .pep TGALSVRPVLWQSGRFCCGRRANRRVRHGRQDNRPWLPMRESRRQSAYPVCLRTAELLPA 

Ml I j I I I I I I I I : I : : I I I I 1 I : M I I I I I I : : I I 

a03 6 TGAPSVPPVLWQSRRFCCGRRAARRVPQRRRENRLQPPDXGSRRRSAYRVCLRRADGFPA 

130 140 150 160 170 180 

190 200 210 220 230 240 

m036 . pep RTRCLCRLKRRIPPAAGCLPPARPDNRSNGGSSAYRTMHKTLRPYERPXRQGCSFAAAAA 
I I : ! I I I I I I I I I I I I I I I I I I I I I I I I : M I M I I I I I I I M I I I I I I I I 1 I ! 
a03 6 RTHCRCRLKRRILPAAGCLPPDRPDNRSNGGGSACRTMHKTLRPYVRPQRQGCSFAAAAA 

190 200 210 220 230 240 



250 260 270 

m03 6 . pep RRRHRARVRRLRG YQTALPN PELHRCRYAVRX 

I I M I I I I I I I : I 1 I I I t I : t I I I I I 
a03 6 RRRHRARVRRLKE YQTALPN LA PRRCRYAVPX 

250 260 270 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 036 shows 74.9% identity over a 271 aa overlap with a predicted ORF (ORF 036.ng) 
from N. gonorrhoeae: 

m036/g036 

10 20 30 40 50 60 

MLKPCAVYSACAAVLPARTSSSRRCVSSGRCVNQYSSRADAIPWRRHSGAVAIRCSSDSS 
Mill | I | | I M : I I I I I I I I I i I I I I I I : I I I I I I II I i M I I I I I I I I I I I I I 
MLKPCLVYSACAAALPARTSSSRRCVPSGRCAYQYSSRADATPRRRHSGAVAIRCSSDSS 
10 20 30 40 50 60 

70 80 90 100 110 120 

GRFCQT I KAAI PXS FS ARKTCS DGETS ADSNWRCVHADGLQT AS S AAS S SQSAQT ARRMF 

M I I I I I I M I I M I II I I I I 11 I I I I I I II I I I : I 11 I I :: I I I III 

GRFCQTIKAAILPSFSARKTCSDGETSADSNWRCVHADGLQTVSSAASAAQSDGEAGRMF 
70 80 90 100 110 120 



m036 . pep 
g036 

m036 . pep 
g036 



130 140 150 160 170 180 

m036 . pep TGALSVRPVLWQSGRFCCGRRANRRVRHGRQDNRPWLPMRESRRQSAYPVCLRTAELLPA 
: 11 I I I I I I I I I I I I I I I III : : I : I I I : II : I I I MM I : : I : 

g 036 MFVPSVPPVLWQSGRFCCGRRAVRRVPRQLRDSRRRGRARENRRRSAYRVCLRRADGFPV 

130 140 150 160 170 180 

190 200 210 220 230 240 

m036 .pep RTRCLCRLKRRIPPAAGCLPPARPDNRSNGGSSAYRTMHKTLRPYERPXRQGCSFAAAAA 
M : I I M M I I : : II I I I II M II I M I II I I M I M II 1 : II I I I I M 
g03 6 RTHCRCRLKRRTPRGGQCLPPYRLDNRSNGGGSACRTTHKTLRPYARPQRRVCSFAAAAA 

190 200 210 220 230 240 



250 260 270 

m03 6 . pep RRRHRARVRRLRG YQTALPN PELHRCRYAVRX 

I I II II II:: : II I I I : I I II M II 
g036 RRRHRAWGCRLKACRTALPNLAPRRCRYAVRX 

250 260 270 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 133>: 

m036-l . seq 

1 ATGCTGAAGC CGTGCGCCGT GTACAGTGCC TGTGCGGCGG TGTTGCCTGC 

51 ACGGACTTCG AGCAGCAGGC GTTGCGTGTC TTCGGGCAGA TGTGTGAACC 

101 AATATTCGAG CAGGGCGGAC GCAATTCCTT GGCGGCGGCA TTCGGGCGCG 

151 GTGGCAATCA GGTGCAGTTC GGATTCGTCG GGCAGGTTCT GCCAAACGAT 

2 01 AAAGGCGGCA ATCCCGCCGT CTTTTTCCGC AAGGAAAACC TGTTCGGACG 

2 51 GCGAAACCAG TGCGGACTCA AATTGGCGTT GCGTCCATGC GGACGGGTTG 

301 CAGACGGCAT CGAGTGCGGC CAGCTCCTCA CAATCGGCAC AAACGGCACG 
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3 51 GCGGATGTTC ACGGGCGCGC TCTCCGTTCG GCCTGTTCTT TGGCAGTCAG 

4 01 GGCGATTTTG TTGCGGACGT AGAGCAAACC GGCGTGTGCG GCATGGACGG 
4 51 CAGGATAACC GCCCTTGGCT GCCAATGCGA GAAAGTCGGC GGCAGTCGGC 
501 ATATCCGGTC TGCCTGAGAA CGGCGGAGCT TCTTCCAGGG CGAACGCGCT 
551 GCCTATGCCG TCTGAAAAGG CGCATCCCTC CGGCAGCCGG ATGTCTGCCG 
601 CCCGCCCGAC CTGATAATCG CTCAAACGGT GGCAGTTCAG CGTATCGAAC 
651 CAT G CAT AAA ACACTTCGCC CATACGAGCG TCCGTAG 

This corresponds to the amino acid sequence <SEQ ID 134; ORF 0036-1 >: 

m03 6- 1 . pep 

1 MLKPCAVYSA CAAVL PARTS SSRRCVSSGR CVNQYSSRAD AI PWRRHSGA 

51 VAIRCSSDSS GRFCQTIKAA IPPSFSARKT CSDGETSADS NWRCVHADGL 

101 QTASSAASSS QSAQTARRMF TGALSVR PVL WQSGRFCCGR RANRRVRHGR 

151 QDNRPWLPMR ESRRQSAYPV CLRTAELLPA RTRCLCRLKR RIPPAAGCLP 

201 PARPDNRSNG GSSAYRTMHK TLRPYERP* 

m036-i/g036 76.8% identity in 228 aa overlap 

10 20 30 40 50 60 

MLKPCAVYSACAAVLPARTSSSRRCVSSGRCVNQYSSRADAIPWRRHSGAVAIRCSSDSS 
I I ! i 1 M I I I II : I I I I I I 1 I ! 1 M MM: I I I I I M I I M I I II M M I II I I I 
MLKPCLVYSACAAALPARTSSSRRCVPSGRCAYQYSSRADATPRRRHSGAVAIRCSSDSS 
10 20 30 40 50 60 

70 80 90 100 110 120 

GRFCQTIKAAIPPSFSARKTCSDGETSADSNWRCVHADGLQTASSAASSSQSAQTARRMF 
I M M I I M I I I I M I II I I I M M I M II M I I I M II M : M M I : : M I Ml 
GRFCQTIKAAILPSFSARKTCSDGETSADSNWRCVHADGLQTVSSAASAAQSDGEAGRMF 
70 80 90 100 110 120 

130 140 150 160 170 180 

TGALSVRPVLWQSGRFCCGRRANRRVRHGRQDNRPWLPMRESRRQSAY PVCLRTAELLPA 
: II I I I II M I I I I M I I III : MM I I M I M I I I I I I I : : I : 

MFVPSVPPVLWQSGRFCCGRRAVRRVPRQLRDSRRRGRARENRRRSAYRVCLRRADGFPV 
130 140 150 160 170 180 

190 200 210 220 229 

RTRCLCRLKRRIPPAAGCLPPARPDNRSNGGSSAYRTMHKTLRPYERPX 
t I M I I I I II I : : I I I I I It I I M I M I II I I I I II I II 
RTHCRCRLKRRTPRGGQCLPPYRLDNRSNGGGSACRTTHKTLRPYARPQRRVCSFAAAAA 
190 200 210 220 230 240 

RRRHRAWGCRLKACRTALPNLAPRRCRYAVRX 
250 260 270 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ED 135>: 

g038 . seq 

1 ATGACTGATT TCCGCCAAGA TTTCCTCAAA TTCTCCCTCG CCCAAAATGT 

51 TTTGAAATTC GGCGAATTTA CCACCAAAGC CGGACGGCGG TCGCCCTATT 

101 TCTTCAATGC CGGCCTCTTC AACGACGGCG CGTCCACGCT GCAACTGGCA 

151 AAATTCTATG CACAATCCAT CATTGAAAGC GGCATCCGAT TCGATATGCT 

201 GTTCGGCCCC GCCTACAAAG GCATTATTTT GGCGGCGGCA ACCGCGATGA 

251 TGCTGGCGGA AAAAGGCGTG AACGTCCCGT TTGCCTACAA CCGCAAAGAA 

301 GCCAAAGACC GCGGCGAAGG CGGCGTGTTG GTCGGCGCGC CGCTTAAAGG 

351 GCGCGTGCTG ATTATCGACG ACGTGATTTC CGCCGGCACA TCCGTACGCG 

401 AATCAATCAA ACTGATTGAA GCGGAGGGTG CAACCCCCGC CGGTGTCGCC 

4 51 ATCGCGCTCG AC CGCATGGA AAAAGGCACG GGTAAATTGT CCGCCGTTCA 

501 GGAAGTGGAA AAACAAT AC G GCCTGCCCGT CGCCCCCATC GCCAGCCTGA 

551 ACGATTTGTT TATCCTGTTG CAAAACAACC CCGAATTCGG ACAGTTCCTC 

601 GAACCCGTCC GCACCTACCG C C GGCAGT AC GGCGTAGAAT AA 



m03 6- 1 . pep 
g036 

m03 6- 1 . pep 

g036 

m03 6-1 . pep 
g036 

m03 6-1 . pep 
g036 

g036 
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This corresponds to the amino acid sequence <SEQ ID 136; ORF 038. ng>: 

g038.pep 

1 MTDFRQDFLK FSLAQNVLKF GEFTTKAGRR SPYFFNAGLF NDGASTLQLA 

51 KFYAQSIIES GIRFDMLFGP AYKGIILAAA TAMMLAEKGV NVPFAYNRKE 

101 AKDRGEGGVL VGAPLKGRVL IIDDVTSAGT SVRESIKLIE AEGATPAGVA 

151 IALDRMEKGT GKLSAVQEVE KQY GLFVAPI ASLNDLFILL QNNPE FGQFL 

201 EPVRTYRRQY GVE* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 137>: 

m038 . seq 

1 ATGACCGATT TCCGCCAAGA TTTCCTCAAA TTCTCCCTCG CCCAAAATGT 

51 TTTGAAATTC GGCGAATTTA CCACCAAGGC AGGACGGCGG TCGCCCTATT 

101 TCTTCAATGC CGGCCTCTTT AACGACGGCT TGTCCACGCT GCAACTGGCA 

151 AAATTTTACG CACAAT C CAT CATTGAAAGC GGCATCCGAT TCGATATGCT 

201 GTTCGGTCCC GCCTACAAAG GCATTATTTT GGCGGCGGCA ACCGCGATGA 

251 TGCTGGCGGA AAAAGGCGTG AACGTCCCGT TTGCCTACAA CCGCAAAGAA 

301 GCCAAAGACC ACGGCGAAGG CGGCGTGTTG GTCGGCGCGC CGCTTAAAGG 

351 GCGCGTGCTG ATTATC GACG ACGTGATTTC CGCCGGCACA TCCGTACGCG 

401 AATCGATCAA ACTGATTGAA GCGGAGGGTG CAACCCCcGC CGGTGTCGCC 

451 ATCGCGCTCG AT C GCATGGA AAAAGGCACG GGTGAATTGA GCGCGGTTCA 

501 GGAAGTGGAr AAACAAT AC G GkCTGCCCGT CGCCCCCATC GCCAGCCTGA 

551 ACGATTTGTT TATTCTGTTG CAAAACAACC CCGAATTCGG ACAGTTCCTC 

601 GAACCCGTCC GAGCCTACCG TCGGCAGTAC GGCGTAGAAT AA 

This corresponds to the amino acid sequence <SEQ ID 138; ORF 038>: 

m038 .pep 

1 MTDFRQDFLK FSLAQNVLKF GEFTTKAGRR SPYFFNAGLF NDGLSTLQLA 

51 KFYAQSIIES GIRFDMLFGP AYKGIILAAA TAMMLAEKGV NVPFAYNRKE 

101 AKDHGEGGVL VGAPLKGRVL IIDDVTSAGT SVRESIKLIE AEGATPAGVA 

151 IALDRMEKGT GELSAVQEVE KQY GLPVAPI ASLNDLFILL QNNPE FGQFL 

201 EPVRAYRRQY GVE* 

The following partial DNA sequence was identified in N, meningitidis <SEQ ID 139>: 

a038 . seq 

1 ATGACCGATT TCCGCCAAGA TTTCCTCAAA TTCTCCCTCG CCCAAAATGT 

51 TTTGAAATTC GGCGAATTCA CCACCAAAGC CGGACGGCGG TCGCCCTATT 

101 TCTTCAATGC CGGCCTCTTT AACGACGGCT TGTCCACGCT GCAACTGGCA 

151 AAATTTTACG CACAATCCAT CATTGAAAGC GGCATCCGAT TCGATATGCT 

201 GTTCGGCCCC GCCTACAAAG GCATTATTTT GGCGGCGGCA ACCGCGATGA 

2 51 TGCTGGCGGA AAAAGGCGTG AACGTCCCGT TTGCCTACAA CCGCAAAGAA 

301 GCCAAAGACC ACGGCGAAGG CGGCGTGTTG GTCGGCGCGC CGCTTAAAGG 

351 GCGCGTGCTG ATTATCGACG ACGTGATTTC CGCCGGCACA TCCGTACGCG 

4 01 AATCGATCAA ACTGATTGAA GCGGAGGGTG CAACCCCCGC CGGTGTCGCC 

4 51 ATCGCGCTCG ACC GCATGGA AAAAGGCACG GGTGAATTGA GCGCGGTTCA 

501 GGAAGTGGAA AAACAATACG GCCTGCCCGT CGCCCCCATC GCCAGCCTGA 

551 ACGATTTGTT TATTCTGTTG CAAAACAACC CCGAATTCGG ACAGTTCCTC 

601 GAACCCGTCC GAGCCTACCG TCGGCAGTAC GGCGTAGAAT AA 

This corresponds to the amino acid sequence <SEQ ID 140; ORF 038. a>: 

a038 .pep 

1 MTDFRQDFLK FSLAQNVLKF GEFTTKAGRR SPYFFNAGLF NDGLSTLQLA 

51 KFYAQSIIES GIRFDMLFGP AYKGIILAAA TAMMLAEKGV NVPFAYNRKE 

101 AKDHGEGGVL VGAPLKGRVL IIDDVISAGT SVRESIKLIE AEGATPAGVA 

151 IALDRMEKGT GELSAVQEVE KQY GLPVAPI ASLNDLFILL QNNPE FGQFL 

201 EPVRAYRRQY GVE* 

m038/a038 100.0% identity over a 213 aa overlap 



10 20 30 40 50 60 

m0 38 .pep MTDFRQDFLKFSLAQNVLKFGEFTTKAGRRSPYFFNAGLFNDGLSTLQLAKFYAQSI IES 
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I I M I f I I I I I I I I I I I i I I I I I I I I I I 1 I II I M ! I I I I ! I I I I I I I I M I I I I I I 1 I I 
a038 MTDFRQDFLKFSLAQNVLKFGEFTTKAGRRSPYFFNAGLFNDGLSTLQLAKFYAQSIIES 

10 20 30 40 50 60 

70 80 90 100 110 120 

m03 8 .pep GIRFDMLFGPAYKGIILAAATAMMLAEKGVNVPFAYNRKEAKDHGEGGVLVGAPLKGRVL 
I I I I M 1 I I I I I I M I I I II I M I I I I II I M I I I I I I I I I I I II i I II M I I II M M I 
a038 GIRFDMLFGPAYKGIILAAATAMMLAEKGVNVPFAYNRKEAKDHGEGGVLVGAPLKGRVL 

70 80 90 100 110 120 

130 140 150 160 170 180 

m038 . pep IIDDVISAGTSVRESIKLIEAEGATPAGVAIALDRMEKGTGELSAVQEVEKQYGLPVAPI 
I I I I M I I II M I I I I I I I I I I I I I I I I I f I II I I I M I II t I M I I I 1 II I I I I i I 1 M 
a038 IIDDVISAGTSVRESIKLIEAEGATPAGVAIALDRMEKGTGELSAVQEVEKQYGLPVAPI 

130 140 150 160 170 180 

190 200 210 

m038 . pep ASLNDLFILLQNNPEFGQFLEPVRAYRRQYGVEX 
I I II It I 1 M I M M 1 I M 1 1 t M I I M M M II 
a03 8 ASLNDLFILLQNNPEFGQFLEPVRAYRRQYGVEX 

190 200 210 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from TV, gonorrhoeae 

ORF 038 shows 98.1% identity over a 213 aa overlap with a predicted ORF (ORF 038.ng) 
from TV. gonorrhoeae: 

m038/g038 

10 20 30 40 50 60 

m038 .pep MTDFRQDFLKFSLAQNVLKFGEFTTKAGRRSPYFFNAGLFNDGLSTLQLAKFYAQSIIES 
I I I I I I I I II I M I I I I I I I I I I I I I I I I M I I I I I I I I I I II I I I I I I I I I I I M I I f 
g038 MTDFRQDFLKFSLAQNVLKFGEFTTKAGRRSPYFFNAGLFNDGASTLQLAKFYAQSIIES 

10 20 30 40 50 60 

70 80 90 100 110 . 120 

m036 . pep GIRFDMLFGPAYKGIILAAATAMMLAEKGVNVPFAYNRKEAKDHGEGGVLVGAPLKGRVL 
I M I I I I I I I I I i I I I I M M I I I I I I I I I I I I I I I I I I I I I t : I I M M I I I I I M I I I 

g038 GIRFDMLFGPAYKGI ILAAATAMMLAEKGVNVPFAYNRKEAKDRGEGGVLVGAPLKGRVL 

70 80 90 100 110 120 

130 140 150 160 170 180 

m038 . pep IIDDVISAGTSVRESIKLIEAEGATPAGVAIALDRMEKGTGELSAVQEVEKQYGLPVAPI 

I 1 t I I I I I I M I t I I I I I 1 1 I I I I I I I I ! I I I I ) I 1 II I I I : I I I M I ! I I I I I I I 1 I 1 I 
g03 8 IIDDVISAGTSVRESIKLIEAEGAT PAGVAIALDRMEKGTGKLSAVQEVEKQYGLPVAPI 

130 140 150 160 170 180 

190 200 210 

m038 . pep ASLNDLFI LLQNNPE FGQFLE PVRAYRRQYGVEX 

II I I I I I ! I I II I I I I I I I I I I I I : I I II I I I I I 
g038 ASLNDLFI LLQNNPE FGQFLE PVRTYRRQYGVEX 

190 200 210 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 141>: 

g039 . seq 

1 ATGCCGTCCG AACCACCTGC CGCTTCAGAC GGCATCAAAC CGACACACAC 

51 CGAGAAAACA TCATGCCCGC CTGTTTCTGT CCGCACTGCA AAACCCGCCT 

101 CTGGGTCAAA GAAAcccagC TCAAcgtCgC ccaagGCTTC GTCGTCTgcc 

151 aaAAAtgcga agGGCTgttt aaAgccaaaG accAtctggc aaGcacGAAA 

201 gaacctatat tcaacgattg gcccgaagct gtttcgggat gTcaaaCTCG 
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251 TCcaccgcaT cggcacgcac gccattagca aGAaacagat gtcccgcgac 

3 01 gaaatCgccg atatcctcaa cggcggtaca acCCTGCACG ATACGCCGCC 

3 51 CGCAACCGCC GCTGCCGCac ctGCCGCCGC ACCGCaggTT TCCGTACCGC 
401 CCGCCCGTCA GGAAGGGCTC AACTGGACTA TTGCAACCCT GTTCGCACTT 

4 51 ATCGTCCTCA TTATGCAGCT TTCCTACCTC TTCATCCTAT GA 



This corresponds to the amino acid sequence <SEQ ID 142; ORF 039.ng>: 

g0 3 9 .pep 

1 MPSEPPAASD GIKPTHTEKT SCPPVSVRTA KPASGSKKPS STSPKASSSA 

51 KNAKGCLKPK TIWQARKNLY STIGPKLFRD VKLVHRIGTH AISKKQMSRD 

101 EIADILNGGT TLHDTPPATA AAAPAAAPQV SVPPARQEGL NWTIATLFAL 

151 IVLIMQLSYL FIL* 



The following partial DNA sequence 



m03 9 . seq 






1 


ATGCCGTCCG 


AACCGC CTT A 


51 


CGAGGAAATA 


CCATGCCCGC 


101 


CTGGGTCAAA 


GAAACCCAAC 


151 


nnnnnnnnnn 


nnnnnnnnnn 


201 


nnnnnnnnnn 


nnnnnnnnnn 


251 


CGGATGTCAA 


ACTCGTTCAC 


301 


CAGATTT C C C 


GTGACGAAAT 


351 


GCCCGATATT 


CCGCCCGCAA 


401 


TTACCGTACC 


GCCCGCCGCG 


451 


ATTGCAACCC 


TGTTTGCCCT 


501 


CGTCATCCTA 


TGA 



identified in N. meningitidis <SEQ ID 143>: 

CGCCTCAGAC GG CAT C AAAC CTGACACACA 
CTGTTTCTGC CCCCACTGCA AAACCCGTCT 
TCAATGTCGC CGnnnnnnnn nnnnnnnnnn 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
nnnnnnnnnn nnnnnnnCCC GAGGCTGTTT 
CGTATCGGCA CGCGCGCCAT CGGCAAGAAA 
CGCCGGCATC CTCAACGGCG GTACAACCCA 
CCGCCGCCAC CCCTGCTGCC GCACCGCAGG 
CCCGCCCGTC AGGATGGGTT CAACTGGACG 
TATCGTCCTC ATTATGCAGC TTTCCTACCT 



This corresponds to the amino acid sequence <SEQ ID 144; ORF 039>: 

m03 9.pep 

1 MPSEPPYASD GIKPDTHEEI PCPPVSAPTA KPVSGSKKPN SMSPXXXXXX 

51 XXXXXXXXXX XXXXXXXXXX XXXXXXXXXP EAVSDVKLVH RIGTRAIGKK 

101 QISRDEIAGI LNGGTTQPDI PPATAATPAA APQVTVPPAA PARQDGFNWT 

151 IATLFALIVL IMQLSYLVIL * 



The following partial DNA sequence was identified in AT. meningitidis <SEQ ID 145>: 

a039 . seq 

1 ATGCCGTCTG AACCGCCTTA CGCCTCAGAC GGCATCAAAC CTGACACACA 

51 CGAGGAAATA CCATGCCCGC CTGTTTCTGC CCCCACTGCA AAACCCGTCT 

101 CTGGGTCAAA GAAACCCAAC TCAATGTCGC CCAAGGCTTC GTCGTCTGCC 

151 AAAAAT GCG A AGGAATGTTT AAAGCCAAAG ACCATCTGGC AAG C ACG AAA 

201 GAACCCATAT TCAACGATT - TGCCCGAAGC TGTTTCGGAT GTCAAACTCG 

251 TTCACCGCAT CGGCACGAGC GCCATCGGCA AGAAACAGAT TTCCCGTGAC 

301 GAAATCGCCG GCATCCTCAA CGGCGGCACA ACCCAGCCCG ATATTCCGCC 

351 CGCAACCGCC GCCACCCCTG CTGCCGCACC GCAGGTTACC GTACCGCCCG 

4 01 CCGCGCCCGC CCGTCAGGAT GGGTTCAACT GGACGATTGC AACCCTGTTT 

4 51 GCCCTTATCG TCCTCATTAT GCAGCTTTCC TACCTCGTCA TCCTATGA 

This corresponds to the amino acid sequence <SEQ ID 146; ORF 039. a>: 

a03 9 . pep 

1 MPSEPPYASD GIKPDTHEEI PCPPVSAPTA KPVSGSKKPN SMSPKASSSA 

51 KNAKECLKPK TIWQARKNPY STIXPEAVSD VKLVHRIGTS AIGKKQISRD 

101 EIAGILNGGT TQPDIPPATA ATPAAAPQVT VPPAAPARQD GFNWTIA TLF 

151 ALIVLIMQLS YLVIL* 



mo39/a039 79.4% identity over a 170 aa overlap 



10 20 30 40 50 60 

m039 .pep MPSEPPYASDGIKPDTHEEIPCPPVSAPTAKPVSGSKKPNSMSPXXXXXXXXXXXXXXXX 

i I I I I I I I I I! I I M I I I I I I I I I I I I I I I I I I I I I M I II I M 
a03 9 MPSEPPYASDGIKPDTHEEIPCPPVSAPTAKPVSGSKKPNSMSPKASSSAKNAKECLKPK 
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10 20 30 40 50 60 

70 80 90 100 110 120 

m03 9 . pep XXXXXXXXXXXXXXXXXXXPEAVSDVKLVHRIGTRAIGKKQISRDEIAGILNGGTTQPDI 
: : ) 1 I I I I I I 1 1 I I I II I I I I I ! ! M I j I I I I I I M I M I I I I 

a 0 3 9 TIWQARKNPYSTIX PEAVSDVKLVHRIGTSAIGKKQISRDEIAGILNGGTTQPDI 

70 80 90 100 110 

130 140 150 160 170 

m03 9 . pep PPATAAT PAAAPQVTVPPAAPARQDGFNWTIATLFALIVLIMQLSYLVILX 

I M I i I I I I I i I I I I I I I I I I I i I M I I! I I I M M I I M I I I I I I I I I I I 
a 03 9 PPATAATPAAAPQVTVPPAAPARQDGFNWTIATLFALIVLIMQLSYLVILX 
120 130 140 150 160 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from TV. gonorrhoeae 

ORF 039 shows 60.8% identity over a 171 aa overlap with a predicted ORF (ORF 039. ng) 
from TV. gonorrhoeae: 

m039/g039 

10 20 30 40 50 60 

MPSEPPYASDGIKPDTHEEIPCPPVSAPTAKPVSGSKKPNSMSPXXXXXXXXXXXXXXXX 

IIMM Mlllll I MMh MMMMIMM M 

MPSEPPAASDGIKPTHTEKTSCPPVSVRTAKPASGSKKPSSTSPKASSSAKNAKGCLKPK 
10 20 30 40 50 60 



m03 9 . pep 
9039 



70 80 90 100 110 120 

m03 9 . pep XXXXXXXXXXXXXXXXXXXPEAVSDVKLVHRIGTRAIGKKQISRDEIAGILNGGTTQPDI 

h M I M I I I I h M M I h I I I I I 1 MMM1 I 

gO 3 9 TIWQARKNLYSTIG PKLFRDVKLVHRIGTHAISKKQMSRDEIADILNGGTTLHDT 

70 80 90 100 110 



130 140 150 160 170 

m03 9 pep PPATAAT- PAAAPQVTVPPAAPARQDGFNWTIATLFAL,IVLIMQLSYLVILX 

llllll: 1 1 I M 1 1 M 1 1 1 M M M 1 1 M I I M I M M I I M M III 

g03 9 P P AT AAAAP AAAPQVS VP P A RQEGLNWT I ATLFALI VLI MQLS YLF I LX 

120 130 140 150 160 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 147>: 

g040 . seq 

1 ATGAACGCGC CCGACAGCTT TGTCGCCCAC TTCCGCGAAG CCGCCCCCTA 

51 CATCCGCCAA ATGCGCGGCA CGACACTGGT CGCCGGCATA GAcggCCGCC 

101 TGCTCGAAGG CGGCACCTTA AATAAGCTCG CCGCCGACAT CGGGCTGTTG 

151 TCGCAACTGG GCATCCGACT CGTCCTCATC CACGGCGCGT ACCACTTCCT 

201 CGAccgCCTC GCCGCCGCGC AAGgccGCAC GCCGCATTAT TGCCGgggtt 

251 tGCGCGTTAC CGACGaAACc tcGctcgGAC AGGCGCAGCA GtttGCCGGC 

301 AccgTCCGCA GCCGTTTTGA agcCGCATTG tgcggcagCG tttcaggatt 

351 cgcgCGCGCG CCTTCCGTCC CGCTCGTAtc gggcaacttc ctgacCGCCC 

4 01 GTCcgatggg cgtgattgac ggaACCGata tggaatacgc gggggttatc 

4 51 cgcaaaaccg ACACCGCCGC CCTCCGTTTC CAACTCGACG CGGGCAATAT 

501 CGTCTGGATG CCGCCGCTCG GGCATTCCTA CGGCGGCAAA ACCTTCAATC 

551 TCGATATGGT GCAGGCCGCC GCTTCCGTCG CCGTCTCGCT TCAGGCCGAA 

601 AAACTCGTTT ACCTGACCCT TTCAGACGGC ATTTCCCGCC CCGACGGCAC 

651 GCTCGCCGAA ACCCTCTCGG CACAGGAAGC GCAATCGCTG GCGGAACACG 

7 01 CCGCCAGCGA AACCCGACGA CTGATTTCGT CCGCCGTTGC CGCGCTCGAA 

7 51 GGCGGCGTGC ATCGCGTCCA AATCCTCAAC GGGGCCGCCG ACGGCAGCCT 

8 01 GCTGCAAGAA CTCTTCACCC GCAACGGCAT CGGCACGTCC ATTGCCAAAG 
8 51 AAGCCTTCGT CTCCATCCGG CAGGCGCACA GCGGCGACAT CCCGCACATC 
901 GCCGCCCTCA TCCGCCCGCT GGAAGAACAG GGCGTCCTAT TGCACCGCAG 
951 CCGCGAATAC CTCGAAAACC ACATTTCCGA ATTTTCCATC CTCGAACACG 
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1001 ACGGCGACCT GTACGGCTGT 

1051 TGCGGCGAAA TCGCCTGCCT 

1101 ctACGGCGAA CGCCTGCTTG 

1151 GCATAAGCAG GCTGTTCGCA 

1201 GAACGCGGCT TTCAGACGGC 

1251 CAAAGACTAC CGCAGCAACG 

1301 TGCACCGCTG A 



GCCGCACTCA AAACCTTTGC CGAAGCCGAT 

TGCCGTCTCG CCGCAGGCAC AGGACGGCGg 

CCCACATTAT CGATAAGGCG CGCGGCATAG 

CTGTCCACAA ATACCGGCGA ATGGTTTGCC 

ATCGGAAGAC GAGCTGCCCG AAACGCGGCG 

GACGAAACCC GCATATTCTG GTGCGTCGCC 



This corresponds to the amino acid sequence <SEQ ID 148; ORF 040.ng>: 

g040 .pep 

1 MNAPDSFVAH FREAAPYIRQ MRGTTLVAGI DGRLLEGGTL NKLAADIGLL 

51 SQLGIRLVLI HGAYHFLDRL AAAQGRTPHY CRGLRVTDET SLGQAQQFAG 

101 TVRSRFEAAL CGSVSGFARA PSVPLVSGNF LTARPMGVID GTDMEYAGVI 

151 RKTDTAALRF QLDAGNIVWM PPLGHSYGGK TFNLDMVQAA ASVAVSLQAE 

201 KLVYLTLSDG ISRPDGTLAE TLSAQEAQSL AEHAASETRR LISSAVAALE 

251 GGVHRVQILN GAADGSLLQE LFTRNGIGTS IAKEAFVSIR QAHSGDIPHI 

301 AALIRPLEEQ GVLLHRSREY LENHISEFSI LEHDGDLYGC AALKTFAEAD 

351 CGEIACLAVS PQAQDGGYGE RLLAHIIDKA RGIGISRLFA LSTNTGEW FA 

4 01 ERGFQTASED ELPETRRKDY RSNGRNPHIL VRRLHR* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 149>: 

m040 . seq 

1 ATGAGCGCGC CCGACCTCTT TGTCGCCCAC TTCCGCGAAG CCGTCCCCTA 

51 CATCCGCCAA ATGCGCGGCA AAACGCTGGT CGCCGGCATA GACGACCGCC 

101 TGCTCGAAGG TGATACCTTA AACAAGCTCG CCGCCGACAT CGGGCTGTTG 

151 TCGCAACTGG GCATCAGGCT CGTCCTCATC CACGGCGCGC GCCACTTCCT 

201 CGACCGCCAC GCCGCCGCTC AAGGCCGCAC GCCGCATTAT TGCCGGGGCT 

251 TGCGCGTTAC CGACGAAACC TCGCTCGAAC AGGCGCAgCA GTTTGCCGGC 

301 ACCGTCCGCA GCCGTTTTGA AGCCGCATTG TGCGGCAGCG TTTCCGGGTT 

351 CGCGCGCGCG CCTTCCGTCC CGCTCGTATC GGGCAACTTC CTGACCGCCC 

4 01 GTCCGATAGG TGTGATTGAC GGAACCGATA TGGAATACGC GGGCGTTATC 

4 51 CGCAAAACCG ACACCGCCGC CCTCCGTTTC CAACTCGACG CGGGCAATAT 

501 CGTCTGGCTG CCGCCGCTCG GACATTCCTA CAGCGGCAAG ACCTTCTATC 

551 TCGATATGCT TCAAACCGCC GCCTCCGCCG CCGTCTCGCT TCAGGCCGAA 

601 AAACTCGTTT ACCTGACCCT TTCAGACGGC ATTTCCCGCC CCGACGGCAC 

651 GCTCGCCGAA ACCCTCTCGG CACAGGAAGC GCAATCGCTG GCGGAACACG 

701 CCGGCGGGCA AACGCGACGG CTGATTTCGT CCGCCGAACT CTTCACCCGC 

7 51 AACGGCATCG GCACGTCCAT TGCCAAAGAA GCCTTCGTCT CCATCCGGCA 

801 rGCGCAywgG G . CGACATCC CGCACATCGC CGCCCTCATC CGCCCGCTGG 

851 AAGAACAGGG CATCCTGCTG CACCGCAs . c GCGAATACCT CGAAAACCAC 

901 ATTTCCGAAT TTTCCATCCT CGAACACGAC GGCAACCTGT ACGGTTGCGC 

951 CGCCCTGAAA ACCTTTGCCG AAGCCGATTG CGGCGAAATC GCCTGCCTTG 

1001 CCGTCTCGCC GCag . cACAG GACGGCGGCT ACGGCGAACG CnTGCTTGCC 

1051 CACATTATCG ATAAGGCGCG CGGCATAGGC ATAAGCAGGC TGTTCGCACT 

1101 GTCCACAAAT ACCGGCGAAT GGTTTGCCGA ACGCGGCTTT CAGACGGCAT 

1151 CGGAAGACGA GTTGCCCGAA ACGCGGCGCA AAG ACT AC C G CAGCAACGGA 

1201 CGGAACTCGC ATATTCTGGT ACGTCGCCTG CACCGCTGA 

This corresponds to the amino acid sequence <SEQ ID 150; ORF 040>: 

m04 0 . pep 

1 MSAPDLFVAH FREAVPYIRQ MRGKTLVAGI DDRLLEGDTL NKL AADIGLL 

51 SQLGIRLVLI HGARHFLDRH AAAQGRTPHY CRGLRVTDET SLEQAQQFAG 

101 TVRSRFEAAL CGSVSGFARA PSVPLVSGNF LTARPIGVID GTDMEYAGVI 

151 RKTDTAALRF QLDAGNIVWL PPLGHSYSGK TFYLDMLQTA ASAAVSLQAE 

201 KLVYLTLSDG ISRPDGTLAE TLSAQEAQSL AEHAGGQTRR LISSAELFTR 

251 NGIGTSIAKE AFVSIRQAHX XDIPHIAALI RPLEEQGILL HRXREYLENH 

301 ISEFSILEHD GNLYGCAALK TFAEADCGEI ACLAVSPQXQ DGGYGERXLA 

351 HIIDKARGIG ISRLFALSTN TGEWFAERGF QTASEDELPE TRRKDYRSNG 

4 01 RNSHILVRRL HR* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 151>: 

a040 . seq 
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1 ATGATCGTGC CCGACCTCTT TGTCGCCCAC TTCCGCGAAG CCGCCCCCTA 

51 CATCCGCCAA ATGCGCGGCA AAACGCTGGT CGCCGGCATA GACGACCGCC 

101 TGCTCGAAGG TGATACCTTA AACAAGTTCG CCGCCGACAT CGGGCTTTTG 

151 TCGCAACTGG GCATCAGGCT CGTCCTCATC CACGGCGCGC GCCACTTCCT 

201 CGACCGCCAC GCCGCCGCGC AAGGCCGCAC GCCGCATTAT TGCCGGGGCT 

251 TGCGCGTTAC CGACGAAACC TCGCTCGAAC AGGCGCAGCA GTTTGCCGGC 

301 ACCGTCCGCA GCCGTTTTGA AGCCGCATTG TGCGGCAGCG TTTCCGGGTT 

351 CGCGCGCGCG CCTTCCGTCC CGCTCGTATC GGGCAACTTC CTGACCGCCC 

4 01 GTCCGATAGG TGTGATTGAC GGAACCGATA TGGAATACGC GGGCGTTATC 

4 51 CGCAAAACCG ACACCGCCGC CCTCCGTTTC CAACTCGACG CGGGCAATAT 

501 CGTCTGGCTG CCGCCGCTCG GACATTCCTA CAGCGGCAAG ACCTTCCATC 

551 TCGATATGCT TCAAACCGCC GCCTCCGTCG CCGTCTCGCT TCAGGCCGAA 

601 AAACTCGTTT ACCTGACCCT TTCAGACGGC ATTTCCCGCC CCGACGGCAC 

651 GCTCGCCGTA ACCCTCTCGG CACAGGAAGC GCAATCGCTG GCGGAACACG 

701 CCGGCGGCGA AACGCGACGG CTGATTTCGT CCGCCGTTGC CGCGCTCGAA 

7 51 GGCGGCGTGC ATCGCGTCCA AATCCTCAAC GGAGCCGCCG ACGGCAGCCT 

801 GCTGCAAGAA CTCTTCACCC GCAACGGCAT CGGCACGTCC ATTGCCAAAG 

851 AAGCCTTCGT CTCCATCCGG CAGGCGCACA GCGGCGACAT CCCGCACATT 

901 GCCGCCCTCA TCCGCCCGCT GGAAGAACAG GGCATCCTGC TGCACCGCAG 

951 CCGCGAATAC CTCGAAAACC ACATTTCCGA ATTTTCCATC CTCGAACACG 

1001 ACGGCAACCT GTACGGTTGC GCCGCCCTGA AAACCTTTGC CGAAGCCGAT 

1051 TGCGGCGAAA TCGCCTGCCT TGCCGTCTCG CCGCAGGCAC AGGACGGCGG 

1101 CTACGGCGAA CGCCTGCTTG CCCACATTAT CGATAAGGCG CGCGGCATAG 

1151 GCATAAGCAG GCTGTTCGCA CTGTCCACAA ATACCGGCGA ATGGTTTGCC 

1201 GAACGCGGCT TTCAGACGGC ATCGGAAGAC GAGTTGCCCG AAACGCGGCG 

12 51 CAAAGACTAC CGCAGCAACG GACGGAACTC GCATATTCTG GTGCGTCGCC 

1301 TGCACCGCTG A 

This corresponds to the amino acid sequence <SEQ ID 152; ORF 040. a>: 

a040._ 

FREAAPYIRQ MRGKTLVAGI DDRLLEGDTL NKFAADIGLL 

HGARHFLDRH AAAQGRTPHY CRGLRVTDET SLEQAQQFAG 

CGSVSGFARA PSVPLVSGNF LTARPIGVID GTDMEYAGVI 

QLDAGNIVWL PPLGHSYSGK TFHLDMLQTA ASVAVSLQAE 

ISRPDGTLAV TLSAQEAQSL AEHAGGETRR LISSAVAALE 

GAADGSLLQE LFTRNGIGTS IAKEAFVSIR QAHSGDI PHI 

GILLHRSREY LENHISEFSI LEHDGNLYGC AALKTFAEAD 

PQAQDGGYGE RLLAHIIDKA RGIGISRLFA LSTNTGEWFA 
ELPETRRKDY RSNGRNSHIL VRRLHR* 

m040/a040 91.5% identity in 436 aa overlap 

10 20 30 40 50 60 

m04 0.pep MSAPDLFVAHFREAVPYIRQMRGKTLVAGIDDRLLEGDTLNKIAADIGLLSQLGIRLVLI 
j : I I I I I I I I M I : I I I I I I I I I t I I 1 I I 11 I I I I I I I I I I : I I II I I I I I I I I II I 1 I 
a0 4 0 MIVPDLFVAHFREAAPYIRQMRGKTLVAGIDDRLLEGDTLNKFAADIGLLSQLGIRLVLI 

10 - 20 30 40 50 60 

70 80 90 100 110 120 

m04 0.pep HGARHFLDRHAAAQGRTPHYCRGLRVTDETSLEQAQQFAGTVRSRFEAALCGSVSGFARA 
I M I I I I I I I I I I I I I I I I I I II 1 II I I 1 I I I I I II I I I II I I M I I I I I I I I 11 I I I I I 
a04 0 HGARHFLDRHAAAQGRTPHYCRGLRVTDETSLEQAQQFAGTVRSRFEAALCGSVSGFARA 

70 80 90 100 110 120 

130 140 150 160 170 180 

m04 0 . pep PSVPLVSGNFLTARPIGVIDGTDMEYAGVIRKTDTAALRFQLDAGNIVWLPPLGHSYSGK 
I I 1 11 I I I I I I I II I I II I I M I II I I I I I I I I 1 I II I I I I I I II M I I I I I I I I 11 it I 
a04 0 PSVPLVSGNFLTARPIGVIDGTDMEYAGVIRKTDTAALRFQLDAGNIVWLPPLGHSYSGK 

130 140 150 160 170 180 

190 200 210 220 230 240 

m04 0 .pep TFYLDMLQTAASAAVSLQAEKLVYLTLSDGISRPDGTLAETLSAQEAQSLAEHAGGQTRR 
I I : I M I I I I I I : I i I M | M I I I I I | | | | | | M | | | | | I I I I I I I I II I I ! I I I : I I I 
a04 0 TFHLDMLQTAASVAVSLQAEKLVYLTLSDGISRPDGTLAVTLSAQEAQSLAEHAGGETRR 



pep 




1 


MIVPDLFVAH 


51 


SQLGIRLVLI 


101 


TVRSRFEAAL 


151 


RKTDTAALRF 


201 


KLVYLTLSDG 


251 


GGVHRVQILN 


301 


AALIRPLEEQ 


351 


CGEIACLAVS 


401 


ERG FQTASED 
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190 200 210 220 230 240 



250 260 270 

m04 0 . pep LISSA ELFTRNGIGTS I AKEAFVS I RQAHXXD I PHI 

Mill I I I M I I I I I I M I I II I M I I I I MM! 

a04 0 LISSAVAALEGGVHRVQILNGAADGSLLQELFTRNGIGTSIAKEAFVSIRQAHSGDIPHI 

250 260 270 280 290 300 

280 290 300 310 320 330 

m04 0 . pep AALIRPLEEQGILLHRXREYLENHISEFSILEHDGNLYGCAALKTFAEADCGEIACLAVS 
M I I I I I II I I M 11 I M I II II I II i I II I I I II II II II II I II II I I II II I I II I 
a04 0 AALIRPLEEQGILLHRSREYLENHISEFSILEHDGNLYGCAALKTFAEADCGEIACLAVS 

310 320 330 340 350 360 

340 350 360 370 380 390 

m04 0 .pep PQXQDGGYGERXLAHI IDKARGIGISRLFALSTNTGEWFAERGFQTASEDELPETRRKDY 

M I! I II I M I M II I I II M M M II I I M I I M M I II I I M M II I I I M I II II 
a04 0 PQAQDGGYGERLLAHIIDKARGIGISRLFALSTNTGEWFAERGFQTASEDELPETRRKDY 

370 380 390 400 410 420 

400 410 
m04 0 .pep RSNGRNSHILVRRLHRX 
M I I I M I I M II I I II 
a04 0 RSNGRNSHILVRRLHRX 

430 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 040 shows 88.3% identity over a 436 aa overlap with a predicted ORF (ORF 040.ng) 
from N. gonorrhoeae: 

m040/g040 



m04 0 . pep MSAPDLFVAHFREAVPYIRQMRGKTLVAGIDDRLLEGDTLNKLAADIGLLSQLGIRLVLI 60 

Mill I II I! I 1 I : I I I I I M I I II II II Mill I I I II I M M M I II I M I I M 
g 04 0 MNAPDSFVAHFREAAPYIRQMRGTTLVAGIDGRLLEGGTLNKLAADIGLLSQLGIRLVLI 60 

m04 0 . pep HGARHFLDRHAAAQGRTPHYCRGLRVT DETSLEQAQQFAGTVRSRFEAALCGSVSGFARA 120 

Ml Mill I I I II I II I I I I I M II 11 II I I M I I I I II I I I I I M M M I II I M I 
g04 0 HGAYHFLDRLAAAQGRTPHYCRGLRVTDETSLGQAQQFAGTVRSRFEAALCGSVSGFARA 120 

m04 0 . pep PSVPLVSGNFLTARPIGVIDGTDMEYAGVIRKTDTAALRFQLDAGNIVWLPPLGHSYSGK 180 

II II I II II M II I I : I M I II I I I I M M I I 11 I II I I II M I M M I : I I I M M : M 
g04 0 PSVPLVSGNFLTARPMGVIDGTDMEYAGVIRKTDTAALRFQLDAGNIVWMPPLGHSYGGK 180 

m04 0 . pep TFYLDMLQTAASAAVSLQAEKLVYLTLSDGISRPDGTLAETLSAQEAQSLAEHAGGQTRR 24 0 

M II I : I : I I M I M I I I I I I I M I II II I M I II I I I I I I II I II M II I M ::: II I 
g04 0 TFNLDMVQAAASVAVSLQAEKLVYLTLSDGISRPDGTLAETLSAQEAQSLAEHAASETRR 24 0 

m040 .pep LISSA ELFTRNGIGTS IAKEAFVSIRQAHXXDI PHI 27 6 

Mill M I M II I I I II I I II II I I M I I I I I I I 

g04 0 LISSAVAALEGGVHRVQILNGAADGSLLQELFTRNGIGTSIAKEAFVSIRQAHSGDIPHI 300 

m040 .pep AALI RPLEEQG I LLHRXRE YLENHI SE FS I LEHDGNLYGCAALKTFAEADCGE I ACLAVS 336 

11 M I II I I I I : I I I I I II M I II I I M II I II I : M I I I II I I I M I I I I M I I I I I I 
g04 0 AALIRPLEEQGVLLHRSREYLENHISEFSILEHDGDLYGCAALKTFAEADCGEIACLAVS 3 60 

m040.pep PQXQDGGYGERXLAHI I DKARGIGISRLFALSTNTGEWFAERGFQTASEDELPETRRKDY 396 

M I I M I I I I I I I I M I I II I I I I II I I I I I II I I II I II II II I I I I I I M I I II II 
g04 0 PQAQDGGYGERLLAHIIDKARGIGISRLFALSTNTGEWFAERG FQTASEDELPETRRKDY 4 20 

m04O.pep RSNGRNSHILVRRLHRX 413 

II M I I II I I I I I I I I 
g040 RSNGRNPHILVRRLHRX 437 
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The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 153>: 

g04 1 . seq 

1 ATGAGTTCGC CCAAACACAT CGGCTTGCAG GGCGGCAGCA ACGGCGGCCT 

51 GATTACCGCC GCCGCCTTCG TGCGCGAACC GCAAAGCATC GGTGCGCTGG 

101 TGTGCGAAGT ACCGCTGACC GATATGATCC GTTATCCGCT GCTGTCCGCC 

151 GGTTCAAGTT GGACGGACGA ATACGGCAAT CCGCAGAAAT ACGAAGCCTG 

201 CAAACGCCGG CTGGGCGAAT TGTCGCCGTA TCACAATCTT TCAGACGGCA 

251 TCGATTATCC GCCCGCACTC ATTACCACCA GCCTCAGCGA CGACCGCGTC 

301 CATCCCGCCC ACGCGCTCAA ATTCTACGCC AAACTGCGCG AAACCTCGCC 

351 GCAATCTTGG CTCTACTCGC CTGACGGCGG CGGCCATACC GGCAACGGCA 

4 01 CCCAACGCGA ATCCGCCGAC AAACTCGCCT GCGTGTTGCT GTTTTTGAAA 

4 51 GAATTTTTGG GATAA 

This corresponds to the amino acid sequence <SEQ ID 154; ORF 041.ng>: 

g041 . pep 

1 MSSPKHIGLQ GGSNGGLITA AAFVREPQSI GALVCEVPLT DMIRYPLLSA 

51 GSSWTDEYGN PQKYEACKRR LGELSPYHNL SDGIDYPPAL ITTSLSDDRV 

101 HPAHALKFYA KLRETSPQSW LYSPDGGGHT GNGTQRESAD KLACVLLFLK 

151 EFLG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 155>: 

m04 1 . seq 

1 ATCAGTTCGC CCGAACACAT CGGCTTGCAG GGCGGCAGCA ACGGCGGACT 

51 GATTACTGCC GCCGCCTTCG TGCGCGAACC GCAAAGCATC GGCGCGCTGG 

101 TGTGCGAAGT GCCGCTGACC GACATGATCC GTTATCCGCT GCTCTCCGCC 

151 GGTTCAAGCT GGACAGACGA ATACGGCAAT CCGCAAAAAT ACGAAGTCTG 

201 CAAACGCCGG TTGGGCGAAT TGTCGCCGTA TCACAATCTT TCAGACGGCA 

251 TCGATTATCC GCCCGCGCTC ATTACCACCA GCCTGTCCGA CGATCGCGTC 

301 CATCCCGCCC ACGCGCTCAA GTTCTACGCC AAACTGCGCG AAACCTCCGC 

351 GCAATCTTGG CTCTACTCGC CTGACGGCGG CGGCCATACC GGCAACGGCA 

4 01 CCCAACGCGA ATCCGCCGAC GAACTCGCCT GCGTCTTGCT GTTTTTGAAA 

4 51 GAGTTTTTGG GCTAA 

This corresponds to the amino acid sequence <SEQ ID 156; ORF 04 1>: 

m04 1 . pep 

1 ISSPEHIGLQ GGSNGGLITA AAFVREPQSI GALVCEVPLT DMIRYPLLSA 

51 GSSWT DEYGN PQKYEVCKRR LGELSPYHNL SDGIDYPPAL ITTSLSDDRV 

101 HPAHALKFYA KLRETSAQSW LYSPDGGGHT GNGTQRESAD ELACVLLFLK 

151 EFLG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 157>: 

a041 . seq 

1 ATCAGTTCGC CCGAACACAT CGGCTTGCAG GGCGGCAGCA ACGGCGGACT 

51 GATTACTGCC GCCGCCTTCG TGCGCGAACC GCAAAGCATA GGCGCGCTGG 

101 TGTGCGAAGT GCCGCTGACC GACATGATCC GTTATCCGCT GCTCTCCGCC 

151 GGTTCAAGCT GGACAGACGA ATACGGCAAT CCGCAAAAAT ACGAAGTCTG 

201 CAAACGCCGG TTGGGCGAAT TGTCGCCGTA TCACAATCTT TCAGACGGCA 

251 TCGATTATCC GCCCGCGCTC ATTACCACCA GCCTGTCCGA CGATCGCGTC 

301 CATCCCGCCC ACGCGCTCAA GTTCTACGCC AAACTGCGCG AAACCTCGCC 

351 GCAATCTTGG CTCTACTCGC CTGACGGCGG CGGCCATACC GGCAACGGCA 

4 01 CGCAGCGCGA AGCCGCCGAC GAACTCGCCT GCGTGTTGCT GTTTTTGAAA 

4 51 GAGTTTTTGG GCTAA 

This corresponds to the amino acid sequence <SEQ ED 158; ORF 041. a>: 

a041 . pep 

1 ISSPEHIGLQ GGSNGGLITA AAFVREPQSI GALVCEVPLT DMIRYPLLSA 

51 GSSWT DEYGN PQKYEVCKRR LGELSPYHNL SDGIDYPPAL ITTSLSDDRV 

101 HPAHALKFYA KLRETSPQSW LYSPDGGGHT GNGTQREAAD ELACVLLFLK 
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151 EFLG* 



m04i/a04i 98.7% identity over a 154 aa overlap 

10 20 30 40 50 60 

m041 .pep ISSPEHIGLQGGSNGGLITAAAFVREPQSIGALVCEVPLT DMIRYPLLSAGSSWTDEYGN 

I I I I I I I I I I I I I I I I I I M I I I II I I II I I M I I I I M I I I I I I II I I I I I I I I I I I f I 
a041 ISSPEHIGLQGGSNGGLITAAAFVREPQSIGALVCEVPLTDMIRYPLLSAGSSWTDEYGN 

10 20 30 40 50 60 



70 80 90 100 110 120 

m04 1 . pep PQKYEVCKRRLGELSPYHNLSDGIDYPPALITTSLSDDRVHPAHALKFYAKLRETSAQSW 
I I I I I I I II I M M i II I M I! I II I I II I I i I I I I I I I I I I M I M I M II I I I I III 
a041 PQKYEVCKRRLGELSPYHNLSDGIDYPPALITTSLSDDRVHPAHALKFYAKLRETSPQSW 

70 80 90 100 110 120 



130 140 150 

m04 1 . pep LYS PDGGGHTGNGTQRE S ADELACVLLFLKE FLGX 

I I I I I I I I I I I I I I M I : I I I I I I I I I I II I I II I 
a041 LYSPDGGGHTGNGTQREAADELACVLLFLKE FLGX 

130 140 150 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 041 shows 96.8% identity over a 154 aa overlap with a predicted ORF (ORF 041. ng) 
from N. gonorrhoeae: 

m041/g041 

10 20 30 40 50 60 

m04 1 . pep ISSPEHIGLQGGSNGGLITAAAFVREPQSIGALVCEVPLT DMIRYPLLSAGSSWTDEYGN 

: | | | : | M I I I I I I I I [ I I I II I I I It I I II I I I I I I I I I I I I II I I I I I II I I I I II I I 
g 041 MSSPKHIGLQGGSNGGLITAAAFVREPQSIGALVCEVPLTDMIRYPLLSAGSSWTDEYGN 

10 20 30 40 50 60 



70 80 90 100 110 120 

m041 .pep PQKYEVCKRRLGELSPYHNLSDGIDYPPALITTSLSDDRVHPAHALKFYAKLRETSAQSW 
I I I I I : I I I I I 1 I I I I I I I I I I I I II I I II I I I I I M I II I I I I I I I M I I I I I I I Ml 
g041 PQKYEACKRRLGELSPYHNLSDGIDYPPALITTSLSDDRVHPAHALKFYAKLRETSPQSW 

70 80 90 100 110 120 



130 140 150 

m041 .pep LYS PDGGGHTGNGTQRE S ADELACVLLFLKE FLGX 

I I I I I I I I M I M I I I I I I I : I I M I I I I I I I I I I 
g 0 4 1 LYS PDGGGHTGNGTQRE S ADKLAC VLL FLKE FLGX 

130 140 150 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 159>: 

g041-l . seq 

1 ATGAAATCCT ACCCCGACCC CTACCGCCAT TTTGAAAACC TCGATTCCGC 

51 CGAAACGCAA AACTTCGCTG CTGAAGCGAA TGCCGAAACG CGCGCGCGTT 

101 TTTTAAACAA CGACAAGGCG CGCGCACTTT CAGACGGCAT TTTGAATCAA 

151 ATGCAGGACA CGCGGCAGAT TCCGTTTTGT CAGGAACACC GCGCGCGGAT 

2 01 GTACCATTTC CAT C AGAATG CGGAATATCC GAAGGGCGTG TACCGCATGT 

2 51 GTACGGCGGC GACCTACCGT TCCGGCTATC CCGAGTGGAA AATCCTGTTT 

301 TCGGTGGCGG ATTTCGATGA GTTGCTCGGC GACGATGTGT ATTTGGGCGG 

351 CGTGTCGCAC TTGGTGGAGC AGCCCAACCG CGCGCTGCTG ACTTTGAACA 

4 01 AATCGGGCGG CGATACGGCG TATACGCTGG AAGTGGATTT GGAAGCAGGG 

4 51 GAATTGGTAG AGGGCGGTTT TCACTTTCCG GCAGGCAAAA ACCATGTGTC 

501 GTGGCGCGAT GAAAACAGCG TGTGGGTGTG TCCGGCTTGG GACGAACGCC 

551 AGTTGACCGA ATCGGGCTAT CCGCGCGAAG TGTGGCTGGT GGAACGCGGC 



WO 99/057280 



PCT/US99/09346 



232 



601 AAGAGTTTCG AGGAAAGCCT GCCGGCGTAC CAAATCGATA AAGGCGCGAT 

651 GATGGTAAAC GCGTGGCGTT ACCTCGATCC GCAGGGTTCG CCGATTGATT 

7 01 TGATTGAAGC GTCGGACGGT TTTTACACCA AGACGTATTT GCAGGTGTCG 

7 51 TCCGAAGGCG GGGCGAAACC "GTTGAACCTG CCTAATGATT GCGATGTGGT 

801 CGGCTATCTG GCGGGACATC TTTTGCTGAC GCTGCGCAAG GACTGGCACC 

851 GCGCGAACCA AAGCTATCCG AGTGGCGCGT TGGTGGCGGT GAAACTGAAT 

901 CGGGGCGAAC TCGGGGCGGC GCAGCTTTTG TTTGCGCCCG ATGAAACGCA 

951 GGCATTGGAA AGCGTGGAAA CGACCAAGCG TTTTGTGGTG GCAAGCCTGC 

1001 TGGAGAATGT ACAAGGCCGT CTGAAAGCGT GGCGGTTTGC CGACAGCAAA 

10 51 TGGCAGGAAG CCGAGTTGCC GCACCTGCCC TCGGGCGCGT TGGAAATGAC 

1101 CGACCAACCG TGGGGCGGCG ACGTGGTTTA TCTTGCCGCC AGCGATTTCA 

1151 CCACGCCGCT GACGCTGTTT GCGCTGGATT TGAACGTGAT GGAACTGACC 

1201 GTCATGCGCC TCCAGCCGCA GCAGTTTGTT TCAGACGGCA TCGAAGTGCG 

1251 GCAGTTTTGG GCGGTGTCGT CCGACGGCGA ACGCATTCCT TATTTCCACG 

1301 TCGGCAAAAA CGCCGCGCCC GACACGCCGA CCTTAGTCTA TGCTTACGGA 

1351 GGTTTCGGCA TTCCTGAATT GCCGCATTAT CTGGGCAGCG TCGGCAAATA 

14 01 TTGGCTGGAA GAGGGCAATG CCTTTGTATT GGCAAACATC CGCGGCGGCG 

14 51 GAGAATTCGG CCCGCGCTGG CATCAGGCGG CGCAGGGAAT CAGCAAACAC 

1501 AAAAGCGTTG ATGATTTGTT GGCAGTCGTG CGTGATTTGT CCGAACGCGG 

1551 CATGAGTTCG CCCAAACACA TCGGCTTGCA GGGCGGCAGC AACGGCGGCC 

1601 TGATTACCGC CGCCGCCTTC GTGCGCGAAC C G C AAAG CAT CGGTGCGCTG 

1651 GTGTGCGAAG TACCGCTGAC CGATATGATC CGTTATCCGC TGCTGTCCGC 

17 01 CGGTTCAAGT TGGACGGACG AATACGGCAA TCCGCAGAAA TACGAAGCCT 

17 51 GCAAACGCCG GCTGGGCGAA TTGTCGCCGT ATCACAATCT TTCAGACGGC 

1801 AT C GAT TAT C CGCCCGCACT CATTACCACC AGCCTCAGCG ACGACCGCGT 

1851 CCATCCCGCC CACGCGCTCA AATTCTACGC CAAACTGCGC GAAACCTCGC 

1901 CGCAATCTTG GCTCTACTCG CCTGACGGCG GCGGCCATAC CGGCAACGGC 

1951 ACCCAACGCG AATCCGCCGA CAAACTCGCC TGCGTGTTGC TGTTTTTGAA 

2001 AGAATTTTTG GGATAA 



This corresponds to the amino acid sequence <SEQ ID 160; ORF 041-1. ng> 

g04 1-1 .pep 

FENLDSAETQ 
QEHRARMYHF 
DDVYLGGVSH 
AGKNHVSWRD 
QIDKGAMMVN 



1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 



MKSYPDPYRH 
MQDTRQIPFC 
SVADFDELLG 
ELVEGGFHFP 
KSFEESLPAY 
SEGGAKPLNL 
RGELGAAQLL 
WQEAELPHLP 
VMRLQPQQFV 
GFGIPELPHY 
KSVDDLLAW 
VCEVPLTDMI 
IDYPPALITT 



RARFLNN DKA 
YRMCTAATYR 
TLNKSGGDTA 
DERQLTESGY 
PIDLIEASDG 
DWHRANQSYP 



NFAAEANAET 
HQNAEYPKGV 
LVEQPNRALL 
ENSVWVCPAW 
AWRYLDPQGS 
PNDCDWGYL AGHLLLTLRK 
FAPDETQALE SVETTKRFW ASLLENVQGR 
SGALEMTDQP WGGDWYLAA SDFTTPLTLF 
SDGIEVRQFW AVSSDGERIP YFHVGKNAAP 
LGSVGKYWLE EGNAFVLAN I RGGGEFGPRW 
RDLSERGMSS PKHIGLQGGS NGGLITAAAF 
RYPLLSAGSS WTDEYGNPQK YEACKRRLGE 
SLSDDRVHPA HALKFYAKLR ETSPQSWLYS 



RALSDGILNQ 
SGYPEWKILF 
YTLEVDLEAG 
PREVWLVERG 
FYTKTYLQVS 
SGALVAVKLN 
LKAWRFADSK 
ALDLNVMELT 
DTPTLVYAYG 
HQAAQGISKH 
VRE PQSIGAL 
LSPYHNLSDG 
PDGGGHTGNG 



TQRESADKLA CVLLFLKEFL G* 



The following partial DNA sequence was 

m04 1-1 . seq 

1 ATGAAATCCT ACCCCGACCC 

51 CGAAACGCAA AACTT CGCTG 

101 TTTTAGAAAA CGACAAGGCG 

151 TTGCAGGACA CGCGGCAGAT 

201 GTACCATTTC CATCAGGACG 

251 GTACCGCGGC G AC GT AT C G T 

301 TCGGTGGCGG ATTTCGACGA 

351 CGTGTCGCAC TTGGTGGAAC 

4 01 AATTGGGCAG CGATACGGCG 

4 51 GAGTTGGTCG AAGGCGGTTT 

501 GTGGCGCGAT GAAAACAGCG 

551 AGTTGACCCA ATCGGGCTAT 

601 AAGAGTTTCG AGGAAAGCCT 

651 GATGGTGAAC GCGTGGCGTT 

701 TGATTGAAGC GTCGGACGGT 



identified in N. meningitidis <SEQ ID 161>: 



CTACCGCCAT 
CTGAAGCGAA 
CGCGCGCTTT 
TCCGTTTTGT 
CGGAGTATCC 
TCCGGCTATC 
ATTGCTTGGC 
AGCCCAACCG 
TACACGCTGG 
TCACTTTCCG 
TGTGGGTGTG 
CCGCGCGAAG 
GCCTGTGTAT 
ATCTCGATCC 
TTTTACACCA 



TTTGAAAACC 
TGCCGAAACG 
CAGACGGCAT 
CAGGAACACC 
GAAGGGCGTG 
CCGAGTGGAA 
GACGATGTGT 
CGCGTTGTTA 
AAGTGGATTT 
GCAGGCAAAA 
TCCGGCTTGG 
TATGGCTGGT 
CAAATCGGCG 
GCAGGGTTCG 
AAACCTATTT 



TCGATTCCGC 
CGCGCGCGTT 
TTTGGCGCAG 
GCGCGCGGAT 
TACCGCGTGT 
AATCCTGTTT 
ATTTGGGCGG 
ACACTGAGCA 
GGAAGCAGGG 
ACCATGTGTC 
AACGAACGCC 
GGAACGCGGC 
AAGACGGCAT 
CCGATTGATT 
GCGGGTCTCA 
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7 51 GCCGAAGGCG AGGCGAAACC GTTAAACCTG CCCAACGATT GCGACGTGGT 

801 CGGCTATCTG GCGGGGCATC TTTTGCTGAC GCTGCGCAAG GACTGGAACC 

851 GCGCGAACCA AAGCTATCCG AGCGGCGCGC TGGTGGCGGT GAAGCTGAAT 

901 CGGGGCGAAC TCGGGGCGGC GCAGCTTTTG TTTGCGCCCG ATGAAACGCA 

951 GGCATTGGAA AGCGTGGAAA CGACCAAGCG TTTTGTGGTG GCGAGCCTGT 

1001 TGGAGAACGT ACAAGGCCGT CTGAAAGCAT GGCGGTTTGC CGACGGCAAA 

1051 TGGCAGGAAG TCGAATTGCC GCGCCTGCCT TCGGGCGCGT TGGAAATGAC 

1101 CGACCAACCT TGGGGCGGCG ACGTGGTTTA CCTTGCCGCC AGCGATTTCA 

1151 CCACGCCGCT GACGCTGTTT GCGCTGGATT TGAACGTGAT GGAACTGACC 

12 01 GTCATGCGCC GCCAGCCGCA GCAGTTTGAT TCAGACGGCA TTAACGTGCA 

12 51 GCAGTTTTGG ACGACTTCGG CTGACGGCGA GCGCATTCCT TATTTCCACG 

1301 TCGGCAAAAA CGCCGCGCCC GACATGCCGA CGCTGGTCTA TGCCTACGGC 

1351 GGTTTCGGCA TTCCCGAATT GCCGCATTAT CTGGGCAGCA TTGGCAAATA 

14 01 TTGGCTGGAA GAGGGCAATG CCTTTGTATT GGCGAACATC CGCGGCGGCG 

14 51 GCGAGTTCGG CCCGCGCTGG CATCAGGCGG CGCAGGGAAT CAGCAAACAT 

1501 AAAAGCGTTG ATGATTTATT GGCAGTCGTG CGCGATTTGT CCGAACGCGG 

1551 TATCAGTTCG CCCGAACACA TCGGCTTGCA GGGCGGCAGC AACGGCGGAC 

1601 TGATTACTGC CGCCGCCTTC GTGCGCGAAC CGCAAAGCAT CGGCGCGCTG 

1651 GTGTGCGAAG TGCCGCTGAC CGACATGATC CGTTATCCGC TGCTCTCCGC 

1701 CGGTTCAAGC T G G AC AG AC G AATACGGCAA TCCGCAAAAA TACGAAGTCT 

17 51 GCAAACGCCG GTTGGGCGAA TTGTCGCCGT ATCACAATCT TTCAGACGGC 
1801 ATCGATTATC CGCCCGCGCT CATTACCACC AGCCTGTCCG ACGATCGCGT 

18 51 CCATCCCGCC CACGCGCTCA AGTTCTACGC CAAACTGCGC GAAACCTCCG 
1901 CGCAATCTTG GCTCTACTCG CCTGACGGCG GCGGCCATAC CGGCAACGGC 
1951 ACCCAACGCG AATCCGCCGA CGAACTCGCC TGCGTCTTGC TGTTTTTGAA 
2 001 AGAGTTTTTG GGCTAA 



This corresponds to the amino acid sequence <SEQ ID 162; ORF 041 -1>: 

mO 4 1 - 1 .pep 

1 MKSYPDPYRH FENLDSAETQ NFAAEANAET RARFLENDKA RALSDGILAQ 

51 LQDTRQIPFC QEHRARMYHF HQDAEYPKGV YRVCTAATYR SGYPEWKILF 

101 SVADFDELLG DDVYLGGVSH LVEQPNRALL TLSKLGSDTA YTLEVDLEAG 

151 ELVEGGFHFP AGKNHVSWRD ENSVWVCPAW NERQLTQSGY PREVWLVERG 

201 KSFEESLPVY QIGEDGMMVN AWRYLDPQGS PIDLIEASDG FYTKTYLRVS 

251 AEGEAKPLNL PNDCDWGYL AGHLLLTLRK DWNRANQSYP SGALVAVKLN 

301 RGELGAAQLL FAPDETQALE SVETTKRFW ASLLENVQGR LKAWRFADGK 

351 WQEVELPRLP SGALEMTDQP WGGDWYLAA SDFTTPLTLF ALDLNVMELT 

401 VMRRQPQQFD SDGINVQQFW TTSADGERIP YFHVGKNAAP DMPTLVYAYG 

4 51 GFGIPELPHY LGSIGKYWLE EGNAFVLANI RGGGEFGPRW HQAAQGISKH 

501 KSVDDLLAW RDLSERGISS PEHIGLQGGS NGGLITAAAF VREPQSIGAL 

551 VCEVPLTDMI RYPLLSAGSS WTDEYGNPQK YEVCKRRLGE LSPYHNLSDG 

601 IDYPPALITT SLSDDRVHPA HALKFYAKLR ETSAQSWLYS PDGGGHTGNG 

651 TQRESADELA CVLLFLKEFL G* 



m041-l/gO41-X 94.6% identity in 671 aa overlap 



10 20 30 40 50 60 

m041-l .pep MKSYPDPYRH FEN LDSAETQNFAAEANAETRARFLENDKARALSDG I LAQ LQDTRQIPFC 
I I I I M ! I I I ! I I I M I I I I I I I I M I I I I I I t I i : II I I I i It I I I I !: I I I I I I II I 
g041-l MKSYPDPYRHFENLDSAETQNFAAEANAETRARFLNNDKARALSDGILNQMQDTRQIPFC 

10 20 30 40 50 60 



70 80 90 100 110 120 

m04 1-1 .pep QEHRARMYHFHQDAEYPKGVYRVCTAATYRSGYPEWKILFSVADFDELLGDDVYLGGVSH 
I I I! I I I II I I I : I I I I I I I I I : I I I I I I I I I I I I I I II I 1 I I I I I I I I I I I I I I I I I I I 
g041-l QEHRARMYHFHQNAEYPKGVYRMCTAATYRSGYPEWKILFSVADFDELLGDDVYLGGVSH 

70 80 90 100 110 120 



130 140 150 160 170 180 

m041-l . pep LVEQPNRALLTLSKLGSDTAYTLEVDLEAGELVEGGFHFPAGKNHVSWRDENSVWVCPAW 
I I I It I I I I I I I : I I : I I I I I I I I I II M I I I I M I I I I i I I I I I I I I II I I I I I I I I I 
g041-l LVEQPNRALLTLNKSGGDTAYTLEVDLEAGELVEGGFHFPAGKNHVSWRDENSVWVCPAW 

130 140 150 160 170 180 

190 200 210 220 230 240 

m041-l.pep NERQLTQSGYPREVWLVERGKSFEESLPVYQIGEDGMMVNAWRYLDPQGS PIDLIEASDG 
: I I I I I : I I I I I I I I I I I I I I I I I I I I I : I I I : : I I M I I I I II M I I I I I I I I I I I I 
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g041-l DERQLTESGYPREVWLVERGKSFEESLPAYQIDKGAMMVNAWRYLDPQGSPIDLIEASDG 

190 200 210 220 230 240 

250 260 270 280 " 290 300 

m04 1-1 . pep FYTKTYLRVSAEGEAKPLNLPNDCDWGYLAGHLLLTLRKDWNRANQSYPSGALVAVKLN 
| I | | | |[ : I I : II I M i I I I I I I I I I I I M If I M I I I i ! I : I I I M M I I ! M i i I i I 
g041-l FYTKTYLQVSSEGGAKPLNLPNDCDWGYLAGHLLLTLRKDWHRANQSYPSGALVAVKLN 

250 260 270 280 290 300 

310 320 330 340 350 360 

m041-l . pep RGELGAAQLLFAPDETQALESVETTKRFWASLLENVQGRLKAWRFADGKWQEVELPRLP 
M ! I I I I I I I I I I I M I M I I I I I I I M I M I I I I I I I I I I I I I I I I I : M IE : I II : I I 
g041-l RGELGAAQLLFAPDETQALESVETTKRFWASLLENVQGRLKAWRFADSKWQEAELPHLP 

310 320 330 340 350 360 

370 380 390 400 410 420 

m041-l .pep SGALEMTDQPWGGDWYLAASDFTTPLTLFALDLNVMELTVMRRQPQQFDSDGINVQQFW 
I I I I M M 11 I I I 1 II I I I I I LI I I I I I I I I I M I I I 11 I I I I INN I I t I = (-III 
g041-l SGALEMTDQPWGGDWYLAASDFTTPLTLFALDLNVMELTVMRLQPQQFVSDGIEVRQFW 

370 380 390 400 410 420 

430 440 450 460 470 480 

m041-l.pep TTSADGERIPYFHVGKNAAPDMPTLVYAYGGFGIPELPHYLGSIGKYWLEEGNAFVLANI 
:: I : I I I M I I I I I II I I I I I I I 1 I I I I I I I I I 1 I M I M I I : I I I It I M I I I I I I M 
• g041-l AVSSDGERIPYFHVGKNAAPDTPTLVYAYGGFGIPELPHYLGSVGKYWLEEGNAFVLANI 

4 30 440 450 4 60 470 480 

490 500 510 520 530 540 

m041-l.pep RGGGEFGPRWHQAAQGISKHKSVDDLLAVVRDLSERGISSPEHIGLQGGSNGGLITAAAF 
I | M II I I I I I I I I I I I 1 M I I 1 I 1 I I I M I I I I I I 1 : I II : I! I M I I I I I II 1 I I I M 
g041-l RGGGEFGPRWHQAAQGISKHKSVDDLLAWRDLSERGMSSPKHIGLQGGSNGGLITAAAF 

490 500 510 520 530 540 



550 560 570 580 590 600 

m041-l - pep VREPQSIGALVCEVPLT DMIRYPLLSAGSSWTDEYGN PQKYEVCKRRLGELSPYHNLSDG 
I | | | | | | | I I I I I I I I I I I I I I I I I I I II I I I I I M I I I M I : I I I 1 M I I I I I I I II II 
g041-l VREPQSIGALVCEVPLTDMIRYPLLSAGSSWTDEYGNPQKYEACKRRLGELSPYHNLSDG 

550 560 570 580 590 600 

610 620 630 640 650 660 

m041-l .pep IDYPPALITTSLSDDRVHPAHALKFYAKLRETSAQSWLYSPDGGGHTGNGTQRESADELA 
M I I I I I I I II I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I M I I I I 1 I I : I I 
g041-l IDYPPALITTSLSDDRVHPAHALKFYAKLRETSPQSWLYSPDGGGHTGNGTQRESADKLA 

610 620 630 640 650 660 



670 

m041-l .pep CVLLFLKEFLGX 
I I I I I M I I I I i 
g04 1-1 CVLLFLKEFLGX 

670 



m041-l/P55577 

sp I P55577 | Y4NA_RHISN PROBABLE PEPTIDASE Y4NA >gi I 2182536 {AE000086) Y4nA [Rhizobium sp . 
NGR234] Length = 726 
Score = 370 bits (940), Expect = e-101 

Identities = 217/682 (31%), Positives = 331/682 (47%), Gaps = 22/682 (3%) 



Query: 


2 


KSYPDPYRHFENLDSAETQNFAAEANAETRARFLENDKARALSDGILAQLQDTRQIPFCQ 


61 




K DP + +D + + N T + ++ + L LQ T +1 




Sbjct : 


42 


KDASDPRAYLNEIDGDKAMTWVEAHNLSTVDKLSKDPRYSEYQADALTILQATDRIASPS 


101 


Query : 


62 


EHRARMY-HFHQDAEYPKGVYRVCTAATYRSGYPEWKILFSVADFDELLGDDVYLGGVSH 


120 




R M +F QD + +G++R T +YRSG P+W+ + V + G G 




Sbjct: 


102 


FARDGMIDNFWQDGTHVQGLWRRTTWESYRSGNPQWRTILDVDALSKAEGKTWVFEGGDC 


161 


Query: 


121 


LVEQPNRALLTLSKLGSDTAYTLEVDLEAGELVEGGFHFPAGKNHVSWRDENSVWVCPAW 


180 



L N L+ LS G D E D+ GE V+ GF P GK V+W DEN+++V W 
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Sbjct: 162 LPPTSNLCLIRLSDGGKDADVVREFDIAKGEFVKEGFVLPEGKQSVTWVDENTIYVTREW 221 

Query: 181 NERQLTQSG YPREVWLVERGKS FEESLPVYQ IGEDGMM — VNAWRYLDPQGSPI 232 

++T SGY +V+RG+S ++++ +++ E G++ ++ +D + 

Sbjct: 222 TPGEVTSSGYAYVTKVVKRGQSLDQAVEIFRGQKKDVSAERGVLRDIDGKYVMDTSYRGL 281 

Query: 23 3 DLIEASDGFYTKTYLRVSAEGEAKPLNLPNDCDVVGYLAGHLLLTLRKDWNRANQS-YPS 291 

D FY + + L LP GY G + L+ DW A + + + 

Sbjct: 282 DFFNTELAFYPNGH PDTRKVVLPLPTTAVFSGYYKGQAI YWLKSDWTSAKGTVFHN 337 

Query: 2 92 GALVAVKLNRGELGAAQL LFAPDETQALESVETTKRFVVASLLENVQGRLKAWRFA 347 

GA++A L A++ LF P+E Q++ TK +V S+L NV + + + + F 

Sbjct: 33 8 GAI I AFDLKAALADPARVEPLVLFMPNEHQSVAGTTQTKNRLVLSI LSNVTSEVRS FDFG 397 

Query: 34 8 DGKWQEVELPRLPSGALEMTDQPWGGDWYLAASDFTTPLTLFALDLNVMELTVMRRQPQ 4 07 

G W +L + L +T D +++ + F P TLF D ++ + P 

Sbjct: 398 KGGWSSFKLALPENSTLSLTSSDDESDQLFVFSEGFLEPSTLFCADAATGQVEKITSTPA 457 

Query: 408 QFDSDGINVQQFWTTSADGERIPYFHVGKNAAP DMPTLVYAYGGFGIPELPHYLGSI 4 64 

+ FD+ G+ QQFW TS DG ++PYF V + PT++YAYGGF IP P Y + 

Sbjct: 458 RFDAGGLQAQQFWATSKDGTKVPYFLVARKDVKLDGTNPTI LYAYGGFQI PMQPS YSAVL 517 

Query: 465 GKYWLEEGNAFVLANIRGGGEFGPRWHQAAQGISKHKSVDDLLAVVRDLSERGISSPEHI 524 

GK WLE+G A+ LANIRGGGEFGP+WH A ++ + DD AV +DL + ++S H+ 
Sbjct: 518 GKLWLEKGGAYALANIRGGGEFGPKWHDAGLKTNRQRVYDDFQAVAQDLIAKKVTSTPHL 577 

Query: 52 5 GLQGGSNGGLITAAAFVREPQSIGALVCEVPLTDMIRYPLLSAGSSWTDEYGNPQKYEVC 584 

G+ GGSNGGL+ ++ P A+V +VPL DM-*- + +SAG+SW EYG+P V 

Sbjct: 578 GIMGGSNGGLLMGVQMIQRPDLWNAVVIQVPLLDMVNFTRMSAGASWQAEYGSPDD-PVE 636 

Query: 585 KRRLGELSPYHNLSDGIDYPPALITTSLSDDRVHPAHALKFYAKLRETSAQSWLYSPDGG 644 

L +SPYHN+ G+ YP TS DDRV PHAKA + +Y G 

Sbjct: 637 GAFLRSI SPYHNVKAGVAYPEPFFETSTKDDRVGPVHARKMAALFEDMGLPFYYYENIEG 696 



Query: 64 5 GHTGNGTQRESADELACVLLFL 666 

GH +E A A +++ 
Sbjct: 697 GHAAAANLQEHARRYALEYIYM 718 

The following partial DNA sequence was 

a04 1-1 . seq 

1 ATGAAATCCT ACCCCGACCC 

51 CGAAACGCAA AACTTCGCTG 

101 TTTTAAACAA CGACAAGGCA 

151 TTGCAGGACA CGCGGCAAAT 

201 GTACCATTTC CATCAAGATG 

251 GTACCGCGGC GACTTACCGT 

301 TCGGTGGCGG ATTTCGACGA 

351 CGTGTCGCAC CTGGTGGAAC 

4 01 AATCGGGCGG CGATACCGCG 

4 51 GAGTTGGTAG AAGGCGGTTT 

501 GTGGCGCGAT GAAAACAGCG 

551 AGTTGACCGA ATCGGGCTAT 

601 AAGAGTTTCG AGGAAAGCCT 

651 GATGGTGAAC GCGTGGCGTT 

7 01 TGATTGAAGC GTCTGACGGT 

7 51 GCCGAAGGCG AAGCGAAACC 

8 01 CGGCTATCTG GCCGGACATC 
8 51 GCGCGAACCA AAGCTATCCG 
901 CGCGGCGAAT TGGGCGCGGC 
951 GGCATTGGAA AGCGTGGAAA 

1001 TGGAAAACGT ACAGGGTCGT 

1051 TGGCAGGAAA CCGAGTTGCC 

1101 CGACCAACCG TGGGGGGGCG 

1151 CCACGCCGCT GACGCTGTTT 

1201 GTCATGCGCC GCCAGCCGCA 

12 51 GCAGTTTTGG ACGACTTCGG 

1301 TCGGCAAAAA CGCCGCGCCC 

1351 GGTTTCGGCA TTCCCGAATT 

14 01 TTGGCTGGAA GAGGGCAATG 



identified in N. meningitidis <SEQ ID 163>: 



CTACCGCCAT 
CTGAAGCGAA 
CGCGCATTGT 
TCCGTTTTGT 
CGGAATATCC 
TCGGGCTATC 
ATTGCTCGGT 
AGCCCAACCG 
TACACGCTGG 
TCACTTTCCG 
TGTGGGTGTG 
CCGCGCGAGG 
GCCGGTGTAC 
ACCTCGATCC 
TTTTACACCA 
GTTAAACCTG 
TTTTGCTGAC 
AGTGGCGCAT 
GCAGCTTTTG 
CGACCAAGCG 
CTGAAAGCGT 
GCGCCTGCCT 
ACGTAGTTTA 
GCATTGGATT 
GCAGTTTGAT 
CTGACGGCGA 
GACATGCCGA 
GCCGCATTAT 
CCTTTGTATT 



TTTGAAAACC 
TGCCGAAACG 
CTGACGGCAT 
CAGGAACACC 
GAAAGGCGTG 
CTGAGTGGAA 
GACGATGTAT 
CGCGTTGTTA 
AAGTGGATTT 
GCAGGCAAAA 
TCCGGCTTGG 
TGTGGCTGGT 
CAAATTGCTG 
GCAGGGTTCG 
AAACCTATTT 
CCCAACGATT 
CTTGCGTAAA 
TGGTAGCAGT 
TTTGCGCCCA 
TTTTGTCGTG 
GGCGTTTTAC 
TCGGGCGCGT 
CCTTGCCGCC 
TGAACGTGAT 
TCAGACGGCA 
GCGCATTCCT 
CGCTGGTCTA 
CTGGGCAGCA 
GGCGAACATC 



TCGATTCCGC 
CGCGCGCGTT 
TTTGGCGCAG 
GCGCGCGGAT 
TACCGCGTGT 
AATCCTGTTT 
ATCTAGGCGG 
ACACTGAGCA 
GGAAGCAGGG 
ACCATGTGTC 
GACGAACGCC 
GGAACGCGGC 
AAGACGGCAT 
CCGATTGATT 
GCAGGTCTCA 
GCGACGTAGT 
GACTGGCACC 
AAAATTAAAC 
ATGAAACGCA 
GCGAGCCTGC 
TGATGGCAAA 
TGGAAATGAC 
AGCGATTTCA 
GGAACTGACC 
TTAACGTGCA 
TATTTCCACG 
TGCCTACGGC 
TTGGCAAATA 
CGCGGCGGCG 
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14 51 GCGAGTTCGG CCCGCGCTGG CATCAGGCGG 

1501 AAAAGCGTTG ATGATTTATT GGCAGTCGTG 

1551 TATCAGTTCG CCCGAACACA TCGGCTTGCA 

1601 TGATTACTGC CGCCGCCTTC GTGCGCGAAC 

1651 GTGTGCGAAG TGCCGCTGAC CGACATGATC 

17 01 CGGTTCAAGC TGGACAGACG AATACGGCAA 

17 51 GCAAACGCCG GTTGGGCGAA TTGTCGCCGT 

1801 ATCGATTATC CGCCCGCGCT CATTACCACC 

1851 CCATCCCGCC CACGCGCTCA AGTTCTACGC 

1901 CGCAATCTTG GCTCTACTCG CCTGACGGCG 

1951 ACGCAGCGCG AAGCCGCCGA CGAACTCGCC 

2001 AGAGTTTTTG GGCTAA 

This corresponds to the amino acid sequence <SEQ ID 164; ORF 041-l.a>: 

a041-l .pep 

1 MKSYPDPYRH FENLDSAETQ NFAAEANAET RARFLNNDKA RALSDGILAQ 

51 LQDTRQIPFC QEHRARMYHF HQDAEYPKGV YRVCTAATYR SGYPEWKILF 

101 SVADFDELLG DDVYLGGVSH LVEQPNRALL TLSKSGGDTA YTLEVDLEAG 

151 ELVEGGFHFP AGKNHVSWRD ENSVWVCPAW DERQLTESGY PREVWLVERG 

201 KSFEESLPVY QIAEDGMMVN AWRYLDPQGS PIDLIEASDG FYTKTYLQVS 

251 AEGEAKPLNL PNDCDWGYL AGHLLLTLRK DWHRANQSYP SGALVAVKLN 

301 RGELGAAQLL FAPNETQALE SVETTKRFVV ASLLENVQGR LKAWRFTDGK 

351 WQETELPRLP SGALEMTDQP WGGDWYLAA SDFTTPLTLF ALDLNVMELT 

401 VMRRQPQQFD SDGINVQQFW TTSADGERIP YFHVGKNAAP DMPTLVYAYG 

4 51 GFGIPELPHY LGSIGKYWLE EGNAFVLANI RGGGEFGPRW HQAAQGISKH 

501 KSVDDLLAVV SDLSERGISS PEHIGLQGGS NGGLITAAAF VREPQSIGAL 

551 VCEVPLTDMI RYPLLSAGSS WTDEYGNPQK YEVCKRRLGE LSPYHNLSDG 

601 IDYPPALITT SLSDDRVHPA HALKFYAKLR ETSPQSWLYS PDGGGHTGNG 

651 TQREAADELA CVLLFLKEFL G* 

a041-l/m041-l -97.9% identity in 671 aa overlap 

10 20 30 40 50 60 

a041-l .pep MKSYPDPYRHFENLDSAETQNFAAEANAETRARFLNNDKARALSDGILAQLQDTRQIPFC 
I I I I I I 1 I M I I I I I !! I I I M I I I I I I I I M I I I : I ! I M I I I I I I I I I i M i i I M I I 
m0 4l-l MKSYPDPYRHFENLDSAETQNFAAEANAETRARFLENDKARALSDGILAQLQDTRQIPFC 

10 20 30 40 50 60 

70 80 90 100 110 120 

a041-l .pep QEHRARMYHFHQDAEYPKGVYRVCTAATYRSGYPEWKILFSVADFDELLGDDVYLGGVSH 
I I I I I I II I I I II I I I I I I I i I I I I I I I I II ! II i I I M I I I I I I I I I M I I ! I I I I ( I I 
m041-l QEHRARMYHFHQDAEYPKGVYRVCTAATYRSGYPEWKILFSVADFDELLGDDVYLGGVSH 

70 80 90 100 110 120 

130 140 150 160 170 180 

a04 1-1 . pep LVEQPNRALLTLSKSGGDTAYTLEVDLEAGELVEGGFHFPAGKNHVSWRDENSVWVCPAW 
I | | | M I I I I I I I I I : I I II I I I I II M I M I I I I I I I I I t i I I I I I I I I M M I I I I I 
m041-l LVEQPNRALLTLSKLGSDTAYTLEVDLEAGELVEGGFHFPAGKNHVSWRDENSVWVCPAW 

130 140 150 160 170 180 

190 200 210 220 230 240 

a041-l .pep DERQLTESGYPREVWLVERGKSFEESLPVYQIAEDGMMVNAWRYLDPQGS PIDLIEASDG 
: I | | I I : I I I I I I I I I I I I I I I I I I I I I I I I I : I I I II I II I I I I I I I M 1 I I I I I M I I 
m041-l NERQLTQSGYPREVWLVERGKSFEESLPVYQIGEDGMMVNAWRYLDPQGS PI DLIEAS DG 

190 200 210 220 230 240 

250 260 270 280 290 300 

a041-l .pep FYTKTYLQVSAEGEAKPLNLPN DCDWGYLAGHLLLTLRKDWHRANQSYP SGALVAVKLN 
I I I I i M : I I I I I I I I 1 II I I I I I I I II I I I I I I I I I I II I I : I I I I I I I I M I I I I I I I 
m041-l FYTKTYLRVSAEGEAKPLNLPNDCDWGYLAGHLLLTLRKDWNRANQSYPSGALVAVKLN 

250 260 270 280 290 300 

310 320 330 340 350 360 

a04 1-1 . pep RGELGAAQLL FAPNETQALE SVETTKRFWASLLENVQGRLKAWRFTDGKWQETELPRLP 
I I I I I I I I I I I I I : I I I I I I I I I I I I M I I I I I I I I I I I I I II I I I : I I 1 I I I : I II I I I 
m041-l RGELGAAQLLFAPDETQALESVETTKRFWASLLENVQGRLKAWRFADGKWQEVELPRLP 

310 320 330 340 350 360 



CGCAGGGAAT 
AGCGATTTGT 
GGGCGGCAGC 
CGCAAAGCAT 
CGTTATCCGC 
TCCGCAAAAA 
ATCACAATCT 
AGCCTGTCCG 
CAAACTGCGC 
GCGGCCATAC 
TGCGTGTTGC 



CAGCAAACAT 
CCGAACGCGG 
AACGGCGGAC 
AGGCGCGCTG 
TGCTCTCCGC 
TACGAAGTCT 
TTCAGACGGC 
ACGATCGCGT 
GAAACCTCGC 
CGGCAACGGC 
TGTTTTTGAA 
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370 380 390 400 410 420 

SGALEMTDQPWGGDWYLAASDFTTPLTLFALDLNVMELTVMRRQPQQFDSDGINVQQFW 
| | | M | | | | I I I I M I I I I I I I I M I i I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I 
SGALEMTDQPWGGDWYLAASDFTTPLTLFALDLNVMELTVMRRQPQQFDSDGINVQQFW 

370 380 3 90 4 00 410 420 

430 440 450 460 470 480 

TTSADGERIPYFHVGKNAAPDMPTLVYAYGGFGIPELPHYLGSIGKYWLEEGNAFVLANI 
! M I I I I I I I II t I I M I I I I I I ! II I I I I I I I M I I I M I I I M I I M I I I i I I II I I I 
TTSADGERI PYFHVGKNAAPDMPTLVYAYGGFGI PELPHYLGSIGKYWLEEGNAFVLANI 

430 440 450 460 470 480 

490 500 510 520 530 540 

RGGGEFGPRWHQAAQGISKHKSVDDLLAVVSDLSERGISSPEHIGLQGGSNGGLITAAAF 

M I I I I I I I I I I M I I I 1 I I I I M I I I M I I I I I I I I I I I I I M M I I 1 1 I I I II II II 
RGGGEFGPRWHQAAQGISKHKSVDDLLAVVRDLSERGISSPEHIGLQGGSNGGLITAAAF 
490 500 510 520 530 540 

550 560 570 580 590 600 

VREPQSIGALVCEVPLTDMIRYPLLSAGSSWTDEYGNPQKYEVCKRRLGELSPYHNLSDG 
M I I I I I II I I I I I I I I II I I M I II I I I 1 I I M I I I I 1 I I I I I I I I I I M 1 I I I I I I I I 
VREPQSIGALVCEVPLTDMIRYPLLSAGSSWTDEYGNPQKYEVCKRRLGELSPYHNLSDG 
550 560 570 580 590 600 

610 620 630 64 0 650 660 

IDYPPALITTSLSDDRVHPAHALKFYAKLRETSPQSWLYSPDGGGHTGNGTQREAADELA 
I | | | | | | | | | | I I I II I I I I I I I It I I I I I I 1 I I M I I I I I I I I I I I I I I I I I : I I I I I 
IDYPPALITTSLSDDRVHPAHALKFYAKLRETSAQSWLYSPDGGGHTGNGTQRESADELA 

610 620 630 640 650 660 

670 

CVLLFLKEFLGX 
I I I I I I I I i I I I 
CVLLFLKEFLGX 
670 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 165>: 

g042 . seq 

1 ATGACGATGA TTTGCTTGCG CTTCCAagcG TTCGTGCCGC ATACCAGCGC 

51 GTTATCCAAC ACTTCCACGG CAGCCGGCCC TTCCTGCCCG ATGGCGGCGG 

101 TGCGGTCGAT GATGAAAATC CAGCCGGGGT TTTTCTCTTT GATGTATTCG 

151 AAGGAAACGG GCTGCCCGTG CCCTTCGTTG CGTAAAGATT CGTCCACGGG 

201 CGGCAGGCCG ATGTCGCCGT GTATCCAACT TGCCAACCGC GATTGCGTGC 

2 51 CGAAGGCGGA CACCTTGTTG CCTGTAACCG ACAGCACCAG CCCGCGTCCT 
301 TTGCCTTTGG cggCTTCGCG CTTTTGGGCG AACAGCGCGT CAATCTGCGC 

3 51 ATTCAATTCC GCCACGCGCG CTTCCTTACC GAAAATCCGC GACAGGGTCT 

4 01 CCATCTGCTT CTCGCCGCTG GTGCGGATAT TGCCGTTGTC CACCGTCAAA 
4 51 TCTATGgtgG TCGCGTTTTT CGCCAACTGT TCATACGCTT CCGCACCCGG 
501 CCCGCCGGTA ATGACAAACT GCGGATTGTG GCGGTGCAGG GATTCGCAAT 
551 CGGGCTCAAA CAGCGTCCCC ACCGTTGCCG CCTTGTCAAA TGCAGGCTGC 
601 AAATAG 

This corresponds to the amino acid sequence <SEQ ID 166; ORF 042.ng>: 

g042 .pep 

1 MTMICLRFQA FVPHTSALSN TSTAAGPSCP MAAVRSMMKI QPGFFSLMYS 

51 KETGCPCPSL RKDSSTGGRP MSPCIQLANR DCVPKADTLL PVTDSTSPRP 

101 LPLAASRFWA NSASICAFNS ATRASLPKIR DR VSICFSPL VRILPLSTV K 

151 SMWAFFANC SYASAPGPPV MTNCGLWRCR DSQSGSNSVP TVAALSNAGC 

201 K* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ED 167>: 



a04 1-1 . pep 
m041-l 

a04 1-1 . pep 
m04 1-1 

a041-l .pep 
m041-l 

a04 1-1 .pep 
m041-l 

a04 1-1 . pep 
m041-l 

a04 1-1 . pep 
m041-l 
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m042 . seq 




1 


ATGACGATGA 


51 


GTTATCCAmT 


101 


TACGGTCGAT 


151 


AAGGAAACAG 


201 


CGGTAGGCCG 


251 


CGAAGGCGGA 


301 


TTGCCTTTGG 


351 


CTTCAATTCC 


401 


CCATCTGCTT 


451 


TCTATGGTGG 


501 


CCCGCCGGTA 


551 


CGGGCTCGAA 


601 


AAATAA 



TTTGCTTGCG CTTCCAAGCG 
ACTTCGACAG CCGcCGGCCy 
GATGAAAATC CAATCGGGGT 
GCTGCCCGTG CCCCTCGTTG 
ATGTCGCCGT GTATCCAACT 
CACCTTGTTG CCCGTAACCG 
CGGCTTCGCG CGTTTGGGCG 
GCCGCGCGCG CTTCCTTGCC 
TTCGCCGCTG GTGCGGATAT 
TCGCGTTTTT CGCTAACTGT 
ATGACAAGCT GAGGATTGTA 
CAGCGTCCCC ACCGTTGCCG 



TTCGTGCCGC GTACCAGCGC 
TTCyTGCCCG ATGGCGGCGG 
TTTTCTCTTT GATGTATTCG 
CGTAAAGATT CGTCTACAGG 
TGCCAACCGC GACTGCGTGC 
ACAGCACCAG CCCGCGTCCT 
AACAGCGCGT CAATCTGCGC 
GAAAATCCGC GCCAAGGTCT 
TGCCGTTGTC CACCGTCAGA 
TCATACGCTT CCGCGCCCGG 
GCGGTGCAGG GCTTCGTAAT 
CCTTGTCAAA TGCAGGCTGC 



This corresponds to the amino acid sequence <SEQ ED 168; ORF 042>: 

m042 .pep 

1 MTMICLRFQA FVPRTSALSX TSTAAGXSCP MAAVRSMMKI QSGFFSLMYS 

51 KETGCPCPSL RKDSSTGGRP MSPCIQLANR DCVPKADTLL PVTDSTSPRP 

101 LPLAASRVWA NSASICAFNS AARASLPKIR AK VSICFSPL VRILPLSTV R 

151 SMWAFFANC SYASAPGPPV MTSXGLXRCR ASXSGSNSVP TVAALSNAGC 

201 K* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 169>: 

a042 . seq 

1 ATGACGATGA TTTGCTTGCG CTTCCAAGCG TTCGTGCCGC GTACCAGCGC 

51 GTTATCCAAT ACTTCGACAG CCGCCGGCCC TTCCTGCCCG ATGGCGGCGG 

101 TACGGTCGAT GATGAAAATC CAATCGGGGT TTTTCTCTTT GATGTATTCG 

151 AAGGAAACAG GCTGCCCGTG CCCCTCGTTG CGTAAAGATT CGTCTACAGG 

201 CGGTAGGCCG ATGTCGCCGT GTATCCAACT TGCCAACCGC GACTGCGTGC 

2 51 CGAAGGCGGA CACCTTGTTG CCCGTAACCG ACAGCACCAG CCCGCGTCCT 

301 TTGCCTTTGG CGGCTTCGCG CGTTTGGGCG AACAGCGCGT CAATCTGCGC 

351 CTTCAATTCC GCCGCGCGCG CTTCCTTGCC GAAAATCCGC GCCAAGGTCT 

4 01 CCATCTGCTT TTCGCCGCTG GTGCGGATAT TGCCGTTGTC CACCGTCAGA 

4 51 TCTATGGTGG TCGCGTTTTT CGCCAACTGT TCATACGCTT CCGCGCCCGG 

501 CCCGCCGGTA ATGACAAGCT GAGGATTGTA GCGGTGCAGG GCTTCGTAAT 

551 CGGGCTCGAA CAGCGTCCCC ACCGTTGCCG CCTTGTCAAA TGCAGGCTGC 

601 AAATAA 



This corresponds to the amino acid sequence <SEQ ID 170; ORF 042. a>: 

a042 .pep 

1 MTMICLRFQA FVPRTSALSN TSTAAGPSCP MAAVRSMMKI QSGFFSLMYS 

51 KETGCPCPSL RKDSSTGGRP MSPCIQLANR DCVPKADTLL PVTDSTSPRP 

101 LPLAASRVWA NSASICAFNS AARASLPKIR A KVSICFSPL VRILPLSTV R 

151 SMWAFFANC SYASAPGPPV MTS*GL*RCR AS*SGSNSVP TVAALSNAGC 

201 K* 



m042/a042 99.0% identity over a 201 aa overlap 



10 20 30 40 50 60 

m04 2 .pep MTMICLRFQAFVPRTSALSXTSTAAGXSCPMAAVRSMMKIQSGFFSLMYSKETGCPCPSL 
I I I I I I I I I I M I I M I I I MINI I I I I II I I I I I I I I M I M I M I I I If I I M I I 
a04 2 MTMICLRFQAFVPRTSALSNTSTAAGPSCPMAAVRSMMKIQSGFFSLMYSKETGCPCPSL 

10 . 20 30 40 50 60 

70 80 90 100 110 120 

m04 2 .pep RKDSSTGGRPMSPCIQLANRDCVPKADTLLPVTDSTSPRPLPLAASRVWANSASICAFNS 
I I I I I I II I I I I 1 E I i i I I M I I I M I M I I I M I M I II I II I 11 I M I I I I I I I I I I I 
a04 2 RKDSSTGGRPMSPCIQLANRDCVPKADTLLPVTDSTSPRPLPLAASRVWANSASICAFNS 

70 80 90 100 110 120 



m042 . pep 



130 140 150 1.60 170 180 

AARASLPKIRAKVS ICFS PLVRI LPLSTVRSMVVAFFANCSYAS APGPPVMTSXGLXRCR 
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I I M II I I I II I I I I I II I I I I I I I I M t I I I I I I I I I I I I I I I I t I I I I I I I I ! I t I I I 
a04 2 AARASLPKIRAKVSICFSPLVRILPLSTVRSMVVAFFANCSYASAPGPPVMTSXGLXRCR 

130 140 150 160 170 180 

190 200 
m04 2 .pep ASXSGSNSVPTVAALSNAGCKX 
I I M I I I I I I I M I I I I I I I M 
a 0 4 2 ASXSGSNSVPTVAALSNAGCKX 

190 200 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 042 shows 93.0% identity over a 201 aa overlap with a predicted ORF (ORF 042.ng) 
from N. gonorrhoeae: 

mO42/g042 

10 20 30 40 50 60 

m04 2 . pep MTMICLRFQAFVPRTSALSXTSTAAGXSCPMAAVRSMMKIQSGFFSLMYSKETGCPCPSL 
I | | M I I I 1 I I I I : I I I I I MINI I I 1 1 I I I I I I I I I I I I I I I I I I I I M I I II I I 
g 04 2 MTMICLRFQAFVPHTSALSNTSTAAGPSCPMAAVRSMMKIQPGFFSLMYSKETGCPCPSL 

10 20 30 40 50 60 

70 80 90 100 110 120 

m04 2 . pep RKDSSTGGRPMSPCIQLANRDCVPKADTLLPVTDSTSPRPLPLAASRVWANSASICAFNS 
I | | | I i I I I [ I I I I I i 11 I I I I I I I I I I II II I M I I i I I I I I I M I i I II I I I II I II 
g04 2 RKDSSTGGRPMSPCIQLANRDCVPKADTLLPVTDSTSPRPLPLAASRFWANSASICAFNS 

70 80 90 100 110 120 

130 140 150 160 170 180 

m04 2 .pep AARASLPKIRAKVSICFSPLVRILPLSTVRSMVVAFFANCSYASAPGPPVMTSXGLXRCR 
| : | | | H 1 M : M II I M I I I I M II I I : I I I M I I I I I I I I I I M I I I I i : ! I III 
g 04 2 ATRASLPKIRDRVS ICFS PLVRI LPLSTVKSMWAFFANCSYASAPGPPVMTNCGLWRCR 

130 140 150 160 170 180 

190 200 
m04 2 . pep ASXSGSNSVPTVAALSNAGCKX 
I I I I I I I I I I I I I I I I I I I I 
g04 2 DSQSGSNSVPTVAALSNAGCKX 

190 200 

The following partial DNA sequence was identified in AT. meningitidis <SEQ ID 171>: 

m042-l . seq 

1 ATGACGATGA TTTGCTTGCG CTTCCAAGCG TTCGTGCCGC GTACCAGCGC 

51 GTTATCCAAT ACTTCGACAG CCGCCGGCCC TTCCTGCCCG ATGGCGGCGG 

101 TACGGTCGAT GATGAAAATC CAATCGGGGT TTTTCTCTTT GATGTATTCG 

151 AAGGAAACAG GCTGCCCGTG CCCCTCGTTG CGTAAAGATT CGTCTACAGG 

201 CGGTAGGCCG ATGTCGCCGT GTATCCAACT TGCCAACCGC GACTGCGTGC 

251 CGAAGGCGGA CACCTTGTTG CCCGTAACCG ACAGCACCAG CCCGCGTCCT 

301 TTGCCTTTGG CGGCTTCGCG CGTTTGGGCG AACAGCGCGT CAATCTGCGC 

351 CTTCAATTCC GCCGCGCGCG CTTCCTTGCC GAAAATCCGC GCCAAGGTCT 

4 01 CCATCTGCTT TTCGCCGCTG GTGCGGATAT TGCCGTTGTC CACCGTCAGA 

4 51 TCTATGGTGG TCGCGTTTTT CGCTAACTGT TCATACGCTT CCGCGCCCGG 

501 CCCGCCGGTA A 

This corresponds to the amino acid sequence <SEQ ID 172; ORF 042-l>: 

m04 2-1 . pep 

1 MTMICLRFQA FVPRTSALSN TSTAAGPSCP MAAVRSMMKI QSGFFSLMYS 
51 KETGCPCPSL RKDSSTGGRP MSPCIQLANR DCVPKADTLL PVTDSTSPRP 
101 LPLAASRVWA NSASICAFNS AARASLPKIR A KVSICFSPL VRILPLSTV R 
151 SMWAFFANC SYASAPGPPV MTS + 



m042-l/g042 95.4% identity in 173 aa overlap 
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10 20 30 40 50 60 

m04 2-l -pep MTMICLRFQAFVPRTSALSNTSTAAGPSCPMAAVRSMMKIQSGFFSLMYSKETGCPCPSL 
M I M I I I I I I I I : I I I I I M I i II ! I I I I II II I I I I I If II I I I I II I I I ( I I I I I I 
g 04 2 MTMICLRFQAFVPHTSALSNTSTAAGPSCPMAAVRSMMKIQPGFFSLMYSKETGCPCPSL 

10 20 30 40 50 60 

70 80 90 100 110 120 

m04 2-l .pep RKDSSTGGRPMSPCIQLANRDCVPKADTLLPVTDSTSPRPLPLAASRVWANSASICAFNS 
I M I I I I I I I I I I I I M II I I 11 II I 11 I I I I M M II I I I I I I I M I I I 11 I I I I I I I 
g 04 2 RKDSSTGGRPMSPCIQLANRDCVPKADTLLPVTDSTSPRPLPLAASRFT7ANSASICAFNS 

70 80 90 100 110 120 

130 140 150 160 170 

m04 2-l .pep AARASLPKIRAKVSICFSPLVRILPLSTVRSMVVAFFANCS YASAPGPPVMTSX 

I : I I I M I I I I I M 1 1 I I I I I I I I I f I : I I I I M I I I I II I I I 1 I I I I I I : 
g 04 2 ATRASLPKIRDRVSICFSPLVRILPLSTVKSMVVAFFANCSYASAPGPPVMTNCGLWRCR 

130 140 150 160 170 180 

g04 2 DSQSGSNSVPTVAALSNAGCKX 

190 200 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 173>: 

a042-l . seq 

1 ATG AC G AT G A TTTGCTTGCG CTTCCAAGCG TTCGTGCCGC GTACCAGCGC 

51 GTTATCCAAT ACTTCGACAG CCGCCGGCCC TTCCTGCCCG ATGGCGGCGG 

101 TACGGTCGAT GATGAAAATC CAATCGGGGT TTTTCTCTTT GATGTATTCG 

151 AAGGAAACAG GCTGCCCGTG CCCCTCGTTG CGTAAAGATT CGTCTACAGG 

201 CGGTAGGCCG ATGTCGCCGT GTATCCAACT TGCCAACCGC GACTGCGTGC 

251 CGAAGGCGGA CACCTTGTTG CCCGTAACCG ACAGCACCAG CCCGCGTCCT 

301 TTGCCTTTGG CGGCTTCGCG CGTTTGGGCG AACAGCGCGT CAATCTGCGC 

351 CTTCAATTCC GCCGCGCGCG CTTCCTTGCC GAAAATCCGC GCCAAGGTCT 

4 01 CCATCTGCTT TTCGCCGCTG GTGCGGATAT TGCCGTTGTC CACCGTCAGA 

4 51 TCTATGGTGG TCGCGTTTTT CGCCAACTGT TCATACGCTT CCGCGCCCGG 

501 CCCGCCGGTA A 

This corresponds to the amino acid sequence <SEQ ED 174; ORF 042-1 .a>: 

a042-l.pep 

1 MTMICLRFQA FVPRTSALSN TSTAAGPSCP MAAVRSMMKI QSGFFSLMYS 
51 KETGCPCPSL RKDSSTGGRP MSPCIQLANR DCVPKADTLL PVTDSTSPRP 
101 LPLAASRVWA NSASICAFNS AARASLPKIR AKVSICFSPL VRILPLSTVR 
151 SMWAFFANC SYASAPGPPV MTS* 

m042-l/a042-l 100.0% identity in 173 aa overlap 

10 20 30 40 50 60 

m042-l.pep MTMICLRFQAFVPRTSALSNTSTAAGPSCPMAAVRSMMKI QSGFFSLMYS KETGCPCPSL 
I I I I I 1 I I I I I M I I I If I I I I I I I I I I II I I I I I I I I I I I I I I I 1 I I t I I M I I I I I I I 
a04 2-l MTMI CLR FQAFVPRTSALSNTSTAAG PS CPMAAVRSMMK I QSGFFSLMYS KETGCPCPSL 

10 20 30 40 50 60 

70 80 90 100 110 120 

m042-l .pep RKDSSTGGRPMSPCIQLANRDCVPKADTLLPVTDSTSPRPLPLAASRVWANSASICAFNS 
I I I I I I I I I I I I 1 I I I I M 1 I I M I II I M I M I I I II I I I I I I I I I I I I I II I I I I I M 
a04 2-l RKDSSTGGRPMSPCIQLANRDCVPKADTLLPVTDSTSPRPLPLAASRVWANSASICAFNS 

70 80 90 100 110 120 

130 140 150 160 170 

m042-l.pep AARAS LPK I RAKVS I CFSPLVRILPLSTVRSMWAF FAN CS YASAPGPPVMTSX 
I I I II I I I I I I I I I I I I I I I I I I M I I I I 1 II I I I M I I I I I I I I I I I I I 1 I I I 
a04 2-l AARAS LPK I RAKVS I CFS PLVRI LPLSTVRSMWAFFANCS YASAPGPPVMTSX 

130 140 150 160 170 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ED 175>: 
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g04 3 . seq 

1 ATGGTTGTTT CAAATCAAAA TATCTATGCC GTCGGCCCAT CAGCACTTTT 

51 TCACATCCGA AGGCAAAAAT CCGTAATGCC GCCTGAACGC TTCgttgaAC 

101 CGTCCCGCGT ggcggtagcc gcAAAAGTGC ATcGCGGCTT GGATGGTGCT 

151 GCCCGATTCG ATGAGGGcga gcGCGTGTTC CAGCCGCAGG CGGCGCAGGC 

201 GTCCGGCGAC GGTTTCGCCG GTTTGCGCTT TGAAATAGCG TTTCAGGTAG 

2 51 CATTCGTTCA GCCCGACGCG GCGGGCGATT TCGGCGATGG TCAGCGGGCG 

301 GGCGAATTCG CTGTTCAAAA TATCGGCGGC TTCGTCTATG CGCCGGCGGC 

351 GGTAGCCGTT GTCGTGGCGG CGGAAGGTGA AGCGTAA 

This corresponds to the amino acid sequence <SEQ ID 176; ORF 043 .ng>: 

g043 . pep 

1 MWSNQNIYA VGPSALFHIR RQKSVMPPER FVEPSRVAVA AKVHRGLDGA 

51 ARFDEGERVF QPQAAQASGD GFAGLRFEIA FQVAFVQPDA AGDFGDGQRA 

101 GEFAVQN I GG FVYAPAAVAV VVAA EGEA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 177>: 

m043 . seq 

1 ATGGTTGTTT CAAATCAAAA TATCTATGCC GCCGGCCCCT CAGCACTTCT 

51 TCACATCCGA AGGCAAAAAT CCGTAATGCC GTCTGAACGC TTCGTTGAAC 

101 CGTCCCGCGT GGCGGTAGCC GCAAAAGTGC ATGGCGGCTT GGACGGTGCT 

151 GCCGGATTCG ATGAGGGCGA GCGCGTGTTC CAGCCGCAGG CGGCGCAgGC 

201 ATCCGGCGAC GGTTTCGCCG GTTTGCGCTT TGAAATAGCG TTTCAGGTAG 

2 51 CATTCGTTCA GTCCGACGCG GCGGGCGATT TCGGCGATGG TCAGCGGACG 

301 GGCGAATTCG TGTTGCAGGA TGTCGGCGGC TTCGTCTATG CGCCGACGGC 

351 GGTAACCGTT GTCGTGGCGG CGGAAGGTGA AGCGCAATAA 

This corresponds to the amino acid sequence <SEQ ID 178; ORF 043>: 

m043 .pep 

1 MWSNQNIYA AGPSALLHIR RQKSVMPSER FVEPSRVAVA AKVHGGLDGA 

51 AGFDEGERVF QPQAAQASGD GFAGLRFEIA FQVAFVQSDA AGDFGDGQRT 

101 GEFVLQDVGG FVYAPTAVTV WAAEGEAQ* 

Computer analysis of the amino acid sequences gave the following results: 
Homology with a predicted ORF from N. meningitidis menA with menB 

ORF 043 shows 89.8% identity over a 128 aa overlap with a predicted ORF (ORF043.a) from 
K gonorrhoeae: 

m043/g043 

m04 3 . pep 
g043 



m04 3 .pep 
g043 



10 20 30 40 50 60 

MVVSNQNI YAAGPSALLHIRRQKSVMPSERFVEPSRVAVAAKVHGGLDGAAGFDEGERVF 

I t I t I I I I I I : I M i I : I I M I I I I I ! I I I I I I I I I I M M I I MINI I ! I M I I I 
MVVSNQNI YAVGPSALFHIRRQKSVMPPERFVEPSRVAVAAKVHRGLDGAARFDEGERVF 
10 20 30 40 50 60 

70 80 90 100 110 120 

QPQAAQASGDGFAGLRFE I AFQVAFVQSDAAGDFGDGQRTGEFVLQDVGG FVYAPTAVTV 

M 1 1 I I I I ! I 1 I I I I 11 I 11 I I 1 M I I I I I I M I I I I I : I I I : : I : : II II I M : It : I 
QPQAAQASGDGFAGLRFE I AFQVAFVQPDAAGDFGDGQRAGEFAVQNIGGFVYAPAAVAV 
70 80 90 100 110 120 



130 

mO 4 3 . pep VVAAEGEAQX 

MINIM 
g04 3 WAAEGEAXX 

130 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 179>: 

a043. seq 

1 ATGGTTGTTT CAAATCAAAA TATCTATGCC GCCGGCCCCT CAGCACTTCT 

51 TCACATCCGA AGGCAAAAAT CCGTAATGCC GTCTGAACGC TTCGTTGAAC 

101 CGTCCCGCGT GGCGGTAGCC GCAAAAGTGC ATGGCGGCTT GGACGGTGCT 
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151 GCCGGATTCG ATGAGGGCGA GCGCGTGTTC CAGCCGCAGG CGGCGCAGGC 

2 01 ATCCGGCGAC GGTTTCGCCG GTTTGCGCTT TGAAATAGCG TTTCAGGTAG 

2 51 CATTCGTTCA GTCCGACGCG GCGGGCGATT TCGGCGATGG TCAGCGGACG 

301 GGCGAATTCG TGTTGCAGGA TGTCGGCGGC TTCGTCTATG CGCCGACGGC 

351 GGTAACCGTT GTCGTGGCGG CGGAAGGTGA AGCGCAATAA 



This corresponds to the amino acid sequence <SEQ ID 180; ORF 043.a>: 

a043.pep 

1 MVVSNQNIYA AGPSALLHIR RQKSVMPSER FVEPSRVAVA AKVHGGLDGA 

51 AGFDEGERVF QPQAAQASGD GFAGLRFEIA FQVAFVQSDA AGDFGDGQRT 

101 GEFVLQD VGG FVYAPTAVTV WAA EGEAQ + 

m043/a043 100.0% identity in 129 aa overlap 

10 20 30 40 50 60 

m04 3 . pep MWSNQNIYAAGPSALLHIRRQKSVMPSERFVEPSRVAVAAKVHGGLDGAAGFDEGERVF 
I t I I II I i I I I I I I I 1 I M ! I I II I I M I I I 1 I I I M I 1 I I I I I I M I I I I 1 I I I I I 1 I I 
a 04 3 MWSNQNIYAAGPSALLHIRRQKSVMPSERFVEPSRVAVAAKVHGGLDGAAGFDEGERVF 

10 20 30 40 50 60 



70 80 90 100 110 120 

m04 3 pep QPQAAQASGDGFAGLRFEIAFQVAFVQSDAAGDFGDGQRTGEFVLQDVGGFVYAPTAVTV 
| I I I I I II I I I I I M I I II I I I I I I I I I I I I I I I M I I I II I I I II I I I I I I I M I I I I I 
a 04 3 QPQAAQASGDGFAGLRFEIAFQVAFVQSDAAGDFGDGQRTGEFVLQDVGGFVYAPTAVTV 

70 80 90 100 110 120 



130 

m0 4 3 . pep VVAAEGEAQX 
M I I I I I I M 
a0 4 3 VVAAEGEAQX 

130 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 181>: 

g044 . seq 

1 ATGCTGCCCG ACCAGAGCGT CGAGTTCTTG CCACAAGTCG TCGTTTTTGA 

51 CGGGCTGTTT GGCGGCGGTT TTCCAGCCGT TGCGCTTCCA ACCGTGTATC 

101 CAGTTTTCCA TGCCGTTTTT GACGTATTGC GAGTCGGTGC AGATGAT G AC 

151 GGTGCAGCGG CGTTTGAGCG ATTTCAGCCC TTCGATAACG GCGGTCAGCT 

201 CCATGCGGTT GTTGGTGGTT TGCGCTTCGC CGCCGAAAAG TTCTTTTTCG 

251 CGGCTGCCGT AGCGCATTAA 



This corresponds to the amino acid sequence <SEQ ID 182; ORF 044.ng>: 

g044 .pep 

1 MLPDQSVEFL PQ VWFDGLF GGGFPAVAL P TVYPVFHAVF DVLRVGADDD 
51 GAAAFERFQP FDNGGQLHAV VGGLRFAAEK FFFAAAVAH* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 183>: 

m044 . seq 

1 ATGCCGTCCG ACTAGAGCGT CGAGTTCTTT CCAGAAGTCG TCGTTTTTGA 

51 CGGGCTGTTT GGAGGCGGTT TTCCAGCCGT TGCGCTTCCA ACCGTGTATC 

101 CAGTTTTCCA TGCCATTTTT GACGTATTGC GAGTCGGTGC AGATGATGAC 

151 GGTGCAGCGG CGTTTGAGCG ATTTCAGTCC TTCGATGACG GCAGTCAGTT 

2 01 CCATGCGGTT GTTGGTGGTT TGCGCTTCGC CGCCGAAAAG TTCTTTTTCG 

2 51 TGGCTACCGT AG C G C Ay T Aa 



This corresponds to the amino acid sequence <SEQ ID 184; ORF 044>: 

m044 .pep 

1 MPSDXSVEFF PE VWFDGLF GGGFPAVAL P TVYPVFHAIF DVLRVGADDD 
51 GAAAFERFQS FDDGSQFHAV VGGLRFAAEK FFFVATVAH* 
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The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 85>: 

a044 . seq 

1 GTGCCGTCCG ACCAGCGCGT CGAGTTCTTT CCACAAGTCG TCGTTTTTGA 

51 CGGGCTGTTT GGCGGCGGTT TTCCAGCCGT TGCGCTTCCA ACCGTGTATC 

101 CAGTTTTCCA TGCCGTTTTT GACGTATTGC GAGTCGGTGC AGATGATGAC 

151 GGTGCAGCGG CGTTTGAGCG ATTTCAGTCC TTCGATGACG GCGGTCAGTT 

201 CCATACGGTT GTTGGTGGTT TGCGCTTCGC CGCCGAAAAG TTCTTTTTCG 

251 TGGCTGCCGT AGCGCATTAA 

This corresponds to the amino acid sequence <SEQ ID 186; ORF 044.a>: 

a04 4 . pep 

1 VPSDQRVEFF FQ WVFDGLF GGGFPAVAL P TVYPVFHAVF DVLRVGADDD 
51 GAAAFERFQS FDDGGQFHTV VGGLRFAAEK FFFVAAVAH* 



m044/a044 91.0% identity over a 89 aa overlap 

10 20 30 40 50 60 

m04 4 .pep MPSDXSVEFFPEVWFDGLFGGGFPAVALPTVYPVFHAIFDVLRVGADDDGAAAFERFQS 
: | | | I | | I I : I It I I I I I I I I I II I I I I I I I M It I : I I I I I I I t f I I I i I I I M I M 
a04 4 VPSDQRVEFFPQVWFDGLFGGGFPAVALPTVYPVFHAVFDVLRVGADDDGAAAFERFQS 

1-0 20 30 40 50 60 



70 80 90 

m04 4 . pep FDDGSQFHAWGGLRFAAEKFFFVATVAHX 
I i I I : I I I : I I I I I ! I I I I I I I I M : I I 11 
a04 4 FDDGGQFHTVVGGLRFAAEKFFFVAAVAHX 

70 80 90 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 044 shows 86.5% identity over a 89 aa overlap with a predicted ORF (ORF 044.ng) 
from N. gonorrhoeae: 

m044/g044 

10 20 30 40 50 60 

m04 4 . pep MPSDXSVEFFPEVWFDGLFGGGFPAVALPTVYPVFHAI FDVLRVGADDDGAAAFERFQS 

I I I I ! h h I I I I 1 1 I I 1 1 1 1 I M I I 1 1 1 1 1 I I II : I I I I 1 1 1 I I I I 1 1 1 1 I I I I I 

g044 MLPDQSVEFLPQVWFDGLFGGGFPAVALPTVYPVFHAVFDVLRVGADDDGAAAFERFQP 

10 20 30 40 50 60 



70 80 90 

m04 4 . pep FDDGSQFHAWGGLRFAAEKFFFVATVAHX 
I hhhl : M i Ml' M MlhhMM 
g0 4 4 FDNGGQLHAWGGLRFAAEKFFFAAAVAHX 

70 80 90 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 187>: 

g046 . seq 

1 ATGTCGGCAA TGCTGCGTCC GACAAGCAGC CCGCCGCgcc gCGCCTGTAT 

51 G ATG AC CAT C CGCACGCGGT CGTCTGCAAA ACGTAAAACC TGCAATGCGC 

101 CCGGGCAGTC TATCAGGCCG GCAAGCTGTT CGGTAACGAG CTGTTCGGGG 

151 CTGATGGTTT CGGTTATGCC gaATATGGAA AGGCTGCCGt TTTcGTTGTT 

201 TTCGAGCTTG GGGCTGAGGT ATTCGAGGTA TtcgctGGAA CGGACGCGCG 

2 51 CGATGCGGCC GGGGATGTTG AACAGGTCGG CGGCAACTTT GCAGGCGACG 
301 ATGTTGGTTT CGTCGCTGCG GGagaGCGCG AGcagcaagt cggcatcttC 

3 51 CgcgccggcG Cgttataatg t g AAGGGGGA TGCGccgttg ccgaAAACGG 

4 01 TTTGGacatC gaggcggctg CCTGTTTCCT GCAATGCTTT TTCGTCGATG 
4 51 TCGATAAcgg TTACGTCGTT GTTGGTGATG GCGGCAAGGT TTTGCGCGAC 
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501 GGTAGAACCT ACCTGCCCGT TGCCTAAAAT GAGGATTTTC ACGGTATGGG 
551 TCGCCGGGTG A 



This corresponds to the amino acid sequence <SEQ ID 188; ORF 046.ng>: 

g046 .pep 

1 MSAMLRPTSS PPRRACMMTI RTRSSAKRKT CNAPGQSIRP ASCSVTSCSG 

51 LMVSVMPNME RLPFSLFSSL GLRYSRYSLE RTRAMRPGML NRSAATLQAT 

101 MLVSSLRESA SSKSASSAPA RYNVKGDAPL PKTVWTSRRL PVSCNAFSSM 

151 SITVTSLLVM AARFCATVEP TCPLPKMRIF TVWVAG* 



The following partial DNA sequence 



m046 . seq 






1 


ATGTCGGCAA 


TGCTGCGTCC 


51 


GATGACCATC 


CGCACGCGGT 


101 


CCGGGCAGTC 


TATCAGGCCG 


151 


CTGATGGTTT 


CGGTTATGCC 


201 


TTCGAGCTTG 


GGGCTGAGGT 


251 


CGATGCGGCC 


GGGGATGTTG 


301 


ATGTTGGTTT 


CGTCGCTGCG 


351 


CGCGCCGGCG 


CGTTCTAATG 


401 


TTTGGACATC 


GAGGCGGCTG 


451 


TCGATAACGG 


TTACGTCGTT 


501 


GGTAGAACCT 


ACCTGTCCGT 


551 


TCGCCGAGTG 


A 



identified in N. meningitidis <SEQ ID 189>: 

GACAAGCAsT CCGC . r . sGC gCGcCTGTAT 
CGTCTGCAAA ACGTAAAACC TGCAATGCGC 
GCAAGCTGTT CGGTAACGAG CTGTTCGGGG 
GAATATGGAA AGGCTGCCGT TTTCGTTGTT 
ATTCGAGGTA TTCGCTGGAA CGGACGCGCG 
AACAGGTCGG CGGCAACTTT GCAGGCGACG 
GGAGAGCGCG AGCAGCAAGT CGGCATCTTC 
TGAAGGGGGA TGCGCCGTTG CCGAAAACGG 
CCTGTTTCCT GCAATGCTTT TTCGTCGATG 
GTTGGGTATG GCGGCAAGGT TTTGTGCGAC 
TGCCTAAAAT GAGGATTTTC ACGGTGTGGG 



This corresponds to the amino acid sequence <SEQ ID 190; ORF 046>: 

m04 6 -pep 

1 MSAMLRPTSX PXXRACMMTI RTRSSAKRKT CNAPGQSIRP ASCSVTSCSG 

51 LMVSVMPNME RLPFSLFSSL GLRYSRYSLE RTRAMRPGML NRSAATLQAT 

101 MLVSSLRESA SSKSASSAPA RSNVKGDAPL PKTVWTSRRL PVSCNAFSSM 

151 SITVTSLLGM AARFCATVEP TCPLPKMRIF TVWVAE* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 191>: 

a04 6 . seq 

1 ATGTCGGCAA TGCTGCGTCC GACAAGCAGT CCGCCGCGCC GCGCCTGTAT 

51 GATGACCATC CGCACGCGGT CGTCTGCAAA ACGTAAAACC TGCAATGCGC 

101 CCGGGCAGTC TATCAGGCCG GCAAGCTGTT CGGTAACGAG CTGTTCGGGG 

151 CTGATGGTTT CGGTTATGCC GAATATGGAA AGGCTGCCGT TTTCGTTGTT 

201 TTCGAGCTTG GGGCTGAGGT ATTCGAGGTA TTCGCTGGAA CGGACGCGCG 

251 CGATGCGGCC GGGGATGTTG AACAGGTCGG CGGCAACTTT GCAGGCGACG 

301 ATGTTGGTTT CGTCGCTGCG GGAGAGCGCG AGCAGCAAGT CGGCATCTTC 

351 CGCGCCGGCG CGTTCTAATG TGAAGGGGGA TGCGCCGTTG CCGAAAACGG 

4 01 TTTGGACATC GAGGCGGCTG CCTGTTTCCT GCAATGCTTT TTCGTCGATG 

4 51 TCGATAACGG TTACGTCGTT GTTGGGTATG GCGGCAAGGT TTTGTGCGAC 

501 GGTAGAACCT ACCTGTCCGT TGCCTAAAAT GAGGATTTTC ACGGTGTGGG 

551 TCGCCGAGTG A 

This corresponds to the amino acid sequence <SEQ ID 192; ORF 046. a>: 

a046.pep 

1 MSAMLRPTSS PPRRACMMTI RTRSSAKRKT CNAPGQSIRP ASCSVTSCSG 

51 LMVSVMPNME RLPFSLFSSL GLRYSRYSLE RTRAMRPGML NRSAATLQAT 

101 MLVSSLRESA SSKSASSAPA RSNVKGDAPL PKTVWTSRRL PVSCNAFSSM 

151 SITVTSLLGM AARFCATVEP TCPLPKMRIF TVWVAE* 



m046/a046 98.4% identity over a 186 aa overlap 



10 20 30 40 50 60 

m04 6 . pep MSAMLRPTSXPXXRACMMTIRTRSSAKRKTCNAPGQSIRPASCSVTSCSGLMVSVMPNME 
I I I I M I I I I i I I I I I I I I I I I \ I I I I I I t I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a04 6 MSAMLRPTSS PPRRACMMTIRTRSSAKRKTCNAPGQSIRPASCSVTSCSGLMVSVMPNME 
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10 20 30 40 50 60 

70 80 90 100 110 120 

m04 6. pep RLPFSLFSSLGLRYSRYSLERTRAMRPGMLNRSAATLQATMLVSSLRESASSKSASSAPA 
I I I I I I I i I I I I I I ! I \ I I M I I I I I I I \ I II II I I ! I M I I M I M I I I II I M I M I I 
a04 6 RLPFSLFSSLGLRYSRYSLERTRAMRPGMLNRSAATLQATMLVSSLRESASSKSASSAPA 

70 80 90 100 110 120 

130 140 150 160 170 180 

m04 6 . pep RSNVKGDAPLPKTVWTSRRLPVSCNAFSSMSITVTSLLGMAARFCATVEPTCPLPKMRIF 
I I M I I I I I I I II M M I I 1 I M I I! I I I I I I I I I I I I I I I I It I II I I I I 1 I II I M I I 
a04 6 RSNVKG DAPLPKTVWTSRRLPVSCNAFSSMSITVTSLLGMAARFCATVEPTCPLPKMRIF 

130 140 150 160 170 180 



m04 6.pep TVWVAEX 
I I I I I I I 

a04 6 TVWVAEX 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 046 shows 97.3% identity over a 185 aa overlap with a predicted ORF (ORF 046.ng) 
from N. gonorrhoeae: 

m046/g046 

10 20 30 40 50 60 

m046 . pep 
g046 



MSAMLRPTSXPXXRACMMTIRTRSSAK31KTCNAPGQSIRPASCSVTSCSGLMVSVMPNME 

MINIMI I 1 M ft 1 1 1 I 1 1 1 1 1 M 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 11 1 1 I M I I 1 1 1 1 I 

MSAMLRPTSSPPRRACMMTIRTRSSAKTiKTCNAPGOSIRPASCSVTSCSGLMVSVMPNME 
10 20 30 40 50 60 



70 80 90 100 110 120 

m04 6 . pep RLPFSLFSSLGLRYSRYSLERTRAMRPGMLNRSAATLQATMLVSSLRESASSKSASSAPA 

I i M I 1 1 1 1 1 1 I 1 1 II I 1 1 1 1 I I M 1 1 i 1 1 1 1 1 1 i 1 1 I I 1 1 1 1 1 M 1 1 MMIIMMM 

g046 RLPFSLFSSLGLRYSRYSLERTRAMRPGMLNRSAATLQATMLVSSLRESASSKSASSAPA 

70 80 90 100 110 120 



130 140 150 160 170 180 

m046 . pep RSbA/KGDAPLPKTVWTSRRLPVSCNAFSSMSITVTSLLGMAARFCATVEPTCPLPKMRIF 

I 1 1 1 1 1 II I It 1 1 M I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 t E i 1 1 1 1 1 1 1 1 1 1 1 1 E i 1 1 

g04 6 RYNVKGDAPLPKTWTSRRLPVSCNAFSSMSITVTSLLVMAARFCATVEPTCPLPKMRIF 

130 140 150 160 170 180 



m04 6.pep TVWVAEX 
Mill 

g046 TVWVAGX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 193>: 

g047 . seq 

1 ATGGTCATCA TACAGGCGcg gcGCGGCGGG CTGCTTGTCG GACGCAGCAT 

51 TGCCGACATC GCCCAAGATT TGCCCGACGG GGCCGACTGC CAAAT CTGCG 

101 CCGTTTACCG CAACAACCGC CTCATCGTCC CCGCGCCGCA AACCGTCATC 

151 ATCGAAGGCG ACGAAATCCT GTTTGCCGCC GCCGCCGAAA ACATCGGGGC 

201 GGT CAT AC C C gaATTGCGCC CCAAAGAAAC CAGCACCCGC CGCATCATGA 

251 TTGCCGGCGG CGGCAACATc tgctACCGCC TCGCCAAGCA GCT CGAAC AC 
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301 GCATAcaacG T C AAAAT CAT CGAATGCCGG CCGCGCcgtg c eg a ATGGAT 

351 AGCCGAAAAC ctcgAcaaCA CCCTCGTCCT GCAAGGTTCG Gcaaccgacg 

401 aAaccctgct cgAcaacgaa tacatcgacg aaatcgaCGT ATTCTGCGCC 

451 CTGACC AACG ACGACGAAAG CAACATTAtg tCCGCCCTTT TGGCGAAAAA 

501 CCTcggcgCG AAGCgcgtca tcggCATCGT CAACCGCTCA AGCTACGTCG 

551 ATTTGCTCGA AGGCAACAAA ATCGACATCG TCGTCTCCCC CCACCTCATC 

601 AC CATCGGCT CGATACTCGC CCACATCCGG CGCGGCGACA TCGTTGCCGT 

651 CCACCCCATC CGGCGCGGCA CGGCGGAAGC CATCGAAGTC GTCGCGCACG 

701 GCGACAAAAA AACTTCCGCC ATCATCGGCA GGCGCATCAG CGG CATC AAA 

751 TGGCCCGAAG GCTGCCACAT TGCCGCCGTC GTCCGCGCCG GAACCGGCGA 

801 AACCATTATG GG AC AC CAT A CCGAAACCGT CATCCAAGAC GGTGACCACA 

851 TCATCTTTTT CGTCTCGCGC CGGCGCATCC TGAACGAACT GGAGAAACTC 

901 ATCCAAGTCA AAATGGGCTT TTTCGGATAA 



This corresponds to the amino acid sequence <SEQ ID 194; ORF 047. ng>: 

g047 .pep 

1 MVIIQARRGG LLVGRSIADI AQDLPDGADC QICAVYRNNR LIVPAPQTVI 

51 IEGDEILFAA AAENIGAVIP ELRPKETSTR RIMIAGGGNI CYRLAKQLEH 

101 AYNVKIIECR PRRAEWIAEN LDNTLVLQGS ATDETLLDNE YIDEIDVFCA 

151 LTNDDESN I M SALLAKNLGA KRVIGIVNRS SYVDLLEGNK ID IWSPHLI 

201 TIGSILAHI R RGDIVAVHPI RRGTAEAIEV VAHGDKKTSA IIGRRISGIK 

251 WPEGCHIAAV VRAGTGETIM GHHTETVIQD GDHI IFFVSR RRILNELEKL 

301 IQVKMGFFG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ED 195>: 

m047 . seq 

1 ATGGTCATCA TACAGgCGcG C . . syGCGGA sTGCTTGTCG GACGCAGCAT 

51 TGCCGACATC GCCCAAGATT TGCCCGACGG GGCCGACTGC CAAATCTGCG 

101 CCGTTTACCG CAACAACCGC CTCATCGTCC CCGCGCCGCA AACCGTCATC 

151 ATCGAAGGCG ACGAAATCCT ATTTGCCGCC GCCGCCGAAA ACATCGGCGC 

201 GGTCATACCC GAATTGCGCC CCAAAGAAAC CCAAAGAAAC CAGCcCmgmm 

251 GcATCATGAT TkCCGGCGGC GGCAACATCG GCTACCGTCT CGCCAAGCAG 

3 01 CTCGAACACG CATACAACGT y AAAAT CATC GAATGCCGGC CGCGCCGTGC 

3 51 CGAATGGATA GCCGAAAACC TCGACAACAC CCTCGTCyTG CAAGGTTCGG 

4 01 CAACCGACGA AACCCTGCTC GACAACGAAT ACATCGACGA AATCGACGTA 

4 51 TTCTGCGCCC TGACCAACGA CGACGAAAGC AACATTATGT CCGCCCTTTT 

5 01 GGCGAaAAAC CTCGGCGCGA AGCGCGTCAT CGGCATCGTC AACCGCTCAA 

5 51 GCTACGTCGA TTTGCTCGAA GGCAACAAAA TCGACATCGT CGTCTCCCCC 

6 01 CACCTCATCA CCATCGGCTC GAT ACT CG C C CACATCCGGC GCGGCGACAT 

6 51 CGTTGCCGTC CACCCCATCC GGCGCGGCAC GGCGGAAGCC AT CGAAGTCG 

7 01 TCGCACACGG CGACAAAAAA ACTTCCGCCA T C AT CGG CAG GCGCATCAGC 

7 51 GG CAT C AAAT GGCCCGAAGG CTGCCACATT GCCGCCGTCG TCCGCGCCGG 

8 01 AACCGGCGAA AC CATT ATGG GACACCATAC CGAAACCGTC AT CCAAGACG 
851 GCGAC C AC AT CATCTTTTTC GTCTCGCGCC GGCGCATCCT GAACGAACTG 
901 G AAAAACT C A TCCAGGTCAA AATGGG CTTT TTCGGATAA 

This corresponds to the amino acid sequence <SEQ ID 196; ORF 04 7>: 

m047 .pep 

1 MVIIQARXXG XLVGRSIADI AQDLPDGADC QICAVYRNNR LIVPAPQTVI 

51 IEGDEILFAA AAENIGAVIP ELRPKETQRN QPXXIMIXGG GNIGYRLAKQ 

101 LEHAYNVKI I ECRPRRAEWI AENLDNTLVL QGSATDETLL DNEYIDEIDV 

151 FCALTNDDES NIMSALLAKN LGAKRV I G I V NRSSYVDLLE GNKIDIVVSP 

201 HLITIGSILA HI RRGDIVAV HPIRRGTAEA IEWAHGDKK TSAIIGRRIS 

251 GIKWPEGCHI AAWRAGTGE TIMGHHTETV IQDGDHI IFF VSRRRILNEL 

3 01 EKLIQVKMGF FG* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 197>: 

a047 . seq 

1 ATGGTCATCA TACAGGCGCG GCGCGGCGGA CTGCTTGTCG GACGCAGCAT 
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51 TGCCGACATC GCCCAAGATT TGCCCGACGG GGCCGACTGC CAAATCTGCG 

101 CCGTTTACCG CAACAACCGC CTCATCGTCC CCGCGCCGCA AACCGTCATC 

151 ATCGAAGGCG ACGAAATCCT ATTTGCCGCC GCCGCCGAAA ACATCGGCGC 

201 GGTCATACCC GAATTGCGCC CCAAAGAAAC CAGCACCCGC CGCATCATGA 

251 TTGCCGGCGG CGGCAACATC GGCTACCGTC TCGCCAAGCA GCTCGAACAC 

301 GCATACAACG TCAAAATCAT CGAATGCCGG CCGCGCCGTG CCGAATGGAT 

351 AGCCGAAAAC CTCGACAACA CCCTCGTCCT GCAAGGTTCG GCAACCGACG 

4 01 AAACCCTGCT CGACAACGAA TACATCGACG AAATCGACGT ATTCTGCGCC 

4 51 CTGACCAACG ACGACGAAAG CAACATTATG TCCGCCCTTT TGGCGAAAAA 

501 CCTCGGCGCG AAGCGCGTCA TCGGCATCGT CAACCGCTCA AGCTACGTCG 

551 ATTTGCTCGA AGGCAACAAA ATCGACATCG TCGTCTCCCC CCACCTCATC 

601 ACCATCGGCT CGATACTCGC CCACATCCGG CGCGGCGACA TCGTTGCCGT 

651 CCACCCCATC CGGCGCGGCA CGGCGGAAGC CATCGAAGTC GTCGCACACG 

7 01 GCGACAAAAA AACTTCCGCC ATCATCGGCA GGCGCATCAG CGGCATCAAA 

7 51 TGGCCCGAAG GCTGCCACAT TGCCGCCGTC GTCCGCGCCG GAACCGGCGA 

801 AACCATTATG GGACACCATA CCGAAACCGT CAT C C AAG AC GGCGACCACA 

851 TCATCTTTTT CGTCTCGCGC CGGCGCATCC TGAACGAACT GGAAAAACTC 

901 ATCCAAGTCA AAATGGGCTT TTTCGGATAA 



This corresponds to the amino acid sequence <SEQ ID 198; ORF 047. a>: 

a047.pep 

1 MVIIQARRGG LLVGRSIADI AQDLPDGADC QICAVYRNNR LIVPAPQTVI 

51 IEGDEILFAA AAENIGAVIP ELRPKETSTR RIMIAGGGNI GYRLAKQLEH 

101 AYNVKIIECR PRRAEWIAEN LDNTLVLQGS ATDETLLDNE YIDEIDVFCA 

151 LTNDDESNIM SALLAKNLGA KRVIGIVNRS SYVDLLEGNK ID IWSPHLI 

201 TIGSILAHI R RGDIVAVHPI RRGTAEAIEV VAHGDKKTSA IIGRRISGIK 

251 WPEGCHIAAV VRAGTGETIM GHHTETVIQD GDHIIFFVSR RRILNELEKL 

301 IQVKMGFFG* 



m047/a047 96.5% identity over a 312 aa overlap 



10 20 30 40 50 60 

m047 .pep MVIIQARXXGXLVGRSIADIAQDLPDGADCQICAVYRNNRLIVPAPQTVI IEGDEILFAA 

t I I II I I I I II II I I I I ! M I t M I I M I I I I I I I i I 1 I I I I I I I I I I I I I I 1 I I I I 
a04 7 MVIIQARRGGLLVGRSIADIAQDLPDGADCQICAVYRNNRLIVPAPQTVI IEGDEILFAA 

10 20 30 40 50 60 



70 80 90 100 110 120 

m04 7 .pep AAENIGAVIPELRPKETQRNQPXXIMIXGGGNIGYRLAKQLEHAYNVKIIECRPRRAEWI 
I I II I I I I M I I M II I : : II I I M II I I I I I I I M I I I I I II I I I I I I 11 I I I 
a04 7 AAENIGAVIPELRPKETSTRR IMI AGGGN IGYRLAKQLEHAYNVKI IECRPRRAEWI 

70 80 90 100 110 



130 140 150 160 170 180 

m04 7 .pep AENLDNTLVLQGSATDETLLDNEYIDEIDVFCALTNDDESNIMSALLAKNLGAKRVIGIV 

I II I I I II I I I I I I I I I I I I I I I I I I I I I I I M I II I I I I I II I I I I I II I I I I I I I I II 
a04 7 AENLDNTLVLQGSAT DETLLDNEYI DEIDVFCALTNDDESNIMSALLAKNLGAKRVIGIV 

120 130 140 150 160 170 



190 200 210 220 230 240 

m04 7 .pep NRSSYVDLLEGNKIDIWSPHLITIGSILAHIRRGDIVAVHPIRRGTAEAIEWAHGDKK 

I I I I I I M I I I I I I I I I I I I II 1 I I I I I I I I I I II I I I I M I I I I I I I I M I I I I I M I I 
a04 7 NRSSYVDLLEGNKIDIWSPHLITIGSILAHIRRGDIVAVHPIRRGTAEAIEVVAHGDKK 

180 190 200 210 220 230 



250 260 270 280 290 300 

m047 .pep TSAIIGRRISGIKWPEGCHIAAWRAGTGETIMGHHTETVIQDGDHIIFFVSRRRILNEL 

I I I II I I I I M I M I I M I I I I I I 11 ! I I I I I I I 1 I I I I I I It I I I I I I M I I II I M I I 
a04 7 TSAIIGRRISGIKWPEGCHIAAWRAGTGETIMGHHTETVIQDGDHII FFVSRRRILNEL 

240 250 260 270 280 290 



m04 7 . pep 



310 

EKLIQVKMGFFGX 
M I I I I M I I I I I 
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a04 7 EKLIQVKMGFFGX 
300 310 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 047 shows 96.2% identity over a 312 aa overlap with a predicted ORF (ORF 047.ng) 
from N. gonorrhoeae: 



m047/g045 






m04 7 . 


pep 


MV 1 1 QARXXGXLVGRS I AD I AQDLPDGADCQ I CAVYRNNRL I VP APQTV 1 1 EGDE I LFAA 

M1MII 1 IIMIIIIIIIIIMMMIIIIIMIIIIIIMIIIIMIMIiilil 


60 


g047 




MVIIQAI^GGLLVGRSIADIAQDLPDGADCQICAWRNNRLIVPAPQTVIIEGDEILFAA 


60 


m047 . 


pep 


AAENI GAV I PELRPKETQRNQPXXIMIXGGGNI GYRLAKQLEHAYNVKI I ECRPRRAEWI 


120 


g047 




I M 1 1 1 1 1 1 M 1 1 1 1 1 i = : III Mill IIMIIIIMMMIIIMIIIIMI 

AAENIGAVIPELRPKETSTRR IMI AGGGNI CYRLAKQLEHAYNVKI I ECRPRRAEW I 


117 


tn047 . 


.pep 


AENLDNTLVLQGSATDETLLDNEYIDEIDVFCALTNDDESNIMSALLAJQJLGAKRVIGIV 

MIIIIIIMIIMIMIMIIMMIIIIIMIIIIMIMIIIIIIIMMIMIMI 

AENLDNTLVLQGSATDETLLDNEYIDEIDVFCALTNDDESNIMSALtLAKNLGAKRVIGIV 


180 


g047 




177 


m047, 


■pep 


NRSSYVDLLEGNKIDIVVSPHLITIGSILAHIRRGDIVAVHPIRRGTAEAIEVVAHGDKK 

IMIIMIMIMIIIIIIIMIIIMIIIMIIIMIMIMIIIIIMIMIIIMM 

NRSS YVDLLEGNKI D I WS PHL I TI GS I LAHI RRGDI VAVHP I RRGTAEAI EWAHGDKK 


240 


g047 




237 


m047 


.pep 


TSAIIGRRISGIKWPEGCHIAAWRAGTGETIMGHHTETVIQDGDHIIFFVSRRRILNEL 

IIIIIIIIMIIIIIIMMMIIIIIMMIIMIIIIIMIMMIIMIIIIIIIII 

TSAIIGRRISGI KW P EGCH I AAWRAG TGE T I MGHHT E TV I QDGDH 1 1 F FV S RRR I LN E L 


300 


g047 




297 


m047 


-pep 


EKLIQVKMGFFGX 313 




g047 




lllllllllllll 
EKLIQVKMGFFGX 310 





The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 199>: 

g048 . seq 

1 ATGCTCGACA 

51 TTACTACGTc 

101 CAGGTCCGAC 

151 AAACAAACCG 

2 01 cacctGCGAA 

2 51 GCGGCGCGGC 

3 01 GCGTTCCCCG 
351 TATGCCCGTA 

4 01 CCGCCCCGCG 
4 51 TGA 

This corresponds to the amino acid sequence <SEQ ID 200; ORF 048. ng>: 

g048 .pep 

1 MLDKGEELPV DFTNRLIYYV GPVDPVGDEV VGPAGPTTAT RMDKFTRQML 

51 KQTGLLGMIG KSERGAATCE AIADNK AVYL MAVGGAAYLV AKAI KSSKVL 

101 AFPELGMEAV YEFEVKDMPV TVAVDSKGES I HATAPRKWQ AKIGIIPVES 

151 * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 201>: 

m048 . seq 

1 ATGCTCAACA AAGGCGAAGA ATTGCCCGTC GATTTCACCA AC CG CCTG AT 

51 TTACTACGTC GGCCCCGTCG ATCCGGTCGG CGATGAAGTC GTCGGTCCGG 



AAGGCGAGGA GTTGCCCGTC 
ggcCCcgTCG ATCCGGTCGG 
CACAGCCACC CGCATGGACA 
GCCTCTTGGG CATGATCGGC 
GCcatCGCCG ACAACAAGGC 
ATACCTCGTG GCAAAAGCCA 
AATTGGGTAT GGAAGCCGTT 
ACCGTCGCCG TGGACAGCAA 
CAAATGGCAG GCGAAAAT CG 



GATTTCACCA ACCGCCTGAT 
CGATGAAGTC GTCGGTCCCG 
AATTTACCCG CCAAATGCTC 
AAATCCGagc gcgGcgcggc 
CGTGTACCTC ATGGCAGTCG 
TCAAATCTTC CAAAGTCTTG 
T AC GAATTTG AAGTCAAAGA 
AGG CGAATCC ATCCACGCCA 
GCATCATCCC CGTCGAGTCT 
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101 CAGGTCCGAC CACAGCCACC CGCATGGACA AATTCACCCG CCAAATGCTC 

151 GAACAAACCG ACCTCTTGGG CATGATCGGC AAATCCGAGC GCGGCGTGGC 

2 01 CACCTGCGAA GCCATCGCCG ACAACAAAGC CGTGTACCTC ATGGCAGTCG . 

2 51 GCGGCGCGGC GTATCTCGTG GCAAAAGCCA TCAAATCTTC CAAAGTCTTG 

3 01 GCGTTCCCCG AATTGGGCAT GGAAGCCATT TACGAATTTG AAGTCAAAGA 

3 51 CATGCCCGTA ACCGTCGCCG TAG AT AG CAA AGGCGAATCC ATCCACGCCA 

4 01 CCGCCCCGCG C AAATGG CAG GCGAAAATCG GCATCATCCC CGTCGAATCT 
4 51 TGA 



This corresponds to the amino acid sequence <SEQ ID 202; ORF 048>: 

m04 8 -pep 

1 MLNKGEELPV DFTNRL I YYV GPVDPVGDEV VGPAGPTTAT RMDKFTRQML 

51 EQTDLLGM I G KSERGVATCE A I ADNKA VY L MAVGGAAYLV AKAIKSSKVL 

101 AFPELGMEAI YEFEVKDMPV TVAVDSKGES IHATAPRKWQ AKIGIIPVES 

151 * 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 203>: 

a048 . seq 

1 ATGCTCGACA AAGGCGAAGA ATTGCCCGTC GATTTCACCA ACCGCCTGAT 

51 TTACTACGTC GGCCCCGTCG ATCCGGTCGG CGACGAAATC GTCGGCCCAG 

101 CAGGTCCGAC CACCGCCACC CGCATGGACA AATTCACCCG CCAAATGCTC 

151 GAACAAACCG ACCTCTTGGG CATGATCGGC AAATCCGAGC GCGGCGCGGC 

2 01 CACCTGCGAA GCCATCGCCG ACAACAAAGC CGTGTACCTC ATGGCAGTCG 

2 51 GCGGCGCGGC GTATCTCGTG GCAAAAGCCA TCAAATCTTC CAAAGTCTTG 

301 GCGTTCCCCG AATTGGGCAT GGAAGCCATT TACGAATTTG AAGTCAAAGA 

351 CATGCCCGTA ACCGTCGCCG TAGACAGCAA AGGCGAATCC ATCCACGCCA 

4 01 CCGCCCCGCC CCAATGGCAG GCGAAAATCG GCATCATCCC CGTCAAATCT 

4 51 TGA 

This corresponds to the amino acid sequence <SEQ ID 204; ORF 048. a>: 

a048 .pep 

1 MLDKGEELPV DFTNRLIYYV GPVDPVGDEI VGPAGPTTAT RMDKFTRQML 

51 EQTDLLGMIG KSERGAATCE AIADNKA VYL MAVGGAAYLV AKAI KSSKVL 

101 AFPELGMEAI YEFEVKDMPV TVAVDSKGES IHATAPPQWQ AKIGIIPVKS 

151 * 



m048/a048 96.0% identity over a 1 50 aa overlap 

10 20 30 40 50 60 

m04 8 .pep MLNKGEELPVDFTNRLIYYVGPVDPVGDEWGPAGPTTATRMDKFTRQMLEQTDLLGMIG 
I I : 1 I I M I I I I I I I I II i I I i M I I I I t : M 1 I I I I 1 I i I I I I I I I i I I I I M I M I I i 
a04 8 MLDKGEELPVDFTNRLI YYVGPVDPVGDEIVGPAG PTTATRMDKFTRQMLEQTDLLGMIG 

10 20 30 40 50 60 



70 80 90 100 110 120 

m04 8 . pep KSERGVATCEAIADNKAVYLMAVGGAAYLVAKAIKSSKVLAFPELGMEAIYEFEVKDMPV 
I | 1 | I : I I I I I I I M I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I 
a04 8 KSERGAATCEAIADNKAVYLMAVGGAAYLVAKAIKSSKVLAFPELGMEA I YEFEVKDMPV 

70 80 90 100 110 120 



130 140 150 

m048 .pep TVAVDSKGES IHATAPRKWQAKIGI I PVESX 

I II I II E I I t I I M I t : I M I I I I I U : I ! 
a0 4 8 TVAVDSKGES IHATAPPQWQAKIGI I PVKSX 

130 140 150 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 048 shows 96.4% identity over a 150 aa overlap with a predicted ORF (ORF 048.ng) 
from N. gonorrhoeae: 
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m048/g048 

10 20 . 30 40 50 60 

m04 8 pep MLNKGEELPVDFTNRLI YYVGPVDPVGDEVVGPAGPTTATRMDKFTRQMLEQTDLLGMIG 

1 1 ^ i 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 M I 1 1 i I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 I ! 1 1 1 = 1 1 MINI 

g04 8 MLDKGEELPVDFTNRLI YYVGPVDPVGDEVVGPAGPTTATRMDKFTRQMLKQTGLLGMIG 

10 20 30 40 50 60 



70 80 90 100 110 120 

m04 8 .pep KS E RGV ATC E A I ADNKAVYLMAVGG AA YL V AKA I KS S KVLAF P E LGM E A I Y E F E V KDM P V 

1 1 ] 1 1 = J i 1 1 1 1 1 1 1 1 1 1 1 E 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 E 1 1 1 1 J = 1 ! 1 1 I 1 1 1 1 1 

g04 8 KSERGAATCEAIADNKAVYLMAVGGAAYLVAJCAIKSS KVLAF PELGMEAVYEFEVKDMPV 

70 80 90 100 110 120 



130 140 150 

m048 -pep TVAVDSKGESIHATAPRKWQAKIGIIPVESX 

IIIIIIIIIIIMIIIIIIMIIIIIMIII 

g 0 4 8 TVAVDS KGES I HATAPRKWQAKI G 1 1 PVE SX 

130 140 150 



The following partial DNA sequence was 


g049 . seq 






1 


ATGCGGGCGC 


AGGCGTTTGA 


51 


GGCAGAACAC 


TTCGCGCCGG 


101 


TGGACGGGCA 


TCAACGCCTC 


151 


CCCGTCTGCC 


GCCGTACCGG 


201 


• CAATCTGTGC 


GGTTTCAAAT 


251 


ATT CTCCGCC 


GCGATTCGAT 


301 


AGCCTgcgcg 


TTGAGCCAGT 


351 


CGACTTCCTC 


GCCGCAATCG 


401 


TTCTTCAGAT 


ACGCCTTTAA 



identified in N. gonorrhoeae <SEQ ID 205>: 

TCAACCGTTC GGTCAGCTCC TGTTCGGACA 
TTGACGGCTT TCGGGTTCAG GATATTGATT 
TTCCGCACCG CCTTCGCCGT TTTCCGCAAC 
ATTCTGCCGC ATCGGCGTTT TCCCCGCCCT 
TCGGCACTGT CTTTTTTGGC ATCGAAC CGG 
GTGTTTTTCC GAAAccggca t TTGCAGGGA 
TTTCCTGAAG GACGATCATC GGGTCGGTTT 
GCAACGGCgc tGTTGTGTTC TTCCTGCCAT 



This corresponds to the amino acid sequence <SEQ ID 206; ORF 049.ng>: 

g04 9 - pep 

1 MRAQAFDQPF GQLLFGQAEH FAPVDGFRVQ DIDLDGHQRL FRTAFAVFRN 

51 PVCRRTGFCR IGVFPALNLC GFKFGTVFFG IEPDSPPRFD VFFRNRHLQG 

101 SLRVEPVFLK DDHRVGFDFL AAIGNGAWF FLPFLQIRL* 



The following partial DNA sequence 



tn049 . seq 


(partial) 




1 


ATGCGGGCGC 


AGGCGTTTGA 


51 


GGCAGAACAC 


TTCGCGCCGG 


101 


TGGACGGGCA 


TCAACGTTTC 


151 


CGCCGGCTCA 


TTCGTGCCGG 


201 


CAATCTGTCC 


GGTTTCAAAT 


251 


ATTCTCCGCC 


GCGATTCGAT 


301 


AGCCTGCGCG 


TTGAGCCAGT 


351 


CGACTTCCTC 


GCCGCAATCG 


401 


TTTT T C AG AT 


ACGCCTT . . . 



identified in N. meningitidis <SEQ ID 207>: 

TCAGCCGTTC GGTCAGCTCC TGTTCGGACA 
TTGACGGCTT TCGGGTTCAG GATATTGATT 
TTCCGCATCG TTTTCCCCGT TTTCCGAAAC 
ATTCTGCCTC GTCGGCGTTT TCCCCGCTTT 
TCGACACTGT CTTTTTTGGT ATCAAACCGG 
GTGTTTTTCC GAAACCGACA TTTGCAGGGA 
TTTCCTGAAG GACGATCATC GGGTCGGTTT 
GCAACGGCGG CATTGTGTTC CTCCTGCCAT 



This corresponds to the amino acid sequence <SEQ ID 208; ORF 049>: 

m04 9.pep (partial) 

1 MRAQAFDQPF GQLLFGQAEH FAPVDGFRVQ DIDLDGHQRF FRIVFPVFRN 

51 RRLIRAGFCL VGVFPAFNLS GFKFDTVFFG IKPDSPPRFD VFFRNRHLQG 

101 SLRVEPVFLK DDHRVGFDFL AAIGNGGIVF LLPFFQIRL. . . 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 209>: 

a049 . seq 
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1 ATGCGGGCGC 

51 GGCAGAACAC 

101 TGGACGGGCA 

151 CCCGTCTGCC 

201 CAATCTGTCC 

2 51 ATTCTCCGCC 

301 AGCCTGCGCG 

351 CGACTTCCTC 

4 01 TTTTTCAGAT 



AGGCGTTTGA 
TTCGCGCCGG 
TCAACGCTTC 
GCCGTACCCG 
GGTTTCAAAT 
GCGATTCGAT 
TTGAGCCAGT 
GCCGCAATCG 
ACGCCTT 



TCAGCCGTTC 
TTGACGGCTT 
TTCCGCACCG 
ATTCTGCCGC 
TCGGCACTGT 
GTGTTTTTCC 
TTTCCTGAAG 
GCAACGGCGG 



GGTCAGCTCC 
TCGGGTTCAG 
CCTTCGCCGT 
ATCGGCGTTT 
CTTTTTTGGC 
GAAACCGACA 
GACGATCATC 
CATTGTGTTC 



TGTTCGGACA 
AATATTGATT 
TTTCCGCAAC 
TCCCCGCCTT 
ATCAAACCGG 
TTTGCAGGGA 
GGGTCGGTTT 
CTCCTGCCAT 



This corresponds to the amino acid sequence <SEQ ED 210; ORF 049.a>: 

a04 9 . pep 

1 MRAQAFDQPF GQLLFGQAEH FAPVDGFRVQ NIDLDGHQRF FRTAFAVFRN 

51 PVCRRTRFCR IGVFPAFNLS GFKFGTVFFG IKPDSPPRFD VFFRNRHLQG 

101 SLRVEPVFLK DDHRVGFD FL AAIGNGGIVF LLPFF QIRL 

m04 9/a04 9 90.6% identity over a 139 aa overlap 

10 20 30 40 50 60 

m04 9 . pep MRAQAFDQPFGQLLFGQAEHFAPVDG FRVQDIDLDGHQRFFRIVFPVFRNRRLIRAGFCL 

[ I | I I M I I I M I I I I I M t t I I I I I I I I I : I M I I 11 M I I : I MM I : M 

a0 4 9 MRAQAFDQPFGQLLFGQAEHFAPVDG FRVQN I DLDGHQRFFRTAFAVFRN PVCRRTRFCR 

10 20 30 * 40 50 60 



70 80 90 100 110 120 

m04 9 pep VGVFPAFNLSGFKFDTVFFGIKPDSPPRFDVFFRNRHLQGSLRVEPVFLKDDHRVGFDFL 
: M M M M M M I M I I M M M M M M M M I M I I M M M I M M M M t M M 
a 04 9 IGVFPAFNLSGFKFGTVFFGIKPDSPPRFDVFFRNRHLQGSLRVE PVFLKDDHRVGFDFL 

70 80 90 100 110 120 



130 139 
m04 9 . pep AAIGNGGIVFLLPFFQIRL 
M M M M M M I M M M 
a04 9 AAIGNGGIVFLLPFFQIRL 

130 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 049 shows 86.3% identity over a 139 aa overlap with a predicted ORF (ORF 049.ng) 
from N. gonorrhoeae: 

m049/g049 

10 20 30 40 50 60 

MRAQAFDQPFGQLLFGQAEHFAPVDGFRVQDIDLDGHQRFFRIVFPVFRNRRLIRAGFCL 

II MMII IM M M il I! Mill II M1MII ! M I M Ml M MM IMM 

MRAQAFDQPFGQLLFGQAEHFAPVDGFRVQDIDLDGHQRLFRTAFAVFRNPVCRRTGFCR 
10 20 30 40 50 60 

70 80 90 100 110 120 

VGVFPAFNLSGFKFDTVFFGIKPDSPPRFDVFFRNRHLQGSLRVEPVFLKDDHRVGFDFL 

Hlllhll Mil lilllhiMhilMIMIIIIIIMII IIIIIIMIIMIIM 

I GV F PALNLCG F KFGTVF FGI E PDS P PRFDV F FRKRHLQGS LRVE PV FL KDDHRVGFD FL 
70 80 90 100 110 120 



m04 9 .pep 
g049 

m04 9 . pep 
g049 



130 139 
m04 9 . pep AAIGNGGIVFLLPFFQIRL 

II MM -M M IMMIM 

g04 9 AAI GNGAWFFLPFLQI RLX 

130 140 
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The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 21 1>: 

g050 . seq 

1 atgggcgCGG GCTGGTGTCC TCCCGGCATC TTGGGCATCG GCATCGGCGg 

51 cacgcccGAA AAAGccgtgt TGATGGcaaA AG AAT C C CTG ATGAGCCACA 

101 TCGAcatCca a GaATTG C AG GAAAAAGCCG CGTccggggc ggaattgtcc 

151 accaccgaAG ccCTGCGCCT cGAACTCTTT GAAAAGGTCA ACGCGCTGGG 

201 CATCGGCGCG CAAGGCTTGG GCGGTCTGAC CACCGTGTTG GACGTGAAAA 

251 TCCTCGATTA CCCGACCCAT GCCGCCTCCA AACCGATTGC CATGATTCCC 

301 AACTGTGCcg ccacCCGcca cgtcgAATTT GAATTGgACG GCTCAGGtcc 

3 51 TGTCGAactc acgccGCcgc gtgtCGAAGA CTG A 

This corresponds to the amino acid sequence <SEQ ID 212; ORF 050.ng>: 

g050 -pep 

1 MGAGWCPPGI LGIGIGGTPE KAVLMAKESL MSHIDIQELQ EKAAS GAELS 

51 TTEALRLELF E KVN ALG I G A QGLGGLTTVL DVKILDYPTH AASKPIAMIP 

101 NCAATRHVEF ELDGSGPVEL TPPRVED* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 213>: 

m050 . seq 

1 ATGGGCGCGG GCTGGTGTCC TCCCGGCATC TTGGGTATCG GCATCGGCGG 

51 C. .agCCgAA AAAGCCGTGC TGATGGCAAA AGAGT C C CTG ATGAGCCACA 

101 TCGACATTCA AGAAT TG C AG GAAAAGGCCG CGTCCGGCGC GgAATTGTCC 

151 ACCACCGAAG CCCTGCGCCT CGAACTCTTT GAAAAAGTCA ACGCGCTGGG 

2 01 CATCGGCGCA CAAGGCTTGG GCGGACTGAC CACCGTGTTG GACGTGAAAA 
251 TCCTCGATTA TCCGACCCAC GCCGCCTCCA AACCGATTGC CATGATTCCG 

3 01 AACTGCGCCG CCACCCGCCA CGTCGAATTT GAATTGGACG GCTCAGGCCC 
3 51 TGTCGAACTC ACGCCGCCGC GCGTCGAAGA TGGCCCGATT TGA 

This corresponds to the amino acid sequence <SEQ ID 214; ORF 050>: 

mOSO -pep 

1 MGAGWCPPGI LGIGIGGXAE KAVLMAKESL MSHIDIQELQ EKAAS GAELS 

51 TTEALRLELF EKVNALGIGA QGLGGLTTVL DVKILDYPTH AASKPIAMIP 

101 NCAATRHVEF ELDGSGPVEL TPPRVEDGPI * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 215>: 

a050 . seq 

1 ATGGGCGCGG GCTGGTGTCC TCCCGGCATC TTGGGCATCG GCATCGGCGG 

51 TACGCCCGAA AAAGCCGTGT TGATGGCGAA AGAATCCCTG ATGAGCCACA 

101 TCGACATCCA AGAAT TGC AG GAAAAAGCCG CGTCCGGCGC GGAATTGTCC 

151 ACCACCGAAG CCCTGCGCCT CGAACTCTTT GAAAAAGTCA ACGCGCTAGG 

201 CATCGGCGCG CAAGGCTTGG GCGGTCTGAC CACCGTGTTG GACGTGAAAA 

251 TCCTCGATTA CCCGACCCAC GCCGCCTCCA AACCGATTGC CATGATTCCG 

301 AACTGCGCCG CCACCCGCCA CGTCGAATTT GAATTGGACG GCTCAGGCCC 

351 TGTCGAACTC ACGCCGCCGC GCGTCGAAGA CTGGCCC 

This corresponds to the amino acid sequence <SEQ ID 216; ORF 050. a>: 

a050 . pep 

1 MGAGWCPPGI LGIGIGGTPE KAVLMAKESL MSHIDIQELQ EKAASGAELS 

51 TTEALRLELF EKVNALGIGA QGLGGLTTVL DVKILDYPTH AASKPIAMIP 

101 NCAATRHVEF ELDGSGPVEL TPPRVEDWP 

moso/aoso 97.7% identity over a 129 aa overlap 

10 20 30 40 50 60 

m050 . pep MGAGWCPPGI LGIGIGGXAE KAVLMAKESLMSHIDIQELQEKAASGAELSTTEALRLELF 

I ! I I I II I I M I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I 
a050 MGAGWCPPGI LGIGIGGTPEKAVLMAKESLMSHIDIQELQEKAASGAELSTTEALRLELF 
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10 20 30 40 50 60 

70 80 90 100 110 120 

m050.pep EKVNALGI GAQGLGGLTTVLDVKI LDYPTHAASKPI AMI PNCAATRHVEFELDGSGPVEL 

I I ! M M I 1 I I I I I I I 1 M I I II 11 I M I M I I I I M M M I I I I 1 ! M I I I I I I t I M I 
a 050 EKVN ALG I GAQGLGGLTTVLDVKI LDYPTHAASKP I AM I PNCAATRHVEFELDGSGPVEL 

70 80 90 100 110 120 

130 

m050 . pep TPPRVEDGPIX 

I I I I II I I 
a050 TPPRVEDWP 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 050 shows 98.4% identity over a 127 aa overlap with a predicted ORF (ORF 050.ng) 
from N. gonorrhoeae: 

m050/g050 

10 20 30 40 50 60 

m050 pep MGAGWCPPGILGIGIGGXAEKAVLMAKESLMSHIDIQELQEKAASGAELSTTEALRLELF 

IMIIIIIIMMMIh I M 1 1 ! 1 1 1 1 1 1 1 M 11 I ! I 11 1 1 E M 1 1 i M i 1 1 1 t M 1 I 

g 050 MGAGWCPPGILGIGIGGTPEKAVLMAKESLMSHIDIQELQEKAASGAELSTTEALRLELF 

10 20 30 40 50 60 

70 80 90 100 110 120 

m0 5 0 pep E KVNALG I GAQGLGGLTTVLDVKI LDYPTHAASKP I AMI PNCAATRHVEFELDGSGPVEL 

i 1 1 [ 1 1 1 L 1 1 k 1 1 1 1 1 1 1 1 1 1 1 1 1 1 E 1 1 1 1 L i I 1 1 t E f 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 t i 1 1 E 

g 050 EKVN ALG I GAQGLGGLTTVLDVKI LDYPTHAASKP I AM I PNCAATRHVEFELDGSGPVEL 

70 80 90 100 110 120 

130 

m050 .pep TPPRVEDGPIX 

lllllll 

g0 50 TPPRVEDX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 217>: 

g050-l . seq 

1 ATGACCGTTA TCAAGCAAGA AGACTTTATT CAAAGTATCT GCGATGCCTT 

51 CCAATTCATC AGCTACTACC ATCCAAAAGA CTACATCGAC GCGCTTTATA 

101 AGGCGTGGCA GAAGGAAGAA AATCCCGCCG CCAAAGACGC GATGACGCAG 

151 ATTTTGGTCA ACAGCCGTAT GTGTGCCGAA AACAACCGCC CCATCTGCCA 

201 AGACACAGGT ATCGCAACCG TCTTCCTCAA AGTCGGTATG GATGTGCAAT 

251 GGGATGCGGA CATGAGCGTG GAAAAGATGG TTAACGAAGG CGTACGCCGC 

301 GCCTACACTT GGGAAGGCAA CACCCTGCGC GCTTCCGTCC TCGCCGATCC 

351 GGCCGGCAAA CGCCAAAACA CCAAAGACAA CACCCCCGCC GTCATCCACA 

401 TGAGCATCGT GCCGGGCGGT AAAGTCGAAG TAACCTGCGC GGCAAAAGGC 

451 GGCGGCTCTG AAAACAAATC CAAACTCGCT ATGCTCAACC CTTCCGACAA 

501 CATCGTCGAT TGGGTATTGA AAACCATCCC GACGATGGGC GCGGGCTGGT 

551 GTCCTCCCGG CATCTTGGGC ATCGGCATCG GCGGCAcgcC CGAAAAAGCC 

601 GTGTTGATGG cgaAAGAATC CCTGATGAGC CACATCGACA TCCAAGAATT 

651 GCAGGAAAAA GCCGCGTCCG GCGCGGAATT GTCCACCACC GAAGCCCTGC 

701 GCCTCGAACT CTTTGAAAAG GTCAACGCGC TGGGCATCGG CGCGCAAGGC 

751 TTGGGCGGTC TGACCACCGT GTTGGACGTG AAAATCCTCG ATTACCCGAC 

801 CCATGCCGCC TCCAAACCGA TTGCCATGAT TCCCAACTGT GCCGCCACCC 

8 51 GCCACGTCGA ATTTGAATTG GACGGCTCAG GTCCTGTCGA ACTCACGCCG 

901 CCGCGCGTCG AAGACTGACC CGATCTGACT TACAGCCCCG ACAACGGCAA 

951 ACGCGTCGAT GTCGATAAGC TGACCAAAGA AGAAGTGGCA AGCTGGAAAA 

1001 CCGGCGACGT ATTGCTGTTG AACGGCAAAA TCCTCACCGG CCGCGATGCC 

1051 GCGCACAAAC GCCTCGTCAA TATGCTCGAC AAAGGCGAGG AGTTGCCCGT 

1101 CGATTTCACC AACCGCCTGA TTTACTACGT CGGCCCCGTC GATCCGGTCG 

1151 GCGATGAAGT CGTCGGTCCC GCAGGTCCGA CCACAGCCAC CCGCATGGAC 

12 01 AAATTTACCC GCCAAATGCT CAAACAAACC GGCCTCTTGG GCATGATCGG 
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12 51 CAAATCCGAG CGCGGCGCGG CCACCTGCGA AGCCATCGCC GACAACAAGG 
1301 CCGTGTACCT CATGGCAGTC GGCGGCGCGG CATACCTCGT GGCAAAAGCC 

13 51 ATCAAATCTT CCAAAGTCTT GGCGTTCCCC GAATTGGGTA TGGAAGCCGT 
1401 TTACGAATTT GAAGTCAAAG ATATGCCCGT AACCGTCGCC GTGGACAGCA 
1451 AAGGCGAATC CATCCACGCC ACCGCCCCGC GCAAATGGCA GGCGAAAATC 
1501 GGCATCATCC CCGTCGAGTC TTGA 

This corresponds to the amino acid sequence <SEQ ID 218; ORF 050-l.ng>: 

g050-l .pep 



1 


MTVIKQEDFI 


51 


ILVNSRMCAE 


101 


AYTWEGNTLR 


151 


GGSENKSKLA 


201 


VLMAKESLMS 


251 


LGGLTTVLDV 


301 


PRVED* PDLT 


351 


AHKRLVNMLD 


401 


KFTRQMLKQT 


451 


IKSSKVLAFP 


501 


GIIPVES* 



g050-l/p!4407 

Sp|P14 407|FUMB_ECOLI FUMARATE HYDRATASE CLASS I, ANAEROBIC (FUMARASE) 
>gi|280063|pir | IB44511 fumarate hydratase (EC 4.2.1.2) fumB, iron-dependent - Escherichia coli 
>gi 1 146048 (M27058) anaerobic class I furoarase (EC 4.2.1.2) [Escherichia coli] Length = 548 

Score = 172 bits (432), Expect = 4e-42 

Identities = 138/488 (28%), Positives = 216/488 (43%), Gaps = 22/488 (4%) 

Query : 1 1 QSICDAFQFISYYHPKDYIDALYKAWQKEENPAAKDAMTQILVNSRMCAENNRPICQDTG 70 

Q+ DA + H K L+ E + K Q L NS + A+ P CQDTG 
Sbjct: 53 QAFHDASFMLRPAHQKQVAAILHDPEASEND KYVALQFLRNSEIAAKGVLPTCQDTG 109 

Query: 71 IATVFLKVGMDVQWDADMSVEKMVNEGVRRAYTWEGNTLRASVLADPAGKRQNTKDNTPA 130 

A + K G V W E+ +++GV YEN + A K NT N PA 

Sbjct: 110 TAIIVGKKGQRV-WTGGGD-EETLSKGVYNTYI-EDNLRYSQNAALDMYKEVNTGTNLPA 166 

Query: 131 V I HMS I V PGGK V E VT C AAKGGG S EN K S KL AMLNPS DN I VDWVLKT I PTMGAGWC P 185 

I + V G + + C AKGGGS NK+ L A+L P + +++++ + T+G CP 

Sbjct: 167 QIDLYAVDGDEYKFLCVAKGGGSANKTYLYQETKALLTPG-KLKNFLVEKMRTLGTAACP 225 

Query: 186 PXXXXXXXXXTPEKAVLMAKESLMSHIDIQELQEKAASGAELSTTEALRLELFEKVNXXX 245 

p T + L + +H EL + + L EL E+ 

Sbjct: 22 6 PYHI AFVIGGTSAETNLKTVKLASAHY-YDELPTEGNEHGQAFRDVQLEQELLEEAQKLG 284 

Query: 24 6 XXXXXXXXXTTVLDVKILDYPTHAASKPIAMIPNCAATRHVEFELDGSG PVELTPP 301 

D++++ P H AS P+ M +C+A R+++ +++ G +E P 

Sbjct: 285 LGAQFGGKYFAH-DIRVIRLPRHGASCPVGMGVSCSADRNIKAKINREGIWIEKLEHNPG 343 

uery: 302 RVEDXPDLTYSPDNGKRVDVDKLTKE EVASWKTGDVLLLNGKI LTGRDAAHKRLVNM 358 

+ +VD+++ KE +++ + L L G 1+ GRD AH +L + 

Sbjct: 34 4 QYI PQELRQAGEGEAVKVDLNRPMKEILAQLSQYPVSTRLSLTGTI I VGRDI AHAKLKEL 4 03 

Query: 359 LDKGEELPVDFTNRLIYYXXXXXXXXXXXXXXXXXTTATRMDKFTRQMLKQTGLLGMIGK 418 

+D G+ELP + IYY TTA RMD + + G + M+ K 

Sbjct: 40 4 IDAGKELPQYIKDHPIYYAGPAKTPAGYPSGSLGPTTAGRMDSYVDLLQSHGGSMIMLAK 4 63 

Query: 419 SERGAATCEAIADNKAVYLMAVGG-AAYLVAKAIKSSKVLAFPELGMEAVYEFEVKDMPV 477 

R +A + YL ++GG AA L ++IK + +A+PELGMEA+++ EV+D P 

Sbjct: 464 GNRSQQVTDACHKHGGFYLGS I GGPAAVLAQQS I KHLEC VAYPELGMEAI WKI EVEDFPA 523 

Query: 47 8 TVAVDSKG 4 85 

+ VD KG 
Sbjct: 524 FILVDDKG 531 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 219>: 

m050-l . seq 

1 ATGACCGTCA TCAAACAGGA AGACTTTATC CAAAGCATTT GCGATGCCTT 

51 CCAATTCATC AGCTACTATC ATCCCAAAGA CTACATCGAC GCGCTTTATA 

101 AGGCGTGGCA GAAGGAAGAA AATCCTGCCG CCAAAGACGC GATGACGCAG 

151 ATTTTGGTCA ACAGCCGTAT GTGTGCGGAA AACAACCGCC CCATCTGCCA 

201 AGACACAGGT ATCGCAACCG TCTTCCTCAA AGTCGGTATG AACGTCCAAT 

251 GGGATGCGGA CATGAGCGTG GAAGAGATGG TTAACGAAGG CGTACGCCGC 
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301 GCCTACACTT GGGAAGGCAA TACGCTGCGC GCTTCCGTCC TCGCCGATCC 

351 GGCCGGCAAA CGCCAAAACA CCAAAGACAA CACCCCCGCC GTCATCCATA 

401 TGAGCATCGT GCCGGGCGGT AAAGTCGAAG TAACCTGCGC GGCAAAAGGC 

4 51 GGCGGCTCTG AAAACAAATC CAAACTCGCC ATGCTCAATC CTTCCGACAA 

501 CATCGTCGAT TGGGTATTGA AAACCATCCC GACCATGGGC GCGGGCTGGT 

551 GTCCTCCCGG CATCTTGGGT ATCGGCATCG GCGGCACGCC CGAAAAAGCC 

601 GTGCTGATGG CAAAAGAGTC CCTGATGAGC CACATCGACA TTCAAGAATT 

651 GCAGGAAAAG GCCGCGTCCG GCGCGGAATT GTCCACCACC GAAGCCCTGC 

701 GCCTCGAACT CTTTGAAAAA GTCAACGCGC TGGGCATCGG CGCACAAGGC 

751 TTGGGCGGAC TGACCACCGT GTTGGACGTG AAAATCCTCG ATTATCCGAC 

801 CCACGCCGCC TCCAAACCGA TTGCCATGAT TCCGAACTGC GCCGCCACCC 

851 GCCACGTCGA ATTTGAATTG GACGGCTCAG GCCCTGTCGA ACTCACGCCG 

901 CCGCGCGTCG AAGACTGGCC CGATTTGACT TACAGCCCCG ACAACGGCAA 

951 ACGCGTCGAT GTCGACAAGC TGACCAAAGA AGAAGTGGCA AGCTGGAAAA 

1001 CCGGCGACGT ATTGCTGTTG AACGGCAAAA TCCTCACCGG CCGCGATGCC 

1051 GCACACAAAC GCCTCGTCGA TATGCTCAAC AAAGGCGAAG AATTGCCCGT 

1101 CGATTTCACC AACCGCCTGA TTTACTACGT CGGCCCCGTC GATCCGGTCG 

1151 GCGATGAAGT CGTCGGTCCG GCAGGTCCGA CCACAGCCAC CCGCATGGAC 

1201 AAATTCACCC GCCAAATGCT CGAACAAACC GACCTCTTGG GCATGATCGG 

12 51 CAAATCCGAG CGCGGCGTGG CCACCTGCGA AGCCATCGCC GACAACAAAG 

1301 CCGTGTACCT CATGGCAGTC GGCGGCGCGG CGTATCTCGT GGCAAAAGCC 

1351 ATCAAATCTT CCAAAGTCTT GGCGTTCCCC GAATTGGGCA TGGAAGCCAT 

1401 TTACGAATTT GAAGTCAAAG ACATGCCCGT AACCGTCGCC GTAGATAGCA 

14 51 AAGGCGAATC CATCCACGCC ACCGCCCCGC GCAAATGGCA GGCGAAAATC 

1501 GGCATCATCC CCGTCGAATC TTGA 

This corresponds to the amino acid sequence <SEQ ID 220; ORF 050- 1>: 

m050-l -pep 

1 MTVIKQEDFI QSICDAFQFI SYYHPKDYID ALYKAWQKEE NPAAKDAMTQ 

51 ILVNSRMCAE NNRPICQDTG IATVFLKVGM NVQWDADMSV EEMVNEGVRR 

101 AYTWEGNTLR ASVLADPAGK RQNTKDNTPA VIHMSIVPGG KVEVTCAAKG 

151 GGSENKSKLA MLNPSDNIVD WVLKTIPTMG AGWCPPGILG IGIGGTPEKA 

201 VLMAKESLMS HIDIQELQEK AASGAELSTT EALRLELFEK VKALGIGAQG 

251 LGGLTTVLDV KILDYPTHAA SKPIAMIPNC AATRHVEFEL DGSGPVELTP 

301 PRVEDWPDLT YSPDNGKRVD VDKLTKEEVA SWKTGDVLLL NGKILTGRDA 

351 AHKRLVDMLN KGEELPVDFT NRLIYYVGPV DPVGDEVVGP AGPTTATRMD 

401 KFTRQMLEQT DLLGMIGKSE RGVATCEAIA DNKAVYLMAV GGAAYLVAKA 

451 IKSSKVLAFP ELGMEAIYEF EVKDMPVTVA VDSKGESIHA TAPRKWQAKI 

501 GIIPVES* 

m050-l/g050-l 98.2% identity in 507 aa overlap 

10 20 30 40 50 60 

m050-l pep MTVIKQEDFIQSICDAFQFISYYHPKDYIDALYKAWQKEENPAAKDAMTQILVNSRMCAE 
| | | | | | | I I i 1 I I I I ! I I I 1 I 1 I I I I I I I I I I I I I I II ! II t I I I I I I I I I I I 1 I I II I I 
g050-l MTVIKQEDFIQS I CDAFQFISYYHPKDY I DAL YKAWQKEENPAAKDAMTQ ILVNSRMCAE 

10 20 30 40 50 60 

•70 80 90 100 110 120 

m050-l . pep NNRPICQDTGIATVFLKVGMNVQWDADMSVEEMVNEGVRRAYTWEGNTLRASVLADPAGK 
| | | | | | | | | I I I I I I I i I I I : I I I I I I I I I I : I I I I I I I I I M I I 1 I I I I I I I I I I I II i 
g050-l NNRPICQDTGIATVFLKVGMDVQWDADMSVEKMVNEGVRRAYTWEGNTLRASVLADPAGK 
70 80 90 100 110 120 

130 140 150 160 170 180 

m050-l pep RQNTKDNTPAVIHMSIVPGGKVEVTCAAKGGGSENKSKLAMLNPSDNIVDWVLKTIPTMG 
I I I I I I I I I 1 I I I 1 I I I I I I I I I I t I I I I t I I I I I I I M I I I I I I I I I ! I I I I I I M I I I 
q050-l RQNTKDNTPAVIHMSIVPGGKVEVTCAAKGGGSENKSKLAMLNPSDNIVDWVLKTIPTMG 

130 140 150 160 170 180 

190 200 210 220 230 240 

m050-l pep AGWCPPGILGIGIGGTPEKAVLMAKESLMSHIDIQELQEKAASGAELSTTEALRLELFEK 
I 1 I I I I I I I I I I I I I I I I I It I I I H I I I I 1 I I t I I I I I I I I I i I I I I I I I I i i I I M I I 
q050-l AGWCPPGILGIGIGGTPEKAVLMAKESLMSHIDIQELQEKAASGAELSTTEALRLELFEK 

190 200 210 220 230 240 

250 260 270 280 290 300 

m050-l -pep VNALGIGAQGLGGLTTVLDVKILDYPTHAASKPIAMIPNCAATRHVEFELDGSGPVELTP 
t I I I I I I I I I I I I I II I I I II M I I I I I I I I I I I M I I I I I I I I I I t I I ! I I I I i I I I I I 
g050-l VNALGIGAQGLGGLTTVLDVKILDYPTHAASKPI AMI PNCAATRHVEFELDGSGPVELTP 

250 260 270 280 290 300 
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310 320 330 340 350 360 

m050-l . pep PRVEDWPDLTYSPDNGKRVDVDKLTKEEVASWKTGDVLLLNGKILTGRDAAHKRLVDMLN 
| I | I I ! I I I ! I i I I I I I I i I I I I I f 1 I I I I I I I I I 1 ! 1 M I I I I I II I I M I I I I : I E : 
g050-l pRVEDXPDLTYSPDNGKRVDVDKLTKEEVASWKTGDVLLLNGKILTGRDAAHKRLVNMLD 

310 320 330 340 350 360 

370 380 390 400 410 420 

m050-l pep KGEELPVDFTNRLIYYVGPVDPVGDEVVGPAGPTTATRMDKFTRQMLEQTDLLGMIGK5E 

| | | | ( | | 11 | | I I | i I I I I I I I I II I I I i I I I II H I I I I I i t I I I I - I I I I I I I I t I I 
g0 5 0-l KGEELPVDFTNRLI YYVGPVDPVGDEVVGPAGPTTATRMDKFTRQMLKQTGLLGMIGKSE 

370 380 390 400 410 420 

430 440 450 460 470 480 

m050-l.pep RGVATCEAIADNKAVYLMAVGGAAYLVAKAIKSSKVLAFPELGMEAIYEFEVKDMPVTVA. 

i t : I I 1 i i I I I I I I I I I I I I I I I I I I I I I I I I I It t I I I I I I I I I I : I I I I I I I I I I I 1 I 
g050-l RGAATCEAIADNKAVYLMAVGGAAYLVAKAIKSSKVLAFPELGMEAVYEFEVKDMPVTVA 

430 440 450 460 470 480 

490 500 
m050-l.pep VDSKGESIHATAPRKWQAKIGIIPVESX 
i I II I I I I I I I I I I I I I I I I l-l II I I I I 
g050-l VDSKGESIHATAPRKWQAKIGIIPVESX 

490 500 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 22 1>: 

a050-l . seq 

1 ATGACCGTCA TCAAACAGGA AGACTTTATC CAAAGCATTT GCGATGCCTT 

51 CCAATTCATC AGCTACTACC ATCCCAAAGA CTACATCGAC GCGCTTTATA 

101 AGGCGTGGCA GAAGGAAGAA AACCCCGCCG CCAAAGACGC GATGACGCAG 

151 ATTTTGGTCA ACAGCCGCAT GTGTGCCGAA AACAACCGCC CCATCTGCCA 

201 AGATACCGGT ATCGCGACCG TGTTTTTGAA AGTCGGTATG GATGTGCAAT 

251 GGGATGCAGA CATGAGCGTC GAAGAGATGG TTAACGAAGG CGTGCGCCGC 

301 GCCTACACTT GGGAAGGCAA TACGCTGCGC GCTTCCGTTC TCGCCGACCC 

351 CGCCGGCAAA CGCCAAAATA CCAAAGACAA CACGCCCGCC GTCATCCATA 

401 TGAGCATCGT GCCGGGCGAC AAAGTCGAAG TAACCTGCGC GGCAAAAGGC 

4 51 GGCGGTTCTG AAAACAAATC CAAACTCGCC ATGCTCAACC CTTCCGACAA 

501 CATCGTCGAT TGGGTATTGA AAACCATTCC GACCATGGGC GCGGGCTGGT 

551 GTCCTCCCGG CATCTTGGGC ATCGGCATCG GCGGTACGCC CGAAAAAGCC 

601 GTGTTGATGG CGAAAGAATC CCTGATGAGC CACATCGACA TCCAAGAATT 

651 GCAGGAAAAA GCCGCGTCCG GCGCGGAATT GTCCACCACC GAAGCCCTGC 

701 GCCTCGAACT CTTTGAAAAA GTCAACGCGC TAGGCATCGG CGCGCAAGGC 

7 51 TTGGGCGGTC TGACCACCGT GTTGGACGTG AAAATCCTCG ATTACCCGAC 
801 CCACGCCGCC TCCAAACCGA TTGCCATGAT TCCGAACTGC GCCGCCACCC 

8 51 GCCACGTCGA ATTTGAATTG GACGGCTCAG GCCCTGTCGA ACTCACGCCG 
901 CCGCGCGTCG AAGACTGGCC CGATTTGACT TACAGCCCCG ACAACGGCAA 
951 ACGCGTCGAT GTCGACAAGC TGACCAAAGA AGAAGTGGCA AGCTGGAAAA 

1001 CCGGCGACGT ATTGCTGTTG AACGGCAAAA TCCTCACCGG CCGCGATGCC 

1051 GCACACAAAC GCCTCGTCGA TATGCTCGAC AAAGGCGAAG AATTGCCCGT 

1101 CGATTTCACC AACCGCCTGA TTTACTACGT CGGCCCCGTC GATCCGGTCG 

1151 GCGACGAAAT CGTCGGCCCA GCAGGTCCGA CCACCGCCAC CCGCATGGAC 

12 01 AAATTCACCC GCCAAATGCT CGAACAAACC GACCTCTTGG GCATGATCGG 

12 51 CAAATCCGAG CGCGGCGCGG CCACCTGCGA AGCCATCGCC GACAACAAAG 

1301 CCGTGTACCT CATGGCAGTC GGCGGCGCGG CGTATCTCGT GGCAAAAGCC 

1351 ATCAAATCTT CCAAAGTCTT GGCGTTCCCC GAATTGGGCA TGGAAGCCAT 

14 01 TTACGAATTT GAAGTCAAAG ACATGCCCGT AACCGTCGCC GTAGACAGCA 

1451 AAGGCGAATC CATCCACGCC ACCGCCCCGC CCCAATGGCA GGCGAAAATC 

1501 GGCATCATCC CCGTCAAATC TTGA 

This corresponds to the amino acid sequence <SEQ ID 222; ORF 050-l.a>: 

a050-l .pep 

1 MTVIKQEDFI QSICDAFQFI SYYHPKDYID ALYKAWQKEE NPAAKDAMTQ 

51 ILVNSRMCAE NNRPICQDTG IATVFLKVGM DVQWDADMSV EEMVNEGVRR 

101 AYTWEGNTLR ASVLADPAGK RQNTKDNTPA VIHMSIVPGD KVEVTCAAKG 

151 GGSENKSKLA MLNPSDNIVD WVLKTIPTMG AGWCPPGILG IGIGGTPEKA 

201 VLMAKESLMS HIDIQELQEK AASGAELSTT EALRLELFEK VNALGIGAQG 

2 51 LGGLTTVLDV KILDYPTHAA SKPIAMIPNC AATRHVEFEL DGSGPVELTP 

301 PRVEDWPDLT YSPDNGKRVD VDKLTKEEVA SWKTGDVLLL NGKILTGRDA 

351 AHKRLVDMLD KGEELPVDFT NRLIYYVGPV DPVGDEIVGP AGPTTATRMD 

4 01 KFTRQMLEQT DLLGMIGKSE RGAATCEAIA DNKAVYLMAV GGAAYLVAKA 

451 IKSSKVLAFP ELGMEAIYEF EVKDMPVTVA VDSKGESIHA TAPPQWQAKI 

501 GIIPVKS* 
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a05O-l/m050-I 98.4% identity in 507 aa overlap 

10 20 30 40 50 60 

a050-l.pep MTVIKQEDFIQSICDAFQFISYYHPKDYIDALYKAWQKEENPAAKDAMTQILVNSRMCAE 
| | | 1 | I I I I I I i t I i I M I I I I I I I i I I i i M I I I t I t I I 1 I M I I I I I t I I 1 I M I I 1 I 
m050-l MTVIKQEDFIQSICDAFQFISYYHPKDYIDALYKAWQKEENPAAKDAMTQILVNSRMCAE 

10 20 30 40 50 60 

70 80 90 100 110 120 

a050-l pep NNRPICQDTGIATVFLKVGMDVQWDADMSVEEMVNEGVRRAYTWEGNTLRASVLADPAGK 

| | | | | | | | | I I I I I I : I I I I I I I I I I I I I I I I I I M I I I I i t I I I I I I I I I I I I I 

m050-l NNRPICQDTGIATVFLKVGMNVQWDADMSVEEMVNEGVRRAYTWEGNTLRASVLADPAGK 

70 80 90 100 110 120 

130 140 150 160 170 180 

a050-l pep RQNTKDNTPAVIHMSI VPGDKVEVTCAAKGGGSENK5KLAMLNPSDNIVDWVLKTI PTMG 
| | | | | | | | | | M II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I M I I I I I I 1 I I 
m050-l RQNTKDNTPAVIHMSIVPGGKVEVTCAAKGGGSENKSKLAMLNPSDNIVDWVLKTIPTMG 

130 140 150 160 170 180 

190 200 210 220 230 240 

a050-l pep AGWCPPGILGIGIGGTPEKAVLMAKESLMSHIDIQELQEKAASGAELSTTEALRLELFEK 
M | | 1 | | | | | | | I I I II I I I I I 1 I I II I I I I I I I I I I I I I I I I I I M I i I I M I I I I M I 
m050-l AGWCPPGILGIGIGGTPEKAVLMAKESLMSHIDIQELQEKAASGAELSTTEALRLELFEK 

190 200 210 220 230 240 

250 260 270 280 290 300 

a050-l pep VNALGIGAQGLGGLTTVLDVKILDYPTHAASKPIAMIPNCAATRHVEFELDGSGPVELTP 
| | | | | | | | ! | | | | | I I I I I M I I I I I I I I I II I I I I I I 1 I I I I I I I M I I I I I I I I I I t I 
m050-l VNALGIGAQGLGGLTTVLDVKILDYPTHAASKPIAMIPNCAATRHVEFELDGSGPVELTP 
250 260 270 280 290 300 

310 320 330 340 350 360 

a050-l pep PRVEDWPDLTYSPDNGKRVDVDKLTKEEVASWKTGDVLLLNGKILTGRDAAHKRLVDMLD 
| M I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I : 
m050-l PRVEDWPDLTYSPDNGKRVDVDKLTKEEVASWKTGDVLLLNGKILTGRDAAHKRLVDMLN 
310 320 330 340 350 360 

370 380 390 400 410 420 

a050-l pep KGEELPVDFTNRLIYYVGPVDPVGDEIVGPAGPTTATRMDKFTRQMLEQTDLLGMIGKSE 
| | | | | | | | | I I | I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I M I I I I I I I I t I I I I 
mO 5 0 - 1 KGEELPVDFTNRLI YYVGPVDPVGDEWGPAGPTTATRMDKFTRQMLEQT DLLGMIGKSE 

370 380 390 400 410 420 

430 440 450 4 60 470 480 

a050-l pep RGAATCEAIADNKAVYLMAVGGAAYLVAKAIKSSKVLAFPELGMEAI YEFEVKDMPVTVA 

| | : I I 1 I I I I I I I I I I I I I I I I II I I I I I I I I I I I" I I I I I I I I I I I I I I I I I I I I 

m050-l RGVATCEAIADNKAVYLMAVGGAAYLVAKAIKSSKVLAFPELGMEAI YEFEVKDMPVTVA 

430 440 450 460 470 480 

490 500 
a050-l.pep VDSKGESIHATAPPQWQAKIGIIPVKSX 
I 1 I I I I I I I I I I I : I I I I I I II I I : I I 
m050-l VDSKGESIHATAPRKWQAKIGIIPVESX 

490 500 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 223>: 

g052 . seq 

1 ATGGCTTTGG TGGCGGAGGA AACGGAAATA TCCGCGCCGT GTTTCAAAGG 

51 CTGCGAGCCG ACGGGCGACA GCAGGCTGTT GTCCACCACC AAGAGCGCGC 

101 CGATGCCGTG CGCCAATTCC GCCAAGGCTT CCAAGTCGGC CACTTCGCCC 

151 AAGGGGTTGG ACGGCGTTTC CAAAAACAGC AGTTTGGTGT TGGCTTTGAC 

201 GGCGGCTTTC CATTCATTTA TATCAGTCGG CGACACGCGG CTCACTCCGA 

2 51 TGCCGAATTT GGTAACGATG TTATTGATAA AGCCGACGGT CGTGCCGAAC 

301 AGGCTGCGGC TGGAAACCAC ATGGTCGCCC GCCTGCAGGA AGGTGAAAAA 

351 CGCCGCCTGA 

This corresponds to the amino acid sequence <SEQ ID 224; ORF 052.ng>: 

g052 .pep 
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1 MALVAEETEI SAPCFKGCEP TGDSRLLSTT KSAPMPCANS AKAS KS ATS P 

51 KGLDGVSKNS SLVLALTAAF HSFISVG DTR LTPMPNLVTM LLIKPTWPN 

101 RLRLETTWSP ACRKVKNAA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 225>: 

m052 . seq 

1 ATGGCTTTGG TGGCGGAGGA AACGGAAATA TCCGCGCCGT GTTTCAAAGG 

51 CTGCGAGCCG ACGGGCGACA GCAGGCTGTT GTCCACCACC AAGAGCGCGC 

101 CGATGCCGTG CGCCAATTCC GCCAAGGCTT CCAAGTCGGC CACTTCGCCC 

151 AAGGGGTTGG ACGGCGTTTC C AAAAAC AG C AGTTTGGTGT TGGC TTTGAC 

2 01 GGCGGCTTTC CATTCATTTA TATCAGTCGG CGACACGCGG CTCACTCCGA 

251 TGCCGAATTT GGTAACGATG TTATTGATAA AGCCGACGGT CGTGCCGAAC 

301 AGGCTGCGGC TGGAAACCAC ATGGTCGCCC GCCTGCAGGA AGGTGAAAAA 

351 CGCCGCCTGA 

This corresponds to the amino acid sequence <SEQ ID 226; ORF 052>: 

m052 .pep 

1 MALVAEETEI SAPCFKGCEP TGDSRLLSTT KSAPMPCANS AKAS KS ATS P 

51 KGLDGVSKNS SLVLALTAAF HSFISVG DTR LTPMPNLVTM LLIKPTWPN 

101 RLRLETTWSP ACRKVKNAA* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 227>: 

a052 . seq 

1 ATGGCTTTGG TCGCGGAGGA AACGGAAATA TCCGCGCCGT GTTTCAAAGG 

51 CTGAGAGCCG ACAGGCGACA GCAGGCTGTT GTCCACCACC AAGAGCGCGC 

101 CGATGCCGTG CGCCAATTCC GCCAAGGCTT CCAAGTCGGC CACTTCTCCC 

151 AAGGGATTGG ACGGCGTTTC CAAAAACAGC AGTTTGGTGT TGGCTTTGAC 

201 GGCGGCTTTC CATTCGTTTA TATCAGTCGG CGACACGTGA CTCACTTCGA 

251 TGCCGAATTT GGTAACGATG TTATTGATAA AGCCGACGGT CGTGCCGAAC 

301 AGGCTGCGGC TGGAAATCAC ATGGTCGCCC GCCTGCAAAA AGGTGAAAAA 

351 CGCCGCCTGA 

This corresponds to the amino acid sequence <SEQ ID 228; ORF 052. a>: 

a052 .pep 

1 MALVAEETEI SAPCFKG*EP TGDSRLLSTT KSAPMPCANS AKASKSATSP 

51 KGLDGVSKNS SLVLALTAAF HSFISVG DT* LTSMPNLVTM LLIKPTWPN 

101 RLRLEITWSP ACKKVKNAA* 



m052/a052 95.8% identity over a 1 19 aa overlap 

10 20 30 40 50 60 

m052 .pep MALVAEETEISAPCFKGCEPTGDSRLLSTTKSAPMPCANSAKASKSATSPKGLDGVSKNS 
I I I I I I I I I I II I I I I I I I I I I M I J I I ! 1 I I I 1 I I I I I I I M I I I I I I I I !] I I I! II 
a052 MALVAEETE I SAPCFKGXE PTGDSRLLSTTKSAPMPCANSAKASKSAT S PKGLDGVSKNS 

10 20 30 40 50 60 



70 80 90 100 110 120 

m052 . pep SLVLALTAAFHSFISVGDTRLTPMPNLVTMLLIKPTWPNRLRLETTWSPACRKVKNAAX 
I 1 I I I M I M I I I M I I I I !l I I II I M I II I I I I I I I I I I I I I I I t II : II I I M I 
a052 SLVLALTAAFHSFISVGDTXLTSMPNLVTMLLIKPTWPNRLRLEITWSPACKKVKNAAX 

70 80 90 100 110 120 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from TV. gonorrhoeae 

ORF 052 shows 95.8% identity over a 1 19 aa overlap with a predicted ORF (ORF 052.ng) 
from N. gonorrhoeae: 

m052/g052 



10 20 30 40 50 60 

m052 . pep MALVAEETE I SAPCFKGCEPTGDSRLLSTTKSAPMPCANSAKASKSATSPKGLDGVSKNS 

III Mil IIIIMIIIMI IIIIIIMI IMIIIIIIIIIIIIIII Mlillllll I III 



WO 99/057280 



PCT/US99/09346 

259 



g 052 MALVAEETEISAPCFKGCEPTGDSRLLSTTKSAPMPCANSAKASKSATSPKGLDGVSKNS 

10 20 30 40 50 60 

70 80 90 100 110 120 

mO 5 2 pep SLVLALTAAFHSFISVGDTWLTSMPNLATMLLIKPTWPNRLRLEITVJS PACKKVKNAAX 

I II I I I i Ml I I I 111 I !i M I M i M I Ml MM M I Ml M lllllhlMIIM 

g 0 5 2 S LVLALTAAFHS F I S VGDTRLTPMPNLVTMLL I KPTWPNRLRLETTWS P ACRKVKNAAX 

70 80 90 100 110 120 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 229>: 

g073 . seq 

1 ATGTGTATGC CATACGCAAT AAGGGTT T C A GACGGCATCT GCCGCATTTT 

51 TCCGCCGATG CCGTCTGAAA CACGCAATCA GCGCGCGAGT GCCTGTTTCA 

101 AATCGTCAAT CAAATCGCCA ACATATTCCA AACCG AC CG A CAGGCGCACC 

151 AGTCCGGGGC GGatacCGGC GGCGAGTTTT TCTTCGGGCT GCATCCTGCC 

2 01 GTGCGTGGTT GTCCACGGAT TGGTGATGGT CGAGCGCACG TCGCCGAGGT 

2 51 TGGCGGTACG GGAAAAGAGT TCCACGACTT TCCACGCGGC TGCTTGGTCG 

3 01 GCGACTTCAA AACCGATGAC GATGCCGCCG CCGTTTTGCT GTTTGCGGAT 
3 51 AAGCTCCGCC TGCGGATGGT CGGGCAATCC GGTGTAG 

This corresponds to the amino acid sequence <SEQ ID 230; ORF 073 .ng>: 

g073 .pep 

1 MCMPYAIRVS DGICRIFPPM PSETRNQRAS ACFKSSIKSP TYSKPTDRRT 

51 SPGRIPAASF SSGCILPCW VHGLVMV ERT SPRLAVREKS STTFHAAAWS 

101 ATSKPMTMPP PFCCLRISSA CGWSGNPV* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 23 1>: 

m073 . seq 

1 ATGTGTATGC CATATAAGAT AAGGGTTTCA GACGGCATCT GCTGTCCAAT 

51 GCCGTCTGAA ACACGCAATC AGCGTGCGAG TGCCTGTTTC AAATCGTCAA 

101 TCAAATCGCC AAC AT ATT C C AAACCGACCG ACAGGCGCAC CAATCCGGGG 

151 CGGATGTTGG CGGCGAGTTT TTCTTCGGGC TGCATCCTGC CGTGCGTGGT 

201 TGTCCACGGG TGGGTAATGG TCGAGCGCAC GTCACCGAGG TTGGCGGTGC 

251 GGGAAAAGAG TTCCACGCCG TCCACAACTT TCCACGCCGC TTCTTGATCG 

3 01 GCAACTTCAA AG C CGATGAC GATGCCGCCG CCGTTTTGCT GTTTGCGGAT 

3 51 AAGCGCCGCC TGAGGATGGT CGGACAATCC GGTGTAG 

This corresponds to the amino acid sequence <SEQ ID 232; ORF 073>: 

m073 .pep 

1 MCMPYKIRVS DGICCPMPSE TRNQRASACF KSSIKSPTYS KPTDRRTNPG 

51 RMLAASF SSG CILPCVWHG WVMV ERTSPR LAVREKSSTP STTFHAASXS 

101 ATSKPMTMPP PFCCLRISAA XGWSDNPV* 



The following partial DNA sequence was identified in N. meningitidis <SEQ LD 233>: 

a073 . seq 

1 ACGTGTATGT CATATAAGAT AAGGGTTTCA GACGGCATTT GCGGTGTTTT 

51 TCCGCCGATG CCGTCTGAA . CACGCAATCA GCGCGCGAGT GCCTGTTTCA 

101 AATCGTCAAT CAAATCGCCA ACATATTCCA AACCGACCGA CAGGCGCACC 

151 AATCCGGGGC GGATGTTGGC GGCGAGTTTT TCTTCGGGCT GCATCCTGCC 

201 GTGCGTGGTT GTCCACGGAT GGGTAATGGT CGAGCGCACG TCGCCGAGGT 

251 TGGCGGTACG GGAGAAAAGT TCGACGCCGT CCACGACTTT CCACGCGGCT 

301 GCTTGGTCGG CGACTTCAAA GCCGATGACG ATGCCGCCGC CGTTTTGCTG 

351 TTTGCGGATA AGCTCCGCCT GAGGATGGTC GGGTAATCCG GTGTAA 

This corresponds to the amino acid sequence <SEQ ID 234; ORF 073. a>: 

a073 .pep 

1 TCMSYKIRVS DGICGVFPPM PSEXRNQRAS ACFKSSIKSP TYSKPTDRRT 

51 NPGRMLAASF SSGCILPCW VHGWVMV ERT SPRLAVREKS STPSTTFHAA 

101 AWSATSKPMT MPPPFCCLRI SSA*GWSGNP V* 



m073/a073 92.3% identity over a 130 aa overlap 
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10 20 30 40 50 

m073.pep MCMPYKIRVSDGICC PMPSETRNQRASACFKSSIKSPTYSKPTDRRTNPGRMLAASF 

|| M I I I I I I I I II I M : i I I I I I I I I I I I I I II I I I I I I II I II I I I I I I 1 I I 

a 07 3 TCMSYKIRVSDGICGVFPPMPSEXRNQRASACFKSSIKSPTYSKPT DRRTNPGRMLAASF 

10 20 30 40 50 60 



60 70 80 90 100 110 

m07 3.pep SSGCILPCVWHGWVMVERTSPRLAVREKSSTPSTTFHAASXSATSKPMTMPPPFCCLRI 
I | | | | | | | | | | | | I I I I I I M II II II 1 I I I M I I I I I I t : I I I I I I M I I I M I I I I I 
a 07 3 SSGCILPCVWHGWVMVERTSPRLAVREKSSTPSTTFHAAAWSATSKPMTMPPPFCCLRI 

70 80 90 100 110 120 



120 129 
m073.pep SAAXGWSDNPVX 

I : I I I I I I I I I 
a073 SSAXGWSGNPVX 

130 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from TV. gonorrhoeae 

ORF 073 shows 87.0% identity over a 131 aa overlap with a predicted ORF (ORF 073.ng) 
from N. gonorrhoeae: 

m073/g073 

10 20 30 40 50 

MCMPYKI RVSDG ICC PMPSE TRNQRAS ACFKS SIKS PTYS KPTDRRTNPGRMLAAS F 

Mill MINIM I 1 1 1 I 1 1 1 1 1 1 I I 1 I I M I I 1 1 I 1 I I 1 I I I ! I = M I = I I I I 

MCMP YAI RVSDG I CRI FPPMPSETRNQRAS ACFKS SIKS PTYS KPTDRRTS PGR I PAASF 
10 20 30 40 50 60 

60 70 80 90 100 110 

SSGCILPOTVVHGWVMVERTSPRLAVREKSSTPSTTFHAASXSATSKPMTMPPPFCCLRI 

Illllllllllll MINIUM Mill Ml 11111= I I I I 1 1 I M 1 1 1 1 I I I I I 

S S GC I LPCWVHGLVMVERTS PRLAVREKS S T TFHAAAWSATSKPMTMPPPFCCLRI 

70 80 90 100 110 



m073 ♦ pep 
g073 

m073 . pep 
g073 



120 129 
m0 7 3 .pep SAAXGWSDNPVX 

hi III IMI 

gO 73 SSACGWSGNPVX 
120 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 235>: 

g07 5 . seq 

1 ATGCCGCCTT ACTTCATCAC CCTCTTAACG ATGGAAAATA CAAAAAGCGC 

51 GGCGAAAACG CCCACTACAA TCCAACCGGC TTCCATACCG TCCGCTTTTG 

101 CGGCTTCCAA AGCGTTTTTT GCCGTTTCGG GCAACGCTGC GTTTGCCTGT 

151 GCCGCCAAAG CCAGCGGGGC GGCTGTTACA ACAGCCAGTT TTGCGCCGTA 

201 TTTACGGCAG GTGTTAATAA ATTTCATGAT ATTTTCCTTT ACGAAATTTT 

251 TAAAAAAATG TGTTTGCGGG CTTTGTGAAG GTTTTAGAGA CCGCCTGCCG 

3 01 GGCCTCTTAA ACTTAATCTT CTTTTTCGTA GAATCCGAAA ATTACAAATT 

3 51 CCCCGCCTAT CTCTTCCAAT GCCGAGCTAA AAGCGTCTTC ATAGCTGTCA 

4 01 TATTTACCGG CTGA 

This corresponds to the amino acid sequence <SEQ ID 236; ORF 075. ng>: 

g07 5 .pep 

1 MPPYFITLLT MENTKSAAKT PTTIQPASIP SAFAASKAFF AVSGNAAFAC 

51 AAKASGAAVT TASFAPYLRQ VLINFMIFSF TKFLKKCVCG LCEGFRDRLP 

101 GLLNLIFFFV ESENYKFPAY LFQCRAKSVF IAVIFTG* 
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The following partial DNA sequence was identified in N. meningitidis <SEQ ID 23 7>: 

m07 5 . seq 

1 ATGCCGTCTT ACTTCATCAC TCTCTTAACG ATGGAAAATA CAAAAAGCGC 

51 GGCGAAAATG CCCACTACAA TCCAACCGGC TT C CAT AC CG TCCGCTTTTG 

101 CGGCTTCCAA AGCGTTTTTT GCCGTATCGG GCAACGTTGC ATTTGCATGT 

151 GCGGCCAAAG CCAGGGGAGC AGCTGTTACA ACAGCCAGTT TTGCGCCGTA 

2 01 TTTACGGCAG GTGTTAATAA ATTTCATGAT ATTTTCCTTC AAAAAGTGTT 

2 51 TGGCGGTAAT GGATGGAGCG TTTTTCAGAC GACCGCCGAA CATCCGAAAA 

3 01 TCAGTCTTTC AAAAATC CGA AT ACG A C AAA TTCGTATTGG TTGCCGATTT 

3 51 CTTCCAAACC TGCGTTAATC GCTTCTTCGA AGTCGTAGAA ATAATCGGCA 

4 01 TTGGTGATTA A 

This corresponds to the amino acid sequence <SEQ ID 238; ORF 075>: 

m07 5 .pep 

1 MPSYFITLLT MENTKSAAKM PTTIQPASIP SAFAASKAFF AVSGNVAFAC 

51 AAKARGAAVT TASFAPYLRQ VLINFMIFSF KKCLAVMDGA FFRRPPNIRK 

101 SVFQKSEYDK FVLVADFFQT CVNRFFEWE IIGIGD* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 075 shows 65.7% identity over a 137 aa overlap with a predicted ORF (ORF 075. ng) 
from N. gonorrhoeae: 

m075/g075 

10 20 30 40 50 60 

MPSYFITLLTOENTKSAAKMPTTIQPASIPSAFAASKAFFAVSGOTAFACAAKARGAAVT 

II I M I I II Mill M II IIIIIIIMMIIMMIIMIIIhlllllll! I Ml 

MPPYFI TLLTMENTKSAAKTPTTIQPAS I PSAFAASKAFFAVSGNAAFACAAKASGAAVT 
10 20 30 40 50 60 



m0 75 . pep 
9075 



70 80 90 100 110 

m075 .pep TASFAPYLRQVLINFMIFSF KKCLAVMDGAFFRRPPNIRKSVFQKSEYDKFVLVAD 

I I I I I I I I M i I I I I I 11 I I MM : M I I : :| I :== = I 

g075 TASFAPYLRQVLINFMIFSFTKFLKKCVCGLCEGFRDRLPGLLNLIFFFVESENYKFPAY 

70 80 90 100 110 120 



120 130 
m075 .pep FFQTCVNRFFEWEI IGIGDX 

:|| :: | :| = | 
g 0 7 5 LFQCRAKSVFI AVI FTGX 

130 



The following partial DNA sequence was identified in N. meningitidis <SEQ ED 239>: 

a075 . seq 

1 ATGCCGTCTT ACTTCATCAC TCTCTTAACG ATGGAAAAGA CAAAAAGCGC 

51 GGCGAAAACG CCCACTACAA TCCAACCGGC • TTCCATACCG TCCGCTTTTG 

101 CGGCTTCCAA AGCGTTTTTT GCTGTATCGG GCAACGTTGC ATTTGCATGT 

151 GCGGCCAAAG CCAGGGGAGC AGCTGTTACA ACAGCCAGTT TTGCGCCGTA 

201 TTTACGGCAG GTGTTAATAA ATTTCATGAT ATTTTCCTTC AAAAAGTGTT 

2 51 TGGCGGTAAT GGATGGAGCG TTTTTCAGAC GACCGCCGAA CATCCGAAAA 

301 TCAGTCTTTC AAAAATCCGA ATACGACAAA TTCGTATTGG TTGCCGATTT 

351 CTTCCAAACC TGCGTTAATC GCTTCTTCGA AGTCGTAGAA ATAATCGGCA 

4 01 TTGGTGATTA A 

This corresponds to the amino acid sequence <SEQ ID 240; ORF 075. a>: 

a075 .pep 

1 MPSYFITLLT MEKTKSAAKT PTTIQPASIP SAFAASKAFF AVSGNVAFAC 

51 AAKARGAAVT TASFAPYLRQ VLINFMIFSF KKCLAVM DGA FFRRPPNIRK 

101 SVFQKSEYDK FVLVADFFQT CVNRFFEWE IIGIGD* 
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m075/a075 98.5% identity over a 1 36 aa overlap 

10 20 30 40 50. .60 

m075.pep MPSYFITLLTMENTKSAAKMPTTIQPASIPSAFAASKAFFAVSGNVAFACAAKARGAAVT 
I M I I I I I ! I I I : I I I I I I I I I I I I I M I I M I I I I I I M M I I I I i I I I I I I I i I I 1 I 
a 07 5 MPSYFITLLTMEKTKSAAKT PTT IQPAS I PSAFAASKAFFAVSGNVAFACAAKARGAAVT 

10 20 30 40 50 60 

70 80 90 100 110 120 

m07 5.pep TASFAPYLRQVLINFMI FSFKKCLAVMDGAFFRRPPNIRKSVFQKSEYDKFVLVADFFQT 

M I I 1 I M | I I ! I I I I I I ! I I I 1 i I I I I I I I I I I I I I I I I I I I I I I I I I I M M I 1 I I M 
a07 5 TASFAPYLRQVLINFMI FSFKKCLAVMDGAFFRRPPNIRKSVFQKSEYDKFVLVADFFQT 

70 80 90 100 110 120 

130 

m075.pep CVNRFFEWEIIGIGDX 
I M I I M I I I I I I I I I I 
a07 5 CVNRFFEWEIIGIGDX 

130 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 241>: 

g080 . seq 

1 ATGTGGGATA ATGCCGAAGC GATGGAACGG * CTGACGCGCT GGCTGCTTGT 

51 CATGATGGCG ATGCTGCTTG CTGCGTCCGG GCTGGTTTGG TTTTACAATT 

101 CGAATCATCT GCCCGTCAAG CAGGTGTCGC TGAAGGGCAA CCTGGTTTAT 

151 TCCGATAAGA AGGCATTGGG CAGTTTGGCG AAAGAATACA TCCATGGGAA 

201 TATTTTGAGG ACGGAC AT C A ATGGCGCACA GGAAGCCTAC CGCCGGTATC 

251 CGTGGATTGC GTCGGTCATG GTGCGCCGCC GTTTTCCCGA TACGGTTGAG 

301 GTCGTCCTGA CCGAGCGCAA GCCGGTTGCA CGTTGGGGCG ACCATGCCTT 

351 GGTGGACGGC GAAGGCAATG TTTTTGAAGC CCGCTTGGAC AGACCCGGAA 

4 01 TGCCGGTATT CAGAGGCGCG GAAGGAACGT CTGCCGAAAT GCTCCGCCGT 

4 51 TATGACGAAT TTTCGACTGT TTTGGCAAAA CAGGGTTTGG GCATCAAAGA 

501 GATGACCTAT ACGGCACGTT CGGCGTGGAA TGTCGTTTTG GACAACGGCA 

551 TCACCGTCAG GCTCGGACGG GAAAAcgaGA TGAAACGCCT CCgGCTTTTT 

601 ACcgAAGCGT GGCAGCATCT gttgcGTAAG AATAAAAATC GGTTATCCTA 

651 TGTGGATATG Aggtataagg acggatttTC agtcccccat gctCCCGACG 

7 01 GTTTACCCGA AAAAGAATcc gAAGAATatt gggaacaggt ttgggacata 

751 ttacggcctg gcgtcggaaa cggttcgacg caaatttcaa tcagttatAA 

801 GGGCAGacga acaatggaac AGcagtaa 

This corresponds to the amino acid sequence <SEQ ID 242; ORF 080.ng>: 

g080 .pep 

1 MWDNAEAMER LTRWLLVMMA MLLAASGLVW FYNSNHLPVK QVS LKGNLVY 

51 SDKKALGSLA KEYIHGNILR TDINGAQEAY RRYPWIASVM VRRRFPDTVE 

101 WLTERKPVA RWGDHALVDG EGNVFEARLD RPGMPVFRGA EGTSAEMLRR 

151 YDEFSTVLAK QGLGIKEMTY TARSAWNWL DNG I TVRLGR ENEMKRLRLF 

201 TEAWQHLLRK NKNRLSYVDM RYKDGFSVPH APDGLPEKES EEYWEQVWDI 

2 51 LRPGVGNGST QISISYKGRR TMEQQ* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ED 243>: 

m080 . seq 

1 ATGTGGGATA ATGCCGAAGC GATGGAACGG CTGACGCGCT GGCTGCTTGT 

51 CATGATGGCG ATGCTGCTTG CTGCGTCCGG GCTGGTTTGG TTTTACAATT 

101 CGAATCATCT GCCCGTCAAG CAGGTGTCGC TGAAGGGCAA CCTGGTTTAT 

151 TCCGATAAGA AGACATTGGG CAGTTTGGCG AAAGAATACA TCCATGGGAA 

2 01 TATTTTGAGG ACGGACATCA ATGGCGCACA GGAGGCCTAC CGCCGGTATC 

251 CGTGGATTGC GTCGGTCATG GTGCGCCGCC GTTTTCCCGA CACGGTTGAG 

301 GTCGTCCTGA CCGAGCGCAA GCCGGTCGCG CGTTGGGGCG ACCATGCCTT 

351 GGTGGACGGC GAAGGCAATG TTTTTGAAGC CCGCTTGGAC AGACCCGGAA 

4 01 TGCCGGTATT CAGAGGCGCG GAAGGAACGT CTGCCGAAAT GCTCCGCCGT 

451 TATGACGAAT TTTCGACTGT TTTGGCAAAA CAGGGTTTGG GCATCAAAGA 

501 GATGACCTAT ACGGCACGTT CGG CGTGG AT TGTCGTTTTG GACAACGGCA 

551 TCACCGTCAG GCTCGGACGG GAAAACGAGA TGAAACGCCT CCGGCTTTTT 
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601 ACCGAAGCGT GGC AG CAT C T GTTGCGTAAA AATAAAAATC GGTTATCCTA 
6 51 TGTGGATATG AGGTATAAGG ACGGATTTTC AGTCCGCTAT GCTTCCGACG 
701 GTTTACCCGA AAAAGAATCC GAAGAATAG 

This corresponds to the amino acid sequence <SEQ ID 2441; ORF 080>: 

m080 . pep 

1 MWDNAEAMER LTRWLLVMMA MLLA ASGLVW FYNSNHLPVK QVSLKGNLVY 

51 SDKKTLGSLA KEYIHGNILR TDINGAQEAY RRYPWIASVM VRRRFPDTVE 

101 WLTERKPVA RWGDHALVDG EGNVFEARLD RPGMPVFRGA EGTSAEMLRR 

151 YDE FS TVTiAK QGLGIKEMTY TARSAWIWL DNGI TVRLGR ENEMKRLRLF 

201 TEAWQHLLRK NKNRLSYVDM RYKDGFSVRY ASDGLPEKES EE* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 080 shows 97.9% identity over a 242 aa overlap with a predicted ORF (ORF 080.ng) 
from gonorrhoeae: 

m080/g080 

10 20 30 40 50 60 

MWDNAEAME RLT RWLLVMMAMLLAAS G L VW F YNS NHL P V KQ V S L KGNLV Y S D KKT LG S LA 

1 1 1 e 1 1 1 1 e 1 1 1 1 1 1 1 r i f 1 1 1 1 i r 1 1 1 1 1 e 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 ! I ( 1 1 1 1 1 = 1 1 1 1 1 

MWDNA£AMERLTRWLLV^IMAMLIJAASGLVWFYNSNHLPVKQVSLKGNLVYSDKi<ALGSLA 
10 20 30 40 50 60 

70 80 90 100 110 120 

KEY I HGN I LRTD I NGAQE AYRR YPW I AS VMVRRRF PDTVEWLTE RKPVARWGDHALVDG 

I II i Ml I il II III II I I II I III M II ' I I I! II M II I: I II II II I I ill li IM 

KE Y I HGN I L RTD I NGAQE AYRR YPW I AS VMVRRRFPDTVEVVLTE RKPVARWGDHALVDG 
70 80 90 100 110 120 

130 140 150 160 170 180 

EGNVFEARLDRPGMPVFRGAEGTSAEMLRRYDEFSTVLAKQGLGIKEMTYTARSAWIVVL 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 i i E 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 III 

EGNVFEARI^RPGMPVFRGAEGTSAEMLRRYDEFSTVLAKQGLGIKEMTYTARSAWNVVL 
130 140 150 160 170 180 

190 200 210 220 230 240 

DNG I TVRLGRENEMKRLRLFTEAWQHLLRKNKNRLSYVDMRYKDGFSVRY ASDGLPEKES 

I I II II II II Mill II I MM II II I I MM I I Ml li Mill II II = 1 IMMMI 

DNGITVRLGRENEMKRLRLFTEAWQHLLRKNKNRLSYVDMRYKDGFSVPHAPDGLPEKES 
190 200 210 220 230 240 



EEX 
I I 

EEYWEQVWDI LRPGVGNGSTQI S IS YKGRRTME QQX 
250 260 270 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 245>: 

a080 . seq 

1 ATGTGGGATA ATGCCGAAGC GATGGAACGG CTGACGCGCT GGCTGCTTGT 

51 CAT GAT GG C G ATGCTGCTTG CTGCGTCCGG GCTGGTTTGG TTTTACAATT 

101 CGAATCATCT GCCCGTCAAG CAGGTGTCGC TGAAGGGCAA CCTAGTTTAT 

151 TCCGATAAGA AAGCATTGGG CAGTTTGGCG AAAGAATACA TCCATGGGAA 

201 TATTTTGAGG ACGGACATCA ATGGCGCACA GGAGGCCTAC CGCCGGTATC 

251 CGTGGATTGC GTCGGTCATG GTGCGCCGCC GTTTTCCCGA CACGGTTGAG 

301 GTCGTCCTGA CCGAGCGCAA GCCGGTCGCG CGTTGGGGCG ACCATGCCTT 

351 GGTGGACGGC GAAGGCAATG TTTTTGAAGC CCGTTTGGAC AGACCCGGAA 

4 01 TGCCGGTATT CAGAGGCGCG GAAGGAACGT CTGCCGAAAT GCTCCGCCGT 

4 51 TATGACGAAT TTTCGACTGT TTTGGCAAAA CAGGGTTTGG GCATCAAAGA 

501 GATGACCTAT ACGGCACGTT CGGCGTGGAT TGTCGTTTTG GACAACGGCA 



m08 0 . pep 
080 ■ 

m080 . pep 
080 

m08 0 . pep 
080 

mO 8 0 . pep 
080 

m080 .pep 
080 
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551 TCACCGTCAG GCTCGGACGG GAAAACGAGA TGAAACGCCT CCGGCTTTTT 

601 ACCGAAGCGT GGCAACATCT GTTGCGTAAA AATAAAAATC GGTTATCCTA 

651 TGTGGATATG AGGTATAAGG ACGGATTTTC AGTCCGCTAT GCTCCCGACG 

7 01 GTTTACCCGA AAAAGAATCC GAAGAATAG 

This corresponds to the amino acid sequence <SEQ ID 246; ORF 080.a>: 

a080 .pep 

1 MWDNAEAMER LTRWLLVMMA MLLA ASGLVW FYNSNHLPVK QVSLKGNLVY 

51 SDKKALGSLA KEYIHGNILR TDINGAQEAY RRYPWIASVM VRRRFPDTVE 

101 VVLTERKPVA RWGDHALVDG EGNVFEARLD RPGMPVFRGA EGTSAEMLRR 

151 YDEFSTVLAK QGLGIKEMTY TARSAWIWL DNGITVRLGR ENEMKRLRLF 

201 TEAWQHLLRK NKNRLSYVDM RYKDGFSVRY APDGLPEKES EE* 

m08o/a080 99.2% identity over a 242 aa overlap 

10 20 30 40 50 60 

m080 pep MWDNAEAMERLTRWLLVMMAMLLAASGLVWFYNSNHLPVKQVSLKGNLVYSDKKTLGSLA 
| | | | [ | || | | | | M I I I M I I I I ! I M M I I II II M II I I M I I I I I I I I I I I : M It I 
a080 MWDNAEAMERLTRWLLVMMAMLLAASGLVWFYNSNHLPVKQVSLKGNLVY SDKKALGSLA 

10 20 30 40 50 60 

70 80 90 100 110 120 

m080 pep KEYIHGNILRTDINGAQEAYRRYPWIASVMVRRRFPDTVEWLTERKPVARWGDHALVDG 
| | M ! | | I I I I II I I I 11 I I I M t II I I I I I M M I I I I I I M I I I II I M M M I I I I I 
a 080 KEYIHGNILRTDINGAQEAYRRYPWIASVMVRRRFPDTVEWLTERKPVARWGDHALVDG 

70 80 90 100 110 120 

130 140 150 160 170 180 

m080 pep EGNVFEARLDRPGMPVFRGAEGTSAEMLRRYDEFSTVLAKQGLGIKEMTYTARSAWIVVL 

M || | || I I I I II I I I I I II I I 1 I I M M I I I I I I II I I I I I M M M I I I I I 

a0 80 EGNVFEARLDRPGMPVFRGAEGTSAEMLRRYDEFSTVLAKQGLGIKEMTYTARSAWIVVL 

130 140 150 160 170 180 

190 200 210 220 230 240 

m080 pep DNGITVRLGRENEMKRLRLFTEAWQHLLRKNKNRLSYVDMRYKDGFSVRYASDGLPEKES 
| | I I I I I I I I I I I I M I I I M I I I I I I M M I I I M I I 1 I II I I f I M I I I I I I I I I I I 
a 08 0 DNGITVRLGREKEMKRLRLFTEAWQHLLRKNKNRLSYVDMRYKDGFSVRYAPDGLPEKES 

190 200 210 220 230 240 



m080.pep EEX 
I I I 

a080 EEX 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 247>: 

g081 . seq 

1 ATGAAACCAC TGGAC CT AAA TTTCATCTGC CAAGCCCTCA AGCTTCCGAT 

51 GCCGTCTGAA AACAAACCCG TGTCGCGCAT CGTAACCGAC AGCCGCGATA 

101 TTCGGGAAGG CGATGTGTTT TTCGCATTGG CGGGCGGGCG GTTTGACGCG 

151 CATGATTTTG TTGGAGGCGT ATTGTCTGCG GGCGCGGCGG CGGTTGTGGT 

201 TTCGCGCGAA GATTGCGCGG CTTTGGGCGG CGCGTTGAAA GTCGATGACA 

251 CGCTTGCCGC GTTGCAAACG TTGGCGAAGG CGTGGCGCGA TAATGTGAAC 

301 CCGTTTGTGT TCGGCATTAC CGGTTCGGGC GGCAAGACGA CGGTGAAGGA 

351 GATGCTGGCT GCGGTATTGC GCCGCCGTTT CGGCGATGAT GCCGTTTCGG 

401 CGACGGCAGG CAACTTCAAC AACCACAtcg gaTTGCCGCT GACTTTATTG 

451 AAATtaaAcg a AAAACAC CG CTATGCCGTG ATTGAAATGG GCATGAACCA 

501 TTTTGGcgaa ctggcggtTt taacgcaaaT CGCCAAACCC GATGCCGCTT 

551 TGGtcaACAA CGCCCTGCGC GCCCATGTCG GATGCGGTTt cgacggagtg 

601 GGCGATATTG CCAAAGcgaa aagcGAGATT TatgcagGct tATGTTCAGA 

651 CGGCATGGCA CTGATTCCTC AAGAAGATGC CAATATGGCT GTCTTCAAAA 

701 CGGCAACGTT TAATTTGAAT ACGTGCACTT TCGGCGTCGA TAGCGGCGAT 

751 GTCCGCGCGG AAAATATCGT GCTGAAACCT TTGTCGTGCG AATTTGATTT 
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801 GGTGTGCGGC GACGAGCGCA CTGCCGTGGT GCTGCCTGTT CCCGGCCGCC 

851 ACAATGTCCA CAACGCCGCC GCTGCCGCCG CGCTGGCTTT GGCTGCCGGT 

9 01 TTGAGTTTGA ACGATGTGGC GGAAGGTTTG CAAGGCTTCA GCAACATCAA 

9 51 AGGCCGTCTG AACGTCAAAG CCGGCATCAA GGGCGCAACC CTGATTGACG 

10 01 ATACTTATAA TGCGAATCCC G AC AG T ATG A AAGCCGCGGT TGACGTGTTG 

1051 GCGCGTATGC CTGCGCCGCG CATTTTCGTG ATGGGCGATA TGGGCGAACT 

1101 GGGCGAGGAc gaAGCCGCCG CCATGCACGC CGAagtcgGC GCGTACGCCC 

1151 GCGACCAAGG CATCGAAGCG GCTTATTTTG TCGGCGACAA CAGCGTCGAA 

12 01 GCGGcggaAA AATTTGGCGC GGACGGTTTG TGGTTCGCCG CCAAAGACCC 

12 51 GTTGATTCAA GTGTTGAGCC ACGATTTGCC CGAACGCGCC ACCGTGTTGG 

13 01 TGAAAGGTTC GCGCTTTATG CAGAtggAAG AAGTGGTCGA GGCATTGGAG 
13 51 GATAAGTga 

This corresponds to the amino acid sequence <SEQ ID 248; ORF 081. ng>: 

g081 -pep 

1 MKPLDLNFIC QALKLPMPSE NKPVSRIVTD SRDIREGDVF FALAGGRFDA 

51 HDFVGGVLSA GAAAVWSRE DCAALGGALK VDDTLAALQT LAKAWRDNVN 

101 PFVFGITGSG GKTTVKEMLA AVLRRRFGDD AVSATAGNFN NHIGLPLTLL 

151 KLNE KHRY AV IEMGMNHFGE LAVLTQIAKP DAALVNNALR AHVGCGFDGV 

201 GDIAKAKSEI YAGLCSDGMA LIPQEDANMA VFKTATFNLN TCTFGVDSGD 

2 51 VRAENIVLKP LSCEFDLVCG DERTAWLPV PGRHNVHNAA AAAALALAAG 
301 LSLNDVAEGL QGFSNIKGRL NV KAG I KG AT LIDDTYNANP DSMKAAVDVL 
351 ARMPAPRIFV MGDMGELGED EAAAMHAEVG AYARDQGIEA AYFVGDNSVE 
4 01 AAEKFGADGL WFAAKDPLIQ VLSHDLPERA TVLVKGSRFM QMEEWEALE 
451 DK* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ED 249>: 

m081 . seq 

1 ATGAAACCAC TGGACCTAAA TTTCATCTGC CAAGCCCTCA AGCTTCCGAT 

51 GCCGTCTGAA AGCAAACCCG TGTCGCGCAT CGTAACCGAC AGCCGCGACA 

101 TCCGCGCGGG CGATGTGTTT TTCGCATTGG CGGGCGAGCG GTTTGACGCG 

151 CATGATTTTG TTGAAGACGT ATTGGCTGCT GGTGCGGCGG CGGTTGTGGT 

201 TTCGCGCGAA GATTGTGCTG CAATGGATGG CGCGTTGAAA GTCGATGACA 

251 CGCTTGCCGC ATTGCAAACG CTGGCAAAGG CGTGGCGTGA AAATGTGAAT 

3 01 CCGTTTGTGT TCGGCATTAC CGGTTCGGGC GGCAAGACGA CGGTGAAGGA 
3 51 AATGCTGGCT GCGGTATTGC GCCgCCGTTT CGGCGATGAT GCCGTGTTGG 
401 CGACGGCAGG CAACTTCAAC AACCATATCG GATTGCCGCT GACTTTGTTG 
451 AAGTTAAACG AAAAACACCG CTATGCCGTG ATTGAAATGG G CAT GAAC C A 
501 TTTCGGCGAA CTGGCGGTTT TAACGCAmAT CGCCAAACCA AATGC CGCAT 
551 TGGTCAACAA CGCCATGCGC GCCCATGTCG GCTGCGGTTT CGACGGAGTG 
601 GGCGATATTG CCAAAGCGAA AAGCGAGATT TACCAAGGTT TATGTTCAGA 
651 CGGCATTGCA CTGATTCCTC AAGAAGATGC CAATATGGCT GTCTTCAAAA 
701 CGGCAACGCT TAATTTGAAT ACGCGCACTT TCGGCATCGA TAGCGG CG AT 
751 GTTCACGCGG AAAATATTGT GCTGAAAC CG TTGTCGTGCG AATTTG ATT T 
801 GGTGTGCGGC GATGAGCGCG CCGCCGTGGT GCTGCCTGTT CCCGGCCGCC 
851 ACAATGTCCA CAACGCCGCC GCTGCCGCCG CGCTGGCTTT GGCTGCGGGT 
901 TTGAGTTTGA ACGATGTGGC GGAAGGTTTG AAAGGCTTCA GCAAT AT CAA 
951 AGGCCGTCTG AACGTCAAAT CCGGAATCAA GGGCGCAACC CTGATTGACG 

1001 ATACTTATAA TGCGAACCCT GACAGCATGA AAGCTGCGAT TGACGTGTTG 

1051 GCGCGTATGC CTGCGCCGCG TATTTTCGTG ATGGGCGATA TGGGCGAACT 

1101 GGGCGAACTG GGCGAGGACG AAGCCGCCGC TATGCACGCC GAAGTCGGCG 

1151 CGTATGCCCG CGACCAAGGC ATCGAAGCGG CTTATTTTGT CGGCGACAAC 

1201 AGCGTCGAAG CGGCGGAAAA ATTTGGCGCG GACGGTTTGT GGTTCGCCGC 

1251 CAAAGACCCG TTGATT CAAG TGTTGCGCCA CGATTTGCCC GAACGCGCCA 

13 01 CCGTGTTGGT G AAAGGTT CG CGCTTTATGC AGATGGAAGA AGTGGTCGAG 

13 51 GCATTGGAGG ATAAGTGA 

This corresponds to the amino acid sequence <SEQ ID 250; ORF 08 1>: 

m081 .pep 

1 MKPLDLNFIC QALKLPMPSE SKPVSRIVTD SRDIRAGDVF FALAGERFDA 

51 HP FVEDVLAA GAAAVWS RE DCAAMDGALK VDDTLAALQT LAKAWRENVN 

101 PFVFGITGSG GKTTVKEMLA AVLRRRFGDD AVLATAGNFN NHIGLPLTLL 

151 KLNE KHRY AV IEMGMNHFGE LAVLTX I AKP NAALVNNAMR AHVGCGFDGV 
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2 01 GDIAKAKSEI YQGLCSDGIA LIPQEDANMA VFKTATLNLN TRTFGIDSGD 

2 51 VHAENIVLKP LSCEFDLVCG DERAAWLPV PGRHNVHNAA AAAALALAAG 
301 LSLNDVAEGL KGFSNIKGRL NVKSGIKGAT LIDDTYNANP DSMKAAIDVL 

3 51 ARMPAPRIFV MGDMGELGEL GEDEAAAMHA EVGAYARDQG ' I EAAYFVGDN 

4 01 SVEAAEKFGA DGLWFAAKDP LIQVLRHDLP ERATVLVKGS RFMQMEEWE 
4 51 ALEDK* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 081 shows 94.1% identity over a 455 aa overlap with a predicted ORF (ORF 081.ng) 
from N. gonorrhoeae: 

m081/g081 

m081 .pep 
g081 



10 20 30 40 50 60 

MKPLDI^FICQALKLPMPSESKPVSRIVTDSRDIRAGDVFFALAGERFDAHDFVEDVLAA 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 = I 1 1 1 1 i 1 1 1 1 1 1 1 I MINIMI MMMM MM 

M K P LDLN F I CQ AL KL PM P S EN KP V SRI VTDS RD I RE GD V F F ALAGGR FD AHD FVGG VL S A 
10 20 30 40 50 60 



70 80 90 100 110 120 

GAAAVWSREDCAAITOGALKVDDTLAAIjQTLAKAWR^ 

MMMIMMMh MMMMMMMMMMMMMMMMMMMMMM 

G AAA VW S RE D C AALGG AL KVDD T LAALQTLAKAW RDNVN P FV FG I TG S GG KTTV KE MLA 
70 80 90 100 110 120 



130 140 150 160 170 180 

AVLRRRFGDDAVLATAGNFNNH I GLPLTLLKLNEKHRYAVI EMGMNHFGELAVLTXI AKP 

MMMIMIM IMMMIMMMIMMMMMMMMMMMIMM MM 

AVLRRRFGDDAVSATAGNFNNHIGLPLTLLKLNEKHRYAVIEMGMNHFGELAVLTQIAKP 
130 14 0 150 16 0 170 180 



190 200 210 220 230 240 

NAALVNNAMRAHVGCGFDGVGDIAKAKSEIYQGLCSDGIALIPQEDANiyiAVFKTATLNLN 

MMMMMMMMMMMMMIMM M M M M M M M M M M M M M M 

D AAli VNN ALRAHVGCG FDG VGD I AKAKS E I YAGLCSDGMALI PQEDANMAVF KTATFNLN 
190 200 210 220 230 240 



250 260 270 280 290 300 

m081 pep TRTFGI DSGDVHAEN I VLKPLS CE FDLVCGDERAAVVLP VPGRHNVHNAAAAAALALAAG 

I MhMMMMMMMMMMMMMMMMMMMMMMMMMMM 

g081 TCTFGVDSGDVRAEN I VLKPLS CE FDLVCGDERTAWLP VPGRHNVHNAAAAAALALAAG 

250 260 270 280 290 300 



310 320 330 340 350 360 

m081 . pep LSLNDVAEGLKGFSNIKGRLNVKSGIKGATLIDDTYNANPDSMKAAIDVLARMPAPRIFV 

I I I ! I I I ! I h I 1 1 1 I I 1 1 I ! I h 1 1 1 1 I I 1 1 I I I I I I M I I I I 1 1 : 1 I I 1 1 I I I I M 1 1 

g 0 8 1 LSLNDVAEGLQGFSNI KGRLNVKAGI KGAT LIDDTYNANP DSMKAAVDVLARM PAP R I FV 

310 320 330 340 350 360 



370 380 390 400 410 420 

m0 8 1 pep MGDMGELGELGEDEAAAMHAEVGAYARDQGI EAAYFVGDNSVEAAEKFGADGLWFAAKDP 

MMMMI MIMMMMMMIMMMMMMMMMMMMMMIMI 

g 0 8 1 MGDMGELGE DE AAAMHAEVGAYARDQGI EAAY FVGDNSVE AAEKFGADGLW FAAKDP 

370 380 390 400 410 



430 440 450 

m081 . pep LIQVLRHDLPERATVLVKGSRFMQMEEWEALEDKX 

I M 1 1 MMMMMMIMMMMMMMMI 

g081 LIQVLSHDLPERATVLVKGSRFMQMEEWEALEDKX 
420 430 440 450 
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The following partial DNA sequence was identified in N. meningitidis <SEQ ID 25 1>: 

a081 . seq 

1 ATGAAACCAC TGGACCTAAA TTTCATCTGC CAAGCCCTCA AGCTTCCGAT 

51 GCCGTCTGAA AGCAAACCCG TGTCGCGCAT CGTAACCGAC AGCCGCGACA 

101 TCCGCGCGGG CGATGTGTTT TTCGCATTGG CGGGCGGGCG GTTTGATGCG 

151 CATGATTTTG TTGAAGACGT ATTGGCTGCG GGTGCGGCGG CGGTTGTGGT 

201 TTCGCGCGAA GATTGCGTTG CAATGGATGG CGCGTTGAAA GTCGATGACA 

2 51 CGCTTACCGC GTTGCAAATG TTGGCGAAGG CGTGGCGCGA GAATGTGAAC 

301 CCGTTTGTGT TCGGTATTAC CGGCTCGGGC GGCAAGACGA CGGTGAAGGA 

351 AATGTTGGCT GCGGTATTGC GCCGCCGTTT CGGCGATAAT. GCCGTTTTGG 

4 01 CGACGGCAGG CAACTTCAAC AACCACATCG GATTGCCGTT GACTTTGTTG 

4 51 AAATTAAACG AAAAAC AC C G CTATGCCGTG ATTGAAATGG GTATGAACCA 

501 TTTTGGCGAA CTGGCGGTTT TGACACAAAT CGCCAAACCC GATGCCGCAT 

551 TGGTCAACAA CGCCATGCGC GCCCATGTCG GCTGCGGTTT CGACGGAGTG 

601 GGCGATATTG CCAAAGCGAA AAGCGAGATT TATCAAGGCT TATGTTCAGA 

651 CGGCATGGCG CTGATTCCTC AAGAAGATGC CAATATGGCT GTCTTCAAAA 

7 01 CGGCAACGCT TAATTTGAAT ACGCGCACTT TCGGCATCGA TAGCGGCGAT 

7 51 GTCCACGCGG AAAATATCGT GCTGAAACCG TTGTCGTGCG AATTTGATTT 

801 GGTGTGCGGC AACGAGTGCG CAGCCGTGGT TCTGCCCGTT CCCGGCCGCC 

851 ACAATGTCCA CAACGCCGCC GCCGCCGCCG CGCTGTCTTT GGCTGCAGGT 

901 TTGAGTTTGA ACGATGTGGC GGAAGGTTTG AAAGGCTTCA GCAATATCAA 

951 AGGCCGTCTG AACGTCAAAT CCGGAATCAA GGGCGCAACC CTGATTGACG 

1001 ATACTTATAA TGCGAACCCT G AC AG CAT G A AAGCTGCGGT TGACGTGTTG 

1051 GCGCGTATGC CTGCGCCGCG TATTTTCGTG ATGGGCGATA TGGGCGAACT 

1101 GGGTGAGGAC GAAGCCGCCG CCATGCACGC CGAAGTCGGC GCGTACGCCC 

1151 GCGACCAAGG CATCGAAGCG GCTTATTTTG TCGGCGACAA CAGCGTCGAA 

1201 GCGGCGGAAA AATTTGGCGC GGACGGTTTG TGGTTCGCCG CCAAAGACCC 

12 51 GTTGATTCAA GTGTTGCGCC ACGATTTGCC CGAACGCGCC ACCGTGTTGG 

1301 TGAAAGGTTC GCGCTTTATG CAGATGGAAG AAGTGGTCGA GGCATTGGAG 

1351 GATAAGTGA 

This corresponds to the amino acid sequence <SEQ ID 252; ORF 08La>: 

a081 .pep 

1 MKPLDLNFIC QALKLPMPSE SKPVSRIVTD SRDIRAGDVF FALAGGRFDA 

51 HDFVEDVLAA GAAAVWSRE DCVAMDGALK VDDTLTALQM LAKAWRENVN 

101 PFVFGITGSG GKTTVKEMLA AVLRRRFGDN AVLATAGNFN NHIGLPLTLL 

151 KLNEKHRYAV IEMGMNHFGE LAVLTQIAKP DAALVNNAMR AHVGCGFDGV 

201 GDIAKAKSEI YQGLCSDGMA LIPQEDANMA VFKTATLNLN TRT FGIDSGD 

251 VHAENIVLKP LSCEFDLVCG NECAAWLPV PGRHNVHNAA AAAALSLAAG 

301 LSLNDVAEGL KGFSNIKGRL NVKSGIKGAT LIDDTYNANP DSMKAAVDVL 

351 ARMPAPRIFV MGDMGELGED EAAAMHAEVG AYARDQGIEA AYFVGDNSVE 

4 01 AAEKFGADGL WFAAKDPLIQ VLRHDLPERA TVLVKGSRFM QMEEWEALE 

451 DK* 

m08i/a08i 96.7% identity over a 455 aa overlap 

10 20 30 40 50 60 

m081 .pep MKPLDLNFICQALKLPMPSESKPVSRIVTDSRDIRAGDVFFALAGERFDAHDFVE DVLAA 

I I I I M I I I I I M I 11 I I I It I I I M I I M I I II I I I I I I I I I M I I I I I I I I I II I I I 
a 081 MKPLDLNFICQALKLPMPSESKPVSRIVTDSRDIRAGDVFFALAGGRFDAHDFVEDVLAA 

10 20 30 40 50 60 

70 80 90 100 110' 120 

m081 . pep GAAAVAA/ S RE DCAAMDGALKVDDTLAALQT LAKAWRENVN PFVFG I TGSGGKTTVKEM LA 

I M I II I II it I : I I I I I I I : I M I I M I I I I M I I I I I I I I I I I I II I I I I M 

a081 GAAAWVSREDCVAMDGALKVDDTLTALQMLAKAWRENVNPFVFGITGSGGKTTVKEMLA 

70 80 90 100 110 120 

130 140 150 160 170 180 

m081 . pep AVLRRRFGDDAVLATAGNFNNHIGLPLTLLKLNEKHRYAVIEMGMNHFGELAVLTXIAKP 
I I I I I I I II : I I I I I I I I I I I I I I I I I I I I I I I M I I I I M M I I I II I I I I I t I I I I I 
a081 AVLRRRFGDNAVLATAGN FNNH I GLPLTLLKLNEKHRYAV IEMGMNHFGE LAVLTQIAKP 
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130 140 150 160 170 180 

190 200 210 220 230 240 

m081 .pep N AALVNN AMRAH VGCG FDG VG D I AKAK S E I YQG LC S DG I AL I PQE DANMAV FKT AT LN LN 

: I 1 I 1 I 1 I II I I I I I 11 I I M M I i I I M I I I ! I I! M : I 11 I I I! ! I I I I I I II I M I I 
a 081 DAALVNNAMRAHVGCGFDGVGDIAKAKSEIYQGLCSDGMALI PQE DANMAV FKTATLN LN 

190 200 210 220 230 240 

250 260 270 280 290 300 

m081 . pep TRTFGIDSGDVHAENIVLKPLSCEFDLVCGDERAAVVLPVPGRHNVHNAAAAAALALAAG 
I I I I I I I I It I ! I I I 11 I I I I M I I I I I I I : I I I I I I I I I M I I I I I I I I I I I I : I I I I 
a081 TRTFGIDSGDVHAENIVLKPLSCEFDLVCGNECAAVVLPVPGRHNVHNAAAAAALSLAAG 

250 260 270 280 290 300 

310 320 330 340 350 360 

m081 .pep LSLNDVAEGLKGFSNIKGRLNVKSGIKGATLIDDTYNANPDSMKAAIDVLARM PAPRIFV 

I I M I I I I I I I I I I I I I I II I I I I I E I II I I I I I I ! I I I I I I I M 1 : M II I 11 I I I I I I 
a081 LSLNDVAEGLKGFSNIKGRLNVKSGIKGATLIDDTYNANPDSMKAAVDVLARM PAPRIFV 

310 320 330 340 350 360 

370 380 390 400 410 420 

m081 . pep MGDMGELGELGEDEAAAMHAEVGAYARDQGIEAAYFVGDNSVEAAEKFGADGLWFAAKDP 
I | | I 11 I I I I M I 1 I I I I I I I I M I I 1 I I I M M I I M II I t I M I I I I I M I I 1 II 

a 081 MGDMGELGE DEAAAMHAEVGAYARDQGIEAAYFVGDNSVEAAEKFGADGLWFAAKDP 

370 380 390 400 410 

430 440 450 

m08l .pep L I QVLRHDL PERATVLVKGSRFMQMEE WEALE DKX 

I I I I II I I II M I I I I I II I I I I I I I I I I I I I I M I 
a081 LI QVLRHDLPERATVLVKGSRFMQMEE WEALE DKX 

420 430 440 450 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 253>: 



g082 . seq 








1 


aTGTGGTTGT 


TGAAGTTGCC 


TGCCGTCGCC 


51 


ACGGCGGCGC 


AATACCGCAG 


CCAGCATCTC 


101 


CCGAACCGGT 


AATGCCGAAC 


ACAAACGGGT 


151 


TTCGCCAACG 


TTTGCAACGC 


GGCAAGCGTG 


201 


GCCCAAAGCC 


GCGCAATCTT 


CGCGCGAAAC 


251 


CAGACAATAC 


GCCTCCAACA 


AAATCATGCG 


301 


AATGCGAAAA 


ACACATCGCC 


TTCCCGAATA 


351 


GCGCGACACG 


GGTTTGTTTT 


CAGACGGCAT 


401 


AGATGAAATT 


TAGGTCCAGT 


GGT TT CAT AT 


451 


GCGGCGGACA 


CATCGGTAGC 


GGCTGATTTT 


501 


GGTAAAACAC 


AGATTATTTT 


CCCATTCTCA 


551 


TATCATTTTT 


TAGACGTATT 


TTTAGC CGAT 


601 


CGGCGCGGGG 


TCGTCGGACT 


GTCTGTCGAT 


651 


CGCCCGGCAC 


ATCGGGGACA 


TTCCCCCAAA 


701 


AACTCGTCGG 


TTTCGATACC 


CGTCCAACTG 



This corresponds to the amino acid sequence <SEQ ID 254; ORF 082.ng>: 

g082 . pep 

1 MWLLKLPAVA ETASSPKRRR NTAASISFTV VLPPEPVMPN TNGFTLSRHA 

51 FANVCNAASV SSTFNAPPKA AQSSRETTTA AAPADNTPPT KSCASNRPPA 

101 NAKNTSPSRI SRLSVTMRDT GLFSDGIGSL RAWQMKFRSS GFIFAFVNIR 

151 AADTSVAADF FIACFAW KH RLFSHSHSAF FLYVSFFRRI FSRFAFSRIP 

201 RRGWGLSVD KGKVIAFARH IGDIPPKIIA VIGQLVGFDT RPTAESA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 25 5>: 

m082 . seq 

1 ATGnnGTTGT TGAAGTTGCC TGCCGTCGCC AACACGGCAT CATCGCCGAA 

51 ACGGc GGCGC AATACCGCAG CCAGCATTTC CTTCACCGTC GTCTTGCCGC 

101 CCGAACCGGT AATGCCGAAC ACAAACGGAT TCACATTTTC ACGCCACGCC 

151 TTTGCCAGCG TTTGCAATGC GGCAAGCGTG TCATCGACTT TCAACGCGCC 
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201 ATC CATTGCA GCACAATCTT CGCGCGAAAC CACAACCGCC GCCGCACCAG 

251 CAGCCAATAC GTCTTCAACA AAATCATGCG CGTCAAACCG CTCGCCCGCC 

3 01 AATGCGAAAA ACACATCGCC CGCGCGGATG TCGCGGCTGT CGGTTACGAT 

3 51 GCGCGACACG GGTTTGCTTT CAGACGGCAT CGGAAGCTTG AGGGCTTGGC 

4 01 AGATGAAATT TAGGTCCAGT GGTT TCAT AT TTACTTTCGT TAATATTCGG 
451 GCGGCGGACA CAT CGG T AG C GGCTGATTTT TTTATCGCCT GTTTTGCTGT 
501 GGTAAAACAC AG ATT AT TTT CCCATTCTCA TTCGGsATTT TTTCTGTACG 
551 TAT CATTTTT TAGACGTATT TTTAGTCGAT TTGCCTTTTC CCGCATACCA 
601 CGGCGCGGGG TCGTCGGGCA GTCCGTCGAT AAAGGCAAGG TTATTGCCTT 
651 CGCCCTGCAC ATCGGGAACA TTCCCCCAAA AAT CAT AG C C GTCATCGGGC 
7 01 AACTCGTCGG TTTCGATACC CGTCCAACTG CCGAATCCGC GTAA 

This corresponds to the amino acid sequence <SEQ ID 256; ORF 082>: 

m082 .pep 

1 MXLLKLPAVA NT ASS PKRRR NTAASISFTV VLPPEPVMPN TNGFTFSRHA 

51 FASVCNAASV SSTFNAPSIA AQSSRETTTA AAPAANTSST KSCASNRSPA 

101 NAKNTSPARM SRLSVTMRDT GLLSDGIGSL RAWQMKFRSS GFIFTFVNIR 

151 AADTSVAADF FIACFAW KH RLFSHSHSXF FLYVSFFRRI FSRFAFSRIP 

2 01 RRGWGQSVD KGKVIAFALH IGNIPPKIIA VIGQLVGFDT RPTAESA* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 082 shows 92.7% identity over a 247 aa overlap with a predicted ORF (ORF 082.ng) 
from N. gonorrhoeae: 

m082/g082 

10 20 30 40 50 60 

MX LLKLPAVANTASS PKRRRNTAAS I S FTWLPPEPVMPNTNGFTFSRHAFASVCNAASV 

I MMIMMIIIMIIMIIIIMIMMIIIIMIMMIIhlMMMMIMM 

MWLLKLPAVAETASS PKRRRNTAAS I S FTVVLPPEPVMPNTNGFTLSRHAFANVCNAASV 
10 20 30 40 50 60 

70 80 90 100 110 120 

SSTFNAPSIAAQSSRETTTAAAPAANTSSTKSCASNRSPANAKNTSPARMSRLSVTMRDT 

Illllll IIIIMMIIMIM II llllllll I I Ml II I MMMI M I MM. 

SSTFNAPPKAAQSSRETTTAAAPADNTPPTKSCASNRPPANAKNTSPSRI SRLSVTMRDT 
70 80 90 100 110 120 

130 140 150 160 170 180 

GLLSDGIGSLRAWQMKFRSSGFIFTFVNIRAADTSVAADFFIACFAWKHRLFSHSHSXF 

I I = 1 1 1 1 1 1 1 E 1 1 1 1 M I 1 1 1 I 1 1 = 1 1 1 E 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 L I 1 I I 

GLFSDGIGSLRAWQMKFRSSGFIFAFVNIRAADTSVAADFFIACFAWKHRLFSHSHSAF 
130 140 150 160 170 180 

190 200 210 220 230 240 

FLYVSFFRRI FSRFAFSRI PRRGWGQSVDKGKVI AFALHIGNI PPKI I AVIGQLVGFDT 

I Mllll i I I I M I III Ml I I II II lllllllllll llhllllMIMIIIIIIII 

FLYVSFFRRI FSRFAFSRI PRRGWGLSVDKGKVI AFARHIGDI PPKI I AVIGQLVGFDT 
190 200 210 220 23 0 24 0 



m082 .pep 
9082 

m082 -pep 
g082 

m082 .pep 
g082 

m082 . pep 
g082 



mO 8 2 . pep RPTAESAX 

llllllll 
g082 RPTAESAX 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 25 7>: 

a082 . seq 

1 ATGTGGTTGT TGAAGTTGCC TGCCGTCGCC AAAACGGCAT TATCGCCGAA 
51 ACGGCGGCGC AATACCGCAG CCAACATTTC CTTCACCGTC GTCTTGCCGC 
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101 


CCGAGCCGGT 


151 


TTCGCCAACA 


201 


ATCCATTGCA 


251 


CAGCCAATAC 


301 


AATGCGAAAA 


351 


GCGCGACACG 


401 


AG AT G AAAT T 


451 


GCGGCGGACA 


501 


GGTAAAACAC 


551 


TATCATTTTT 


601 


CGGCGCGGGG 


651 


CGCCCTGCAC 


701 


AACTCGTCGG 



AATACCGAAC 
TTTGCAACGC 
ACGCAATCTT 
GTCTTCAACA 
ACACATCGCC 
GGTTTGCTTT 
TAGGTCCAGT 
CATCGGTAGC 
AGATTATTTT 
TAGACGTATT 
TCGTCGGGCA 
ATCGGGAACA 
TTTCGATACC 



ACAAACGGGT 
GGTAAGCGTG 
CGCGCGAAAC 
AAATCATGCG 
CGCGCGGATG 
CAGACGGCAT 
GGTTTCATAT 
GGCTGATTTT 
CCCATTCTCA 
TTTAGTCGAT 
GTCCGTCGAT 
TTCCCCCAAA 
CGTCCAACTG 



TCACATTCTC 
TCATCGACTT 
CACAACCGCC 
CATCAAACCG 
TCGCGGCTGT 
CGGAAGCTTG 
TTACTTTCGT 
TTTATCGCCT 
TTCGGCATTT 
TTGCCTTTTC 
AAAGGCAAGG 
AATCATAGCC 
CCGAATCCGC 



GCGCCACGCC 
TCAACGCGCC 
GCCGCACCCG 
CCCGCCCGCC 
CGGTTACGAT 
AGGGCTTGGC 
TAATATTCGG 
GTTTTGCTGT 
TTTCTGTACG 
CCGCATACCA 
TTATTGCCTT 
GTCATCGGGC 
GTAA 



This corresponds to the amino acid sequence <SEQ ID 258; ORF 082.a>: 

a082 .pep 

1 MWLLKLPAVA KTALSPKRRR NTAANISFTV VLPPEPVIPN TNGFTFSRHA 

51 FANICNAVSV SSTFNAPSIA TQSSRETTTA AAPAANTSST KSCASNRPPA 

101 NAKNTSPARM SRLSVTMRDT GLLSDGIGSL RAWQMKFRSS GFIFTFVNIR 

151 A ADTSVAADF FIACFAW KH RLFSHSHSAF FLYVSFFRRI FSRFAFSRIP 

201 RRGWGQSVD KGKVIAFALH IGNIPPKIIA VIGQLVGFDT RPTAESA* 

m082/a082 95.5% identity over a 247 aa overlap 



10 20 30 40 50 60 

m082 . pep MXLLKLPAVANTASSPKRRRNTAASISFTVVLPPEPVMPNTNGFTFSRHAFASVCNAASV 
I [ I I M ! I I : II I I I I M I I I I : I I I I 1 I I II I 1 I : M I I I M I I i II I t : : M I : I I 
a 0 8 2 MWLLKLPAVAKTALS PKRRRNT AAN I S FTWLPPE PV I PNTNGFT FS RHAFAN I CNAVS V 

10 20 30 40 50 60 



70 80 90 100 110 120 

m082 .pep SSTFNAPSIAAQSSRETTTAAAPAANTSSTKSCASNRSPANAKNTSPARMSRLSVTMRDT 
| | 1 M 1 I I I I : I I I I I M I I I I I I I I 1 I I I I I I I I M I I I I II I I I M I 1 M I I II 1 I I 
a0 82 SSTFNAPSIATQSSRETTTAAAPAANTSSTKSCASNRPPANAKNTSPARMSRLSVTMRDT 

70 80 90 100 110 120 



130 140 150 160 170 180 

m082 . pep GLLSDGIGSLRAWQMKFRSSGFIFTFVNIRAADTSVAADFFIACFAWKHRLFSHSHSXF 
M | | I I I N I I I I I M M I I I I I I I I I I II I I I I I I I I I M I I I I I I I I I I I I I I I I I I 
a 082 GLLSDGIGSLRAWQMKFRSSGFIFTFVNIRAADTSVAADFFIACFAVVKHRLFSHSHSAF 

130 140 150 160 170 180 

190 200 210 220 230 240 

m082 .pep FLYVSFFRRI FSRFAFSRIPRRGWGQSVDKGKVIAFALHIGN I PPKI I AVIGQLVGFDT 

M II I I I I I M I I I I I I I I I f I M I 1 I I I I I II I I I I I I I I I I I I I I M I I I I II I I I I I 
a082 FLYVSFFRRI FSRFAFSRIPRRGWGQSVDKGKVIAFALHIGNIPPKIIAVIGQLVGFDT 

190 200 210 220 230 240 



m082.pep RPTAESAX 
MINIM 
a082 RPTAESAX 



The following partial DNA sequence was identified in AT. gonorrhoeae <SEQ ID 259>: 

g084 . seq 

1 ATGAAacaAT CCGcccgaat aAAAAATATG GATCAGACAT TAAAAAATAc 

51 attgggcatt tGCGCGcttt tagcctTTTG TTTTggcgcG gccaTCGCAT 

101 CAGGTTATCA CTTGGAATAT GAATACGGCT ACCGTTATTC TGC CGTGGGC 

151 GCTTTGGCTT CGGTTGTATT TTTATTATTA TTGGCACGCG GCTTCCCGCG 

2 01 CGTTTCTTCA GTTGTTTTAC TGATTTACGT CGGCACAACC GCCCTATATT 

2 51 TGCCGGTCGG CTGGCTGTAT GGTGCGCCTT CTTATCAGAT AGTCGGTTCG 

3 01 ATATTGGAAA GCAATC CTGC CGAGGCGCGT GAATTTGTCG GCAATCTTCC 
3 51 CGGGTCGCTT TATTTTGTGC AGGCATTATT TTTCATTTTT GGCTTGACAG 
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4 01 TTTGGAAATA TTGTGTATCT GTGGGGGTAT TTGCTGACGT AAAAAACTAT 

4 51 AAACGTCGCA GCAAAATATG GCTGACCATA TTATTGACTT TGATTTTGTC 

501 CTGCGCGGTG ATGGAGAAAA TCGccggcga taaAGATTGG CGAGaacctg 

551 atgccggcct gttgttgaat ATTTTcgacc tgtattaCga cttggctttc 

6 01 cgcgccggca cAATATGCCG CCAAGCGCGC CCAcattttg gaagCagcaa 

6 51 aaaaagcgtC AACATGGCAt atccgccaac ttgcgcccaa gTAtaa 

This corresponds to the amino acid sequence <SEQ ID 260; ORF 084.ng>: 

g084 .pep 

1 MKQSARIKNM DQTLKNTLGI CALLAFCFGA AIASG YHLEY EYGYR YSAVG 

51 ALASWFLLL LA RGFPR VSS WLLIYVGTT ALYL PVGWLY GAPSYQIVGS 

101 ILESNPAEAR EFVGNLP GSL YFVQALFFIF GLTVW KYCV5 VGVFADVKNY 

151 KRRSK IWLTI LLTLI LSCAV ME KIAGDKDW RE PDAGLLLN IFDLYYDLAF 

201 RAGTICRQAR PHFGSSKKSV NMAYPPTCAQ V* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 26 1>: 

m084 . seq 

1 ATGAAACAAT CCGCCcGAAT AAAa.ATATG AAT CAG AC AT T ACT TT AT AC 

51 ATTGGGCATT TGCGCGCTTT TAACCTTTnn nnnnnnnnnn nnnnnnnnnn 

101 nnnnnTATCA CCCnGAATAT GAATACGGCT ACCGTTATTC TGCCGTGGGT 

151 GCTTTGGCTT CGGTTGTATT TTTATTATTA TTGGCACGCG GTTTCCCGCG 

201 CGTTTCTTCA GTTGTTTTAC TGATTTACGT CGGCACAACC GCCCTATATT 

251 TGCCGGTCGG CTGGCTGTAT GGTGCGCCGT CTTAT CAG AT AGTCGGTTCG 

301 ATATTGGAAA GCAATCCTGC CGAGGCGCGT GAATTTGTCG GCAATCTTCC 

351 CGGGTCGCTT TATTTTGTGC AGGCATTATT TT TC ATT T TT GGCTTGACAG 

401 TTTGGAAATA TTGTGTATCG GGGGGGGTAT TTGCTGACGT AAAAAACTAT 

4 51 AAACGCCGCA GCAAAATATG G CTG ACT AT A TTATTGACTT TGATTTTGTC 

501 CTGCGCGGTG ATGGATAAAA TCGCCAGCGA TAAAGATTTG CGAGAACCTG 

551 ATGCCGGCCT GTTGTTGAAT ATTTTCGACC TGTATTACGA TTTGGCT . TC 

601 CGCGCCGGCA CAATATGCCG CCAAGCGCGC CCACATTTTG GAAGCAGCAA 

651 AAAAAGCGTC AACATGGCAT ATCCGTCATG TTGCGCCCAA GTATAA 

This corresponds to the amino acid sequence <SEQ ID 262; ORF 084>: 

m0 84 . pep 

1 MKQSARIKXM NQTLLYTLGI CALLTFXXXX XXXXX YHPEY EYGYR YSAVG 

51 ALASWFLLL LAR GFPR VSS WLLIYVGTT ALYL PVGWLY GAPSYQIVGS 

101 ILESNPAEAR EFVGNLP GSL YFVQALFFIF GLTV WKYCVS GGVFADVKNY 

151 KRRS KIWLTI LLTLI LSCAV MDKIA5DKDL RE PDAGLLLN I FDLY YDLAX 

201 RAGTICRQAR PHFGSSKKSV NMAYPSCCAQ V* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORP from N. gonorrhoeae 

ORF 084 shows 90.5% identity over a 231 aa overlap with a predicted ORF (ORF 084.ng) 
from TV! gonorrhoeae: 

m084/g084 



10 20 30 40 50 

mO 8 4 . pep MKQSARI KXMNQTLLYTLGI CALLTF YHPE YE YGYRYSAVGALAS WFLLL 

MINIM hill IMIMIhl M MMMMMIMMMMMI 

g0 8 4 MKQSARI KNMDQTL KNT LG I CALLAFCFGAAIASGYHLE YEYGYRYSAVGALASWFLLL 

10 20 30 40 50 60 



60 70 80 90 100 110 

m084 . pep LARG F P RV S S WLL I YVGTT AL Y L P VGWL YGAP SYQIVGSI LE SN P AE ARE FVGNL PGS L 

IMMMM M MMM M Ml M MM Ml MMMMMIMMMMMI MM Ml 

g084 LARGFPRVSSWLLIYVGTTALYLPVGWLYGAPSYQIVGSILESNPAEAREFVGNLPGSL 

70 80 90 100 110 120 



120 130 140 150 160 170 

m084 . pep YFVQALFFIFGLTVWKYCVSGGVFADVKNYKRRSKIWLTILLTLILSCAVMDKIASDKDL 

1 1 II I I M 1 1 1 M I M I II I IIIIIIIIIIIIIIIIIIIIIIIMIIIIIMIhIM 

g084 YFVQALFFI FGLTVWKYCVSVGVFADVKNYKRRSKIWLTI LLTLI LSCAVMEKI AGDKDW 

130 140 150 160 170 180 
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180 190 200 210 220 

m0 8 4 pep RE PDAGLLLNI FDLY YDLAXRAGTI CRQARPHFGSS KKS VNMAYPSCCAQVX 

IIIIIIIIIIIIIIMIII I MM I IMI I I III II llll I II I Mill 

g 084 RE PDAGLLLN I FDLY YDLAFRAGTI CRQARPHFGS S KKS VNMAYP PTCAQVX 

190 200 210 220 230 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 263>: 

a084 . seq 

1 ATGAAACAAT CCGCCCGAAT AAAAAATATG GATCAGACAT TAAAAAATAC 

51 ATTGGGCATT TGCGCGCTTT TAGCCTTTTG TTTTGGCGCG GCCATCGCAT 

101 CAGGTTATCA CTTGGAATAT GAATACGGCT ACCGTTATTC TGCCGTGGGT 

151 GCTTTGGCTT CGGTTGTATT TTTATTATTA TTGGCACGCG GTTTCCCGCG 

2 01 CGTTTCTTCA GTTGTTTTAC TGATTTACGT CGGCACAACC GCCCTATATT 

251 TGCCGGTCGG CTGGCTGTAT GGTGCGCCGT CTTATCAGAT AGTCGGTTCG 

301 ATATTGGAAA GCAATCCTGC CGAGGCGCGT GAATTTGTCG GCAATCTTCC 

351 CGGGTCGCTT TATTTTGTGC AGGCATTATT TTTCATTTTT GGCTTGACAG 

4 01 TTTGGAGATA TTGTGTATCG GGGGGGGTAT TTGCTGACGT AAAAAACTAT 

4 51 AAACGCCGCA GCAAAATATG GCTGACTATA TTATTGACTT TGATTTTGTC 

501 CTGCGCGGTG AT G G AT AAAA TCGCCAGCGA TAAAGATTTG CGAGAACCTG 

551 ATGCCGGCCT GTTGTTGAAT ATTTTCGACC TGTATTACGA TTTGGCTTCC 

601 . GCGCCGGCA CAATATGCCG CCAAGCGCGC CCACATTTTG GAAGCAGCAA 

651 AAAAAGCGTC AACATGGCAT ATCCGTCATG TTGCGCCCAA GTATAA 

This corresponds to the amino acid sequence <SEQ ID 264; ORF 084.a>: 

a084 .pep 

1 MKQSARIKNM DQTLKNTLGI CALLAFCFGA AIASG YHLEY EYGYR YSAVG 

51 ALASVVFLLL LA RGFPR VSS WLLIYVGTT ALYL PVGWLY GAPSYQIVGS 

101 ILESNPAEAR EFVGNLP GSL YFVQALFFIF GLTVW RYCVS GGVFADVKNY I 

151 KRRSK IWLTI LLTLILSCAV MD KIAS DKDL RE PDAGLLLN I FDLYYDLAS 

201 XAGTICRQAR PHFGSSKKSV NMAYPSCCAQ V* 

m084/a084 92.2% identity over a 23 1 aa overlap 



10 20 30 40 50 60 

m084 . pep MKQSARIKXMNQTLLYTLGICALLTFXXXXXXXXXYHPEYEYGYRYSAVGALASVVFLLL 
I I I I I j I I 1:11! 11111111:1 M I II i I if I I II I I I I I I I I I M 

a 08 4 MKQSARIKNMDQTLKNTLGICALLAFCFGAAIASGYHLEYEYGYRYSAVGALASWFLLL 

10 20 30 40 50 60 

70 80 90 100 110 120 

m084 .pep LARGFPRVSSWLLIYVGTTALYLPVGWLYGAPSYQIVGSILESNPAEAREFVGNLPGSL 
I I I i I I I I I I I I I I I I I I I I I I I I I I I I M I I I i I I I 11 I M M I II I I I I I II I I I I I I 
a 08 4 LARGFPRVSSVVLLIYVGTTALYLPVGWLYGAPSYQIVGSILESNPAEAREFVGNLPGSL 

70 80 90 100 110 120 



130 140 150 160 170 180 

m084 . pep YFVQALFFIFGLTVWKYCVSGGVFADVKNYKRRSKIWLTILLTLILSCAVMDKIASDKDL 
I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I i I i I I I I I I I I I I M I I I I 
a 08 4 YFVQALFFI FGLTVWRYCVSGGVFADVKNYKRRSKIWLTILLTLILSCAVMDKIASDKDL 

130 140 150 160 170 180 



190 200 210 220 230 

m0 8 4 . pep RE PDAGLLLN I FDLYY DLAXRAGT I CRQARPH FG S SKKSVNMAY PSCCAQVX 

I I I I I I I I II I 1 1 M I I I I I I I I I I I M I I I I I I I I I I II I I I I I I 1 II I 
a 08 4 RE PDAGLLLN I FDLY YDLASXAGT I CRQARPHFGS SKKSVNMAY PSCCAQVX 

190 200 210 220 230 



The following partial DNA sequence was identified in TV. gonorrhoeae <SEQ ID 265>: 

g085 . seq 

1 ATGGG CAAAG GGCAGGACTT CACGCCCCTG CGCGACGCGT TGAAAGATAA 
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51 GGCAAAAGGC GTGTTCCTGA TCGGCGTCGA TGCGCCGCAA ATCCGCCGCG 

101 ATTTGGACGG CTGCGGCTTG AACCTGACCG ACTGCGTCAC TTTGGAAGAG 

151 GCGGTTCAGA CGGCATACGC CCAAGCCGAA GCGGGCGATA TTGTCTTGCT 

201 CAGCCCCGCC TGCGCGAGTT TCGATATGTT TAAAGGCTAC GCGCACCGTT 

251 CGGAAGTGTT tatCGAAGCG TTTAAGGCTT TGTGA 

This corresponds to the amino acid sequence <SEQ ED 266; ORF 085. ng>: 

g 0 8 5 . pep 

1 MGKGQDFTPL RDALKDKAKG VFLIGVDAPQ I RRDLDGCGL NLTDCVTLEE 
51 AVQTAYAQAE AGDIVLLSPA CASFDMFKGY AHRSEVFIEA FKAL* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 267>: 

m0 85 . seq 

1 ATGGGTAAAG GGCAGGACTT CACGCCCCTG CGCGATGCAC TGGTAGGCAA 

51 GGCAAAAGGC GTGTTCTTGA TTGGTGTCGA TGCGCCGCAA ATCCGCCGCG 

101 ATTTGGACGG CTGCGGCTTG AATATGACCG ACTGCGCCAC TTTGGGAGAA 

151 GCCGTTCAGA CGGCATATGC CCAAGCCGAA GCAGGCGATA TTGTGTTGCT 

201 CAGCCCCGCC TGCGCGAGCT TTGATATGTT CAAAGGCTAC GCGCACCGTT 

251 CGGAAGTGTT TATCGAAGCG TTTAAGGCTT TGTGA 

This corresponds to the amino acid sequence <SEQ ID 268; ORF 08 5>: 

m085 -pep 

1 MGKGQDFTPL RDALVGKAKG VFLIGVDAPQ I RRDLDGCGL NMTDCATLGE 
51 AVQTAYAQAE AGDIVLLSPA CASFDMFKGY AHRSEVFIEA FKAL* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 085 shows 94.7% identity over a 94 aa overlap with a predicted ORF (ORF 085. ng) 
from N. gonorrhoeae: 

m085/g085 

10 20 30 40 50 60 

m08 5 .pep MGKGQDFTPLRDALVGKAKGVFLIGVDAPQ I RRDLDGCGLNMTDCATLGE AVQTAYAQAE 

II I! II I I ! I I I II M I 1 1 i M I 1 1 1 1 1 M I II ! I It i I ^ 1 1 1 = 1 1 I I M ! I 1 I 1 M 

g 085 MGKGQDFTPLRDALKDKAKGVFLI GVDAPQ I RRDLDGCGLNLTDCVTLE EAVQTAYAQAE 

10 20 30 40 50 60 

70 80 90 

m085 . pep AGDI VLLS PACAS FDMFKGYAHRS EVF I EAFKALX 

MIIIIMMMIIIMIIIIMIIIIMMMM 

g 0 8 5 AGD I VLLS PACAS FDMFKGYAHRS EVF I EAFKALX 

70 80 90 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 269>: 

a085 . seq 

1 ATGGGCAAAG GGCAGGACTT CACGCCCCTG CGCGACGCGC TTGCCGGCAA 

51 GGCAAAAGGC GTGTTCCTGA TCGGTGTCGA TGCGCCGCAA ATCCGCCGCG 

101 ATTTGGACGG CTGCGATCTG AATATGACCG ACTGCGCCAC TTTGGAAGAA 

151 GCGGTTCAGA AGGCATATGC CCAAGCCGAA GCGGGCGATA TCGTGCTGCT 

201 CAGCCCCGCC TGCGCGAGTT TCGATATGTT TAAAGGCTAC GCGCACCGTT 

2 51 CGGAAGTGTT TATCGGGGCG TTTAAGGCTT TGTGA 

This corresponds to the amino acid sequence <SEQ ID 270; ORF 085. a>: 

a085 .pep 

1 MGKGQDFTPL RDALAGKAKG VFLIGVDAPQ IRRDLDGCDL NMTDCATLEE 
51 AVQKAYAQAE AGDIVLLSPA CASFDMFKGY AHRSEVFIGA FKAL* 

m085/a085 94.7% identity over a 94 aa overlap 

10 20 30 40 50 60 

m085 . pep MGKGQDFTPLRDALVGKAKGVFLIGVDAPQIRRDLDGCGLNMTDCATLGEAVQTAYAQAE 
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I | I I I I I I I I I II I : I I I M I M I I I I II I M I I I I II MINIMI II I I M I I M 
a085 MGKGQDFTPLRDALAGKAKGVFLIGVDAPQIRRDLDGCDLNMTDCATLEEAVQKAYAQAE 

10 20 30 40 50 60 

70 80 90 

m085 . pep AGDIVLLSPACASFDMFKGYAHRSEVFIEAFKALX 
M I I I I I I I I I I I I I M M I I M M I I I 11 I I I I 
a08 5 AGDI VLLS PACAS FDMFKGYAHRSEVFIGAFKALX 

70 80 90 



The following partial DNA sequence was identified in TV. gonorrhoeae <SEQ ID 27 1>: 

g086 - seq 

1 ATGGTGGTGC TGATGACGGC GTTCGGCCTG CTGATGATTT ATTCGGCTTC 

51 TGTGTATTTG GCATCGAAGG AAGGCGGCGA TCAGTTTTTC TATTTGACCA 

101 GGCAGGCGGG GTTCGTCGTT GCCGGCCTTA TAGCGAGCGG TTTTTTATGG 

151 TTTCTTTGCA GGATGAGGAC ATGGCGGCGG CTTGTGCCGT GGATTTTTGC 

201 CTTATCCGGC CTGTTGCTGG TAGCCGTATT GATTGCCGGG CGCGAAATCA 

251 ATGGCGCGAC CCGTTGGATA CCTTTGGGTC CGTTGAATTT CCAGCCGACC 

3 01 GAGCTGTTCA AGCTGGCAGT CATCCTTTAT TTGGCAAGCC TGTTCACGCG 

3 51 CCGTGAAGAA GTGTTGCGCA GCATGGAAAG TTTGGGTTGG CAGTCGATTT 

4 01 GGCGGGGGAC GGCCAACCTG ATTATGTCCG CCACCAATCC GCAGGCACGT 
4 51 CGTGAAACAT TAGAAATGTA CGgcCGTTTC CGGGCGATCA TCCTGCCGAT 
501 TATGCTGGTG GCGTTCGGTT TGGTGCTGAT AATGGTACAG CCGGATTTCG 
551 GTTCGTTTGT CGTCATTACC GTCATTACCG TTGGAATGCT GTTTCTGGCA 
601 GGATTGCCGT GGAAATATTT TTTTGTCCTG GTAGGCAGCG TCTTGGGTGG 
651 GATGGTGCTG ATGATTACCG CCGCTCCCTA CCGTGTGCAG CGGGTAGTGG 
701 CATTTTTGGA CCCGTGGAAA GACCCGCAGG GTGC CGGCTA CCAGCTTACC 
751 CACTCTCTGA TGGCAATCGG GCGCGGAGAG TGGT TCGGT A TGGGTTTGGG 
801 TGCGAGTTTG AGCAAACGCG GCTTTCTGCC GGAAGCGCAT ACCGATTTTA 
851 TTTTTGCCAT CATCGCTGAA GAATTCGGCT TCTT CGGG AT GTGCGTGCTG 
901 ATATTCTGTT ACGGCTGGCT GGTGGTGCGG GCGTTTTCCA TCGGCAAGCA 
951 GTCGCGCGAT TTGGGtttgA CTTTCAACGC CTATATCGCT TCGGGTATCG 

1001 GCATTTGGAT CGGTATCCAA AGTTTCTTCA AT AT CGGTGT GAACATCGGT 

1051 GCTTTGCCGA CCAAAGGTCT GACGctgCcg tTGATGTCCT ATGGcggTTC 

1101 GTCAGTCTTT TTCATGCTGA TCAGCATGAT GCTGCTGTTG CGTATCGATT 

1151 ATGAAAACCG CCAGAAAATG CGCGGTTACC GGGTGGAGTA AA 

This corresponds to the amino acid sequence <SEQ ID 272; ORF 086.ng>: 

g086 -pep 

1 MWLMTAFGL LMIYSASV YL ASKEGGDQFF YLTRQ AGFW AGLIASGFLW 

51 FLC RMRTWRR L VPWIFALSG LLLVAVL I A G REINGATRWI PLGPLNFQPT 

101 ELFKLAVILY LAS LFTRREE VLRSMESLGW QSIWRGTANL IMSATNPQAR 

151 RETL.EMYGRF R AIILPIMLV AFGLVLIMV Q PDFGS FWIT VI TVGMLFLA 

201 GLPWKY FFVL VGSVLGGMVL MITA APYRVQ RWAFLDPWK DPQGAGYQLT 

2 51 HSLMAIGRGE WFGMGLGASL SKRGFLPEAH TDFIFAIIAE EFGFFGMCVL 

3 01 I FCYGWLWR AFSIGKQSRD LGLTFNAYIA SGIGIWIGIQ SF FNIGVNIG 
3 51 ALPTKGLTLP LMSYG GSSVF FMLISMMLLL RI DYENRQKM RGYRVE * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 273>: 

m086 . seq 

1 ATGGTGGTGC TGATGACGGC GTTCAGCCTG CTGATGATTT ATTCGGCTTC 

51 TGTGTATTTG GCATCAAAAG AAGGCGGCGA TCAGTTTTTC TATTTGACCA 

101 GACAGGCGGG GTTCGTCGTT GCCGGCTTGA TAGCGAGCGG TTTGTTATGG 

151 TTTCTTTGCA GGATGAGGAC ATGGCGGCGG CTTGTGCCGT GGATTTTTGC 

2 01 CCTATCCGGC CTGTTGCTGG TAG T CGT ATT GATTGCCGGG CGCGAAATCA 

2 51 ATGGCGCGAC CCGTTGGATA CCTTTGGGTC CGTTGAATTT CCAGCCGACC 

3 01 GAGCTGTTCA AGCtGGCGGT CATCCTTTAT TTGGCAAGCC TGTTCACGCG 

3 51 CCGTGAAGAA GTGTTGcGCA GCATGGAAAG TTTGGGTTGG CAGTCGATTT 

4 01 GGCGGGGGAC GGCCAATCTG ATCATGTCCG CCACCAATCC GCAGrCACGT 
4 51 CGTGAaACAT TAGAAATGTA CGGCCGTwTC CGGGCGATCA TCCTGCCGAT 
501 TATGCTGGTG GCGTTCGGTT TGGTGCTGAT AATGGTACAG CCGGATTTCG 
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5 51 GTTCGTTTGT CGTCATTACC GTCATTGCCG TTGGAATGCT GTTTTTGGCA 

601 GGATTGCCGT GGAAATATTT TTTCGTCCTG GTAGGCAGCG TCTTGGGCGG 

651 GATGGTGCTG ATGATTACCG CCGCTCCCTA CCGTGTGCAG CGGGTAGTGG 

701 CATTTTTGGA CCCGTGGAAA GACCCGCAGG GTGCCGGCTA CCAGCTTACC 

751 CACTCTCTGA TGGCAATCGG GCGCGGAGAG TGGTTCGGTA TGGGTTTGGG 

801 TGCGAGTTTG AGCAAACGCG GCTTTCTGCC GGAAGCGCAT ACCGATTTTA 

851 TTTTTGCCAT CATCGCCGAA GAATTCGGTT TCTTCGGTAT GTGCGTGCTG 

901 ATATTCTGTT ACGGCTGGCT GGTGGTGCGG GCGTTTTCCA TCGGCAAGCA 

951 GTCGCGCGAT TTGGGTTTGA CTTTCAACGC CTATATCGCT TCGGGTATCG 

1001 GCATTTGGAT CGGkrTCCAA AGTTTCTTCA ATATCGGTGT GAACATCGGT 

1051 GCTTTGCCGA mCAAAgGyCT GACGCyGCCG Tg . AtGTCCw ATGGCGGTTC 

1101 GTCAGTCTTT TTCATGCTGA T C AG C ATG AT GCTGCTGTkG CG TAT AG ATT 

1151 ATGAAAACCG CCGGAAAATG CGCGGTTATC GGGTGGAGTA A 

This corresponds to the amino acid sequence <SEQ ID 274; ORF 086>: 

m086.pep 

1 MWLMTAFSL LMIYSASV YL ASKEGGDQFF YLTRQA GFW AGLIASGLLW 

51 FLC RMRTWRR L VPWIFALSG LLLVWLI AG REINGATRWI PLGPLNFQPT 

101 ELFKLAVILY LASLFTRREE VLRSMESLGW QSIWRGTANL IMSATNPQXR 

151 RE TLEMYGRX R AIILPIMLV AFGLVLIMV Q PDFGS FWIT VIAVGMLFLA 

2 01 GLPW KYFFVL VGSVLGGMVL MITA APYRVQ RWAFLDPWK DPQGAGYQLT 
251 HSLMAIGRGE WFGMGLGASL SKRGFLPEAH TDFIFAI IAE EFGFFGMCVL 
301 I FCYGWLWR AFSIGKQSRD LGLTFNAYIA SGIGIWIGXQ SFFNIGVNIG 

3 51 ALPXKGLTX P XMSXG GSSVF FMLI SMMLLX RI DYENRRKM RGYRVE* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from TV. gonorrhoeae 

ORF 086 shows 96.7% identity over a 396 aa overlap with a predicted ORF (ORF 086.ng) 
from gonorrhoeae: 

m086/g086 

10 20 30 40 50 60 

MWIJ^TAFSLIJ^IYSASVYI^KEGGDQFFYLTRQAGFWAGLIASGLLWFLCRMRTWRR 

IIIIMIhllllllllllllllMIIIIIIIIIIIIIIMIIIIIhllllllllllM 

MVVLMTAFGLLMI YSASVYLAS KEGGDQFFYLTRQAGFVVAGL I ASGFLWFLCRMRTWRR 
10 20 30 40 50 60 

70 80 90 100 110 120 

LVPWIFALSGLLLVWLIAGREINGATRWIPLGPLNFQPTELFKLAVILYLASLFTRREE 

I M MM I M M MM I MM Ml 1 1 I M M I I MM MM M M M MM M M M I 

L V PW I F ALS GLLL V AVL I AGRE I NGATRW I P LG P LN F Q P TE L F KLAV I L YLAS L FT RRE E 
70 80 90 100 110 120 

130 140 150 160 170 180 

VLRSMESLGWQSIWRGTANLIMSATNPQXRRETLEMYGRXRAIILPIMLVAFGLVLIMVQ 

MIIIMIIIIIIIIIIIIMIIIIIII i 1 1 ! 1 1 I 1 1 I III I MM I MIMII MM 

VLRSMESLGWQSIWRGTANLIMSATNPQARRETLEMYGRFRAI ILPIMLVAFGLVLIMVQ 
130 140 150 160 170 180 

190 200 210 220 230 240 

PD FG S FW I TV I AVGML FLAG L P W KY F FVL VG S VLGGMVLM I T AAP Y R V QR WAF LD P W K 

I I i ! I J 1 1 1 1 1 i ^ ! 1 1 1 1 1 1 1 ; F 1 1 : 1 1 1 1 1 1 : 1 1 j i f 1 1 1 1 [ l ] 1 1 { i l I j 1 1 1 1 1 i 

PDFGS FW I TV I TVGMLFLAGL PWKY F FVLVGS VLGGMVLM I TAAP YRVQRWAFLD PWK 
190 200 210 220 230 240 

250 260 270 280 290 300 

DPQGAGYQLTHSLMAIGRGEWFGMGLGASLSKRGFLPEAHTDF I FAI IAE EFGFFGMCVL 

1 1 i 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M I I M 1 1 I I M I 1 1 1 1 1 1 

DPQGAGYQLTHSLMAIGRGEWFGMGLGASLSKRGFLPEAHTDFI FAI lAEEFGFFGMCVL 
250 260 270 280 290 300 

310 320 330 340 350 360 



m086 .pep 
g086 

m086 . pep 
g086 

m086 . pep 
g086 

m086 . pep 
g086 

m086 . pep 
g086 
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m086 pep IFCYGWLWRAFSIGKQSRDLGLTTOAYIASGIGIWIGXQSFFNIGVNIGALPXKGLTXP 

II II IMM M I II M MM IM M I I MMM IMM i MM III M MM: M I I | 

gO 86 IFCYGWLWRAFSIGKQSRDLGLTFNAYIASGIGIWIGIQSFFNIGVNIGALPTKGLTLP 

310 320 • 330 340 350 360 

370 380 390 

m086 . pep XMSXGGSSVFFMLISMMLLXRIDYENRRKMRGYRVEX 

I! MIIIMIIIIMM I I i ll MMiil i i M I 

g0 86 LMS YGGSS VFFMLI SMMLLLRI DYENRQKMRGYRVEX 

370 380 390 



The following partial DNA sequence was identified in K meningitidis <SEQ ID 275>: 

a086.seq 

1 ATGGTGGTGC TGATGACGGC GTTCAGCCTG CTGATGATTT ATTCGGCTTC 

51 TGTGTATTTG G CAT C AAAAG AAGGCGGCGA TCAGTTTTTC TATTTGACCA 

101 GACAGGCGGG GTTCGTCGTT GCCGGCTTGA TAGCGAGCGG TTTGTTATGG 

151 TTTCTTTGCA GGATGAGGAC ATGGCGGCGG CTTGTGCCGT GGATTTTTGC 

201 CCTATCCGGC CTGTTGCTGG TAGTCGTATT GATTGCCGGG CGCGAAATCA 

2 51 ATGGCGCGAC CCGTTGGATA CCTTTGGGTC CGTTGAATTT CCAGCCGACC 

301 GAGCTGTTCA AGCTGGCGGT CATCCTTTAT TTGGCAAGCC TGTTCACGCG 

351 CCGTGAAGAA GTGTTGCGCA GCATGGAAAG TTTGGGTTGG CAGTCGATTT 

4 01 GGCGGGGGAC GGCCAATCTG ATCATGTCCG CCACCAATCC GCAGGCACGT 

4 51 CGTGAAACAT TAGAAATGTA CGGCCGTTTC CGGGCGATCA TCCTGCCGAT 

501 TATGCTGGTG GCGTTCGGTT TGGTGCTGAT AATGGTACAG CCGGATTTCG 

551 GTTCGTTTGT CGTCATTACC GTCATTGCCG TTGGAATGCT GTTTTTGGCA 

601 GGATTGCCGT GGAAATATTT TTTCGTCCTG GTAGGCAGCG TCTTGGGCGG 

651 GATGGTGCTG ATGATTACCG CCGCTCCCTA CCGTGTGCAG CGGGTAGTGG 

7 01 CATTTTTGGA CCCGTGGAAA GACCCGCAGG GTGCCGGCTA CCAGCTTACC 

7 51 CACTCTCTGA TGGCAATCGG GCGCGGAGAG TGGTTCGGTA TGGGTTTGGG 

801 TGCGAGTTTG AGCAAACGCG GCTTTCTGCC GGAAGCGCAT ACCGATTTTA 

851 TTTTTGCCAT CATCGCCGAA GAATTCGGTT TCTTCGGTAT GTGCGTGCTG 

901 ATATTCTGTT ACGGCTGGCT GGTGGTGCGG GCGTTTTCCA TCGGCAAGCA 

951 GTCGCGCGAT TTGGGTTTGA CTTTCAACGC CTATATCGCT TCGGGTATCG 

1001 GCATTTGGAT CGGTATCCAA AGTTTCTTCA ATATCGGTGT GAACATCGGT 

1051 GCTTTGCCGA CCAAAGGTCT GACGCTGCCG TTGATGTCCT ATGGCGGTTC 

1101 GTCAGTCTTT TTCATGCTGA TCAGCATGAT GCTGCTGTTG CGTATAGATT 

1151 ATGAAAACCG CCGGAAAATG CGCGGTTACC GGGTGGAGTA A 

This corresponds to the amino acid sequence <SEQ ID 276; ORF 086.a>: 

a086.pep 

1 M WLMTAFSL LMIYSASV YL ASKEGGDQFF YLTRQA GFW AGLIASGLLW 

51 FLC RMRTWRR L VPWIFALSG LLLVWLI AG REINGATRWI PLGPLNFQPT 

101 ELFKLAVILY LASLFTRREE VLRSMESLGW QSIWRGTANL IMSATNPQAR 

151 RETLEMYGRF RA IILPIMLV AFGLVLIMV Q PDFGS FWIT VIAVGMLFLA 

201 GLPWKY FFVL VGSVLGGMVL MIT AAPYRVQ RWAFLDPWK DPQGAGYQLT 

251 HSLMAIGRGE WFGMGLGASL SKRGFLPEAH TDFIFAIIAE EF GFFGMCVL 

301 IFCYGWLW R AFSIGKQSRD LGLTFNAYIA SGIGIWIGIQ SFFNIGVNIG 

351 ALPTKGLTLP LMSYG GSSVF FMLISMMLLL RI DYENRRKM RGYRVE* 

m086/a086 98.0% identity over a 396 aa overlap 

10 20 30 40 50 60 

ni08 6 . pep MWLMTAFSLLMIYSASVYLASKEGGDQFFYLTRQAGFWAGLIASGLLWFLCRMRTWRR 
I | | | I I I 1 I I I II I I I I I I I I I I I II I I I I I M I I I I I M I I I I I I I I I I II I II I I I t I 
a 08 6 MVVLMTAFSLLMI YSASVYLASKEGG DQFFYLTRQAG FWAGLIASGLLW FLCRMRTWRR 

10 20 30 40 50 60 

70 80 90 100 110 120 

m08 6.pep LVPWIFALSGLLLVWLIAGREINGATRWIPLGPLN FQPTELFKLAVILYLASLFTRREE 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I M I I I I M 
a 086 LV PW I FALSGLLLVWLI AGRE INGATRW I PLG PLN FQPTELFKLAVI LYLAS LFTRREE 
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70 80 90 100 110 120 

130 140 150 160 170 180 

m086.pep VLRSMESLGWQSIWRGTANLIMSATNPQXRRETLEMYGRXRAI ILPIMLVAFGLVLIMVQ 

I M I I I I I I I I I I I I I I M I t I 1 II I It I I I I I II I I I I I I I I I I I I I I I I I I M II I 
a08 6 VLRSMESLGWQS I WRGTANLIMSATNPQARRETLEMYGRFRAI ILPIMLVAFGLVLIMVQ 

130 140 150 160 170 180 

190 200 210 220 230 240 

m086.pep PDFGSFVVITVIAVGMLFLAGLPWKYFFVLVGSVLGGMVLMITAAPYRVQRVVAFLDPWK 
I M I I I I I I I I I I I M M I I I I I I I I II I I I I I I I I I I 1 I I I I I I II I I ! I I I II I I I 11 
a 086 PDFGSFWITVIAVGMLFLAGLPWKYFFVLVGSVLGGMVLMITAAPYRVQRWAFLDPWK 

190 200 210 220 230 240 

250 260 270 280 290 300 

m08 6.pep DPQGAGYQLTHSLMAIGRGEWFGMGLGASLSKRGFLPEAHTDFI FAI IAEEFGFFGMCVL 

I M 1 I M t I I I I I I I I II I I I I I I M I I I I I I I M E I I I M II II I I I II I I I I I I I I I I 
a 086 ■ DPQGAGYQLTHSLMAIGRGEWFGMGLGASLSKRGFLPEAHTDFIFAI IAEEFGFFGMCVL 

250 260 270 280 290 300 

310 320 330 340 350 360 

m08 6.pep IFCYGWLWRAFSIGKQSRDLGLTFNAYIASGIGIWIGXQSFFNIGVNIGALPXKGLTXP 
I I I I I I I M I I 11 M I I I I I I I I I I I I I I I I i I M I I I I I II I I I I f I II I I : I I I I I 
a08 6 IFCYGWLWRAFSIGKQSRDLGLTFNAYIASGIGIWIGIQSFFNIGVNIGALPTKGLTLP 

310 320 330 340 350 360 

370 380 390 

mO 8 6 . pep XMSXGGSSVFFMLI SMMLLXRI DYENRRKMRGYRVEX 

II I I I I I I I I I I II I I I I Ml I I I I I I I I I I M I 
a 0 8 6 LMS YGGS SVFFMLI SMMLLLRI DYENRRKMRGYRVEX 

370 380 390 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 277>: 

:ggaacgg gcggacacat 
3cgtgcgc ggtcatcatg 
3agcgcat cgtgccgcaa 
\ggaatac gcggcaacgg 



ACGCTTTTCC GAAAGCGTTC 



g087 . seq 








1 


ATGGGCGGTA 


AAACCTTTAT 


GCTGATGGCG 


51 


TTTCCCAGCT 


CTGGCTGTGG 


CGGATTCATT 


101 


TAATTTGGCT 


GGGCAGCAAG 


GATTCGATGG 


151 


TACGGCATAC 


GCTTGGAAAC 


GCTGGCGATT 


201 


CAT CAAACG C 


AAGCTGATGC 


TTCCGTTTAC 


251 


AAGCGCAGCG 


GATTATCCGC 


AAACACCGTG 


301 


GGCGGTTTTG 


TTACCTTTCC 


CGGCGGTCTG 


351 


GCCGATTGTG 


ATTCACGAGC 


AAAACGCCGT 


401 


AccTGTCGCg 


C t GGGCGAAA 


CGGGTGTTGT 


451 


AGCCACGAAG 


GCGGTT TGGT 


CGGCAACCCC 


501 


CCTGCCCGTG 


CCTGCCGAAC 


GCTTCCAAGG 


551 


TTTTGGTGGT 


CGGCGGCAGT 


TTGGGTGCGG 


601 


CCGCAGGCGT 


TGGCACTGCT 


GCCTGAAGAG 


651 


CCAGTCGGGG 


CGTAACAAGC 


TGGGCAATCT 


701 


TGGGCGTGAA 


AGCGGAATGC 


GTGGAATTTA 


751 


T AC CGTGATG 


CCGATTTGGT 


GATTTGCCGT 


801 


CGAGTTGACG 


GCGGCGGGGC 


TGGGCGCGTT 


851 


CCGTTGATGA 


CCATCAAACC 


GCCAACGCGC 


901 


GCGGGGCTGC 


TGTTGCCGCA 


AACCCAGTTG 


951 


AATCCTCGGC 


AGCCTCAACC 


GCGAAAAATG 


1001 


CCCGTACGTT 


GGCATTGCCG 


CACAGCGCGG 


1051 


ATTGCGTGTG 


CGGCGTAAA 





CAAAACCGTA 



GACGTTGC CGAAGCCGCG 

This corresponds to the amino acid sequence <SEQ ID 278; ORF 087.ng>: 

g087 .pep 

1 MGGKTFMLMA GGTGGHIFPA LAVADSLRVR GHHVIWLGSK DSMEERIVPQ 

51 YGIRLETLAI KGIRGNGIKR KLMLPFTLYK TVREAQRIIR KHRVE CVIGF 

101 GGFVTFPGGL AA KLLGVP I V IHEQNAVAGL SNRHLSRWAK RVLYAFPKAF 

151 SHEGGLVGNP VRADISNLPV PAERFQGREG RLKILWGGS LGADVLNKTV 
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2 01 PQALALLPEE VRPQMYHQSG RNKLGNLQAD YDALGVKAEC VEFITDMVSA 

2 51 YRDADLVICR AG ALTIAELT AAGLGALLV P YPHAVDDHQT ANARFMVQAE 

3 01 AGLLLPQTQL TAEKLAEILG SLNREKCLKW AENARTLALP HSADDVAEAA 
3 51 IACAA* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 279>: 

m087 . seq 



1 ATGGGCGGTA AAACCTTTAT GCTGAwkkCG GGCGGAACGG GCGGACATAT 

51 TTTCCCCGCG CTGGCGGTGG CGGATTCATT GCGCGCGCGC GGCCATCATG 

101 TGATTTGGCT GGGCAGCAAG GATTCGATGG AAGAGCGTAT CGTGCCGCAA 

151 TACGGCATAC GCTTGGAAAC GCTGGCGATT AAAGGCGTGC GCGGCAACGG 

2 01 CAT CAAACGC AAACTGATGC TGCCGGTTAC TTTGTATCAA ACCGTCCGCG 

2 51 AAGCGCAGCG G ATT AT C CG C AAACACCGTG TCGAGTGCGT CATCGGCTTC 

3 01 GGCGGCTTCG TTACCTTCCC CGGCGGTTTG GCGGCGAAGC TATTArGCGT 

3 51 GCCGATTGTG ATTCACGAGC AAAACGCCGT GGCAGGT TTG TCCAACCGCC 

4 01 ACCTGTCGCG CTGGGCGAAG CGGGTGTTGT ACGCTTTTCC GAAAGCGTTC 

4 51 AGCCACGAAG GCGGCTTGGT CGGCAACCCC GTCCGCGCCG AT AT TAG C AA 

5 01 CCTGCCCGTG CCTGCCGAAC GCTTC CAAGG GCGTGAAGGC CGTCTGAAAA 
5 51 TTTTGGTGGT CGGCGGCAGT TTGGGCGCGG ACGTTTTGAA CAAAACCGTA 
601 CCGCATGCAT TGGCTTTGCT GCCCGACAAT GCGCGTCCGC AT ATGT AC C A 
651 CCAATCGGGA CGGGGCAAGC TGGGCAT CTT GCAGGCGnnn nnnnnnnnnn 
7 01 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 

7 51 nnnGCGGGAT TGGGTGCGTT GTTAGTG CCG TATCCTCACG CGGTTGACGA 

8 01 TCACCAAACC GCCAACGCGC GTTTTATGGT GCAGGCGGAG GCGGGATTGC 
851 TGTTGCCGCA AACCCAGTTG ACGGCGGAAA AACTCGCCGA GATTCTCGGC 

9 01 GGCTTAAACC GCGAAAAATG CCTCAAATGG GCAGAAAACG CCCGTACGTT 
951 GGCACTGCCG CACAGTGCGG ACGACGTGGC GGAAGCCGCG ATTGCGTGTG 

10 01 CGGCGTAA 



This corresponds to the amino acid sequence <SEQ ID 280; ORF 087>: 

m087 . pep 

1 MGGKTFMLXX GGTGGH I FPA LAVADSLRAR GHHVIWLGSK DSMEERIVPQ 



51 YGIRLETLAI KGVRGNGIKR KLMLPVTLYQ TVREAQRIIR KHRVE CVIGF 

101 GGFVTFPGGL AAKLLXVPIV IHEQNAVAGL SNRHLSRWAK RVLYAFPKAF 

151 SHEGGLVGNP VRADISNLPV PAERFQGREG RLKILWGGS LGADVLNKTV 

2 01 PHALALL PDN ARPHMYHQSG RGKLGILQAX XXXXXXXXXX XXXXXXXXXX 

2 51 XAGLG ALL V P YPHAVDDHQT ANARFMVQAE AGLLLPQTQL TAEKLAEILG 

301 GLNREKCLKW AENARTLALP HSADDVAEAA IACAA* 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 087 shows 83.9% identity over a 355 aa overlap with a predicted ORF (ORF 087.ng) 
from TV. gonorrhoeae: 

m087/g087 



m087 . pep 



10 20 30 40 50 60 

MGGKTFMLXXGGTGGHIFPALAVMSLRARGHHVIWLGSKDSMEERIVPQYGIRLETLAI 



g087 




10 20 30 40 50 60 



m0 87 . pep 



70 80 90 100 110 120 

KGVRGNG I KRKLMLPVTLYQTVRE AQRI I RKHRVECVI GFGGFVTFPGGLAAKLLXVP I V 



9087 




KRKLMLPFTLYKTVREAQRIIRKHRVECVIGFGGFVTFPGGLAAKLLGVPIV 
70 80 90 100 110 120 



m087 . pep 



130 140 150 160 170 180 

I HEQNAV AGL S NRH LS RW AKR VL Y A F P KAF S HE GG LVGN P VRAD I S NL P V P AE R FQ GRE G 



g087 




130 140 150 160 170 180 
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m087 . pep 



190 200 210 220 229 

RLKILWGGSLGADVLNKTVPHALALLPDNARPHMYHQSGRGKLGILQA 



9087 




LWGGSLGADVLNKTVPQA1ALLPEEVRPQMYHQSGRNKLGNLQADYDALGVKAEC 
190 200 210 220 230 240 



m087 .pep 



230 240 250 

- - AGLGALLVPYPHAVDDHQTANARFMVQAE 




g087 



VE F I TDMVS AY RDADLV I CRAGALT I AELTAAGLGAL.LV PYPHAVDDHQTANARFMVQAE 
250 260 270 280 290 300 



260 270 280 290 300 310 



m087 .pep 



AGLL L PQTQ LT AE KLAE I LGGLNRE KC L KW AEN ART LAL P H S ADD VAE AA I AC AAX 



g087 



AG LLL PQTQ LTAE KLAE I LGSLNREKCLKWAENARTLALPHS ADDVAEAAI ACAAX 
310 320 330 340 350 




The following partial DNA sequence was identified in N. meningitidis <SEQ ID 28 1>: 

a087 . seq 

1 ATGGGCGGTA AAACCTTTAT GCTGATGGCG GGCGGAACGG GCGGACATAT 

51 TTTCCCCGCG CTGGCGGTGG CGGATTCATT GCGCGCGCGC GG CC AT CAT G 

101 TAATTTGGCT GGGCAGCAAG GATTCGATGG AAGAGCGCAT CGTGCCGCAA 

151 TACGACATCC TGCTCGAAAC GCTGGCGATT AAAGGCGTGC GCGGCAACGG 

201 CATCAAACGC AAGCTGATGC TGCCGTTTAC TTTGTATCAA ACTGTCCGCG 

251 AAGCGCAGCA GATTATCCGC AAACACCGTG TCGAGTGCGT CATCGGCTTC 

301 GGCGGCTTCG TTACCTTTCC CGGCGGTTTG GCGGCGAAGT TATTAGGCGT 

351 GCCGATTGTG ATTCACGAGC AAAACGCCGT GGCAGGTTTG TCCAACCGCC 

4 01 ACCTGTCGCG CTGGGCGAAG CGGGTGTTGT ACGCTTTTCC GAAAGCGTTC 

4 51 AGCCACGAAG GCGGCTTGGT CGGCAACCCC GTCCGCGCCG ATATTAGCAA 

501 CCTGCCCGTG CCTGCCGAAC GCTTCCAAGG GCGTGAAGGC CGTCTGAAAA 

551 TTTTGGTGGT CGGCGGCAGT TTGGGCGCGG ACGTTTTGAA CAAAACCGTA 

601 CCGCAGGCAT TGGCTTTGCT GCCCGACAAT GCGCGTCCGC AGATGTACCA 

651 CCAATCGGGA CGGGGCAAGC TGGGCAGCTT GCAGGCGGAT TACGACGCGC 

7 01 TGGGCGTGCA AGCGGAATGC GTGGAATTTA TTACCGATAT GGTGTCCGCC 

751 TACCGCGATG CCGATTTGGT GATTTGCCGT GCCGGCGCGC TGACGATTGC 

801 CGAGTTGACG GCGGCGGGAT TGGGTGCGTT GTTAGTGCCG TATCCTCACG 

851 CCGTTGATGA CCATCAAACC GCCAACGCGC GTTTTATGGT GCAGGCGGAG 

901 GCGGGATTGC TGTTGCCGCA AACCCAGTTG ACGGCGGAAA AACTCGCCGA 

951 GATTCTCGGC GGCTTAAACC GCGAAAAATG CCTCAAATGG GCAGAAAACG 

1001 CCCGTACGTT GGCACTGCCG CACAGTGCGG ACGACGTTGC CGAAGCCGCG 

1051 ATTGCGTGTG CGGCGTAA 

This corresponds to the amino acid sequence <SEQ ID 282; ORF 087. a>: 

a087 .pep 

1 MGGKTFMLMA GGTGGHIFPA LAVADSLRAR GHHVIWLGSK DSMEERIVPQ 

51 YDILLETLAI KGVRGNGIKR KLMLPFTLYQ TVREAQQIIR KHRVE CVIGF 

101 GGFVTFPGGL AA KLLGVFIV IHEQNAVAGL SNRHLSRWAK RVLYAFPKAF 

151 SHEGGLVGNP VRADISNLPV PAERFQGREG RLKILWGGS LGADVLNKTV 

201 PQALALLPDN ARPQMYHQSG RGKLGSLQAD YDALGVQAEC VEFITDMVSA 

251 YRDADLVICR AG ALTIAELT AAGLGALLV P YPHAVDDHQT ANARFMVQAE 

301 AGLLLPQTQL TAEKLAEILG GLNREKCLKW AENARTLALP HSADDVAEAA 

351 IACAA* 

m087/a087 85.4% identity over a 355 aa overlap 



10 20 30 40 50 60 



m087 . pep 



a087 



MGGKTFMLXXGGTGGHIFPALAVADSLRARGHHVIWLGSKDSMEERIVPQYGIRLETLAI 

I I I I I I I I I M I II ft i I I I I M I I I I t I M I I I I I I I I I I I I I I I I i I I II I I I I 
MGGKTFMLMAGGTGGHIFPALAVADSLRARGHHVIWLGSKDSMEERIVPQYDILLETLAI 



10 20 30 40 50 60 
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70 80 90 100 110 120 

m087 . pep KGVRGNGIKRKLMLPVTLYQTVREAQRIIRKHRVECVIGFGGFVTFPGGL/mKLLXVPIV 
Ml I I I I I M I I I! I M I I I IE I I I : M ! M I I II M I I I I II I I I I I I M 1 I 1 I IE I 
a 087 KGVRGNGIKRKLMLPFTLYQTVREAQQIIRKHRVECVIGFGGFVTFPGGLAAKLLGVPIV 

70 80 90 100 110 120 



130 140 150 160 170 180 

m087 .pep IHEQNAVAGLSNRHLSRWAKRVLYAFPKAFSHEGGLVGNPVRADISNLPVPAERFQGREG 
I I I I j II | I I I I M I I I 1 M I I I I I I I I I II I 1 M I I 1 I I M I I I I ! I I I I E I I ! I I IE I 
a087 IHEQNAVAGLSNRHLSRWAKRVLYAFPKAFSHEGGLVGNPVRADISNLPVPAERFQGREG 

130 140 150 160 170 180 

190 200 210 220 230 240 

m087 . pep RLKILWGGSLGADVLNKTVPHAI^ALLPDNARPHMYHQSGRGKLGILQAXXXXXXXXXXX 

M I I I I I II I I I I I I I I I I I I : E I I I I I I I I I I : I I I It 1 I I I I I III 
a08 7 RLKILWGGSLGADVLNKTVPQALALLPDNARPQMYHQSGRGKLGSLQADYDALGVQAEC 

190 200 210 220 230 240 

250 260 270 280 

m087 pep XX XXXXXXXXXAGLGALLVPYPHAVDDHQTANARFMVQAE 

: : I I I I I I I! I I I I I 1 I I t I IE t I I I M I I I 

a087 VEFITDMVSAYRDADLVICRAGALTIAELTAAGLGALLVPYPHAVDDHQTANARFMVQAE 

250 260 270 280 290 300 



290 300 310 320 330 

m087 . pep AGLLLPQTQLTAEKLAEILGGLNREKCLKWAENARTLALPHSADDVAEAAIACAAX 
M I II I I I II I I I I I I I I M I 1 I I 1 I I I I M I I I I 1 I I I I I M IE I I I I I I I I I E I 
a 0 8 7 AGLLLPQTQLTAEKLAE I LGGLNREKCLI£WAENARTLALPH S ADDVAEAAI ACAAX 

310 320 330 340 350 



The following partial DNA. sequence was identified in N. gonorrhoeae <SEQ ID 283>: 



g088 . seq 




1 


ATGTTTTTAT 


51 


TTTTCAATAC 


101 


CGTTTTCCCT 


151 


AAATGCGGGC 


, 201 


AAACGGCACG 


251 


TGTCCACCCT 


301 


TTGGGCGTAC 


351 


CAAAGTCGTT 


401 


TGTGGCAGTC 


451 


gCcgcCAATT 


501 


CGCCCTGCCG 


551 


TCGTCGGCAC 


601 


GCCGCcttcc 


651 


CGTCAGCGGA 


701 


CCGGCGCGAA 


751 


CTCGGATTTT 


801 


TGTCGGCGCG 


851 


tCCGCCAAGA 


901 


gccgtgTCCG 


951 


CATCTTc CTg 


1001 


aagaaacgca 


1051 


tTgataggtt 


1101 


ACCTTTCAGA 



3CATTTGA ACGCGCAATA A 

This corresponds to the amino acid sequence <SEQ ID 284; ORF 088. ng>: 

g088 . pep 

1 MFLWLAHFSN WLTGLNIFQY TT FRAVMAAL TALAFS LMF G PWT I RRLTAL 

51 KCGQAVRTDG PQTHLVKNGT P TMGGSLILT AITVSTLL WG NWANP YIWIL 

101 LGVLLATGAL GFY DDWRKW YKDPNGVSAK FKMVWQ5S VA VIAGLALFYL 

151 AANSANNILI VPFFKQIA LP LGWGFLVLS YLTIV GTSNA VNLTDGLDGL 
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2 01 AAFPFVLVAA GLAIFA YVSG HYQFSQYLQL PYVAGANE VA IFCTAMCGAC 
251 LGFLWFNAYP AQV FMGD VGA LALGAALGTV AVIV RQE FVL VIMGGLFWE 

3 01 AVSVMLHVGW YKKTKKRIFL TAPIHHHYQL RCWKETQ VW RFWIITIVW 
3 51 LIGL STLKIR GNYAVRTPFR RHLNAQ* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 285>: 

m088 . seq 



2_ 


1 \J 1 1 i i 1 -»» i 


GGCTCGCACA 


T TT C AG C An C 


TGGTTAACCG 


GTCTGAATnn 


^ 1 


nnnnnnnnnn 

11111 11 11 11 llli.il. ill 


nnnnnnnnnn 
i ii ii ii iiiiiiii 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


101 


nnnnnnrinnri 

111 11 11 11 11111111 m 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


151 


nnnnnnnnrm 
in mi iiiiiiiiiiiii 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


2 01 


nnnnnnnnnn 
in ii li li li n ii ii in 


nnnnnnnnnn 
niiiiiiiiiiiiiii^^^ 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


4. D JL 


nnnnnnnnnn 

111 Li 111111 11 1111111 


nnnnnnnnnn 
liiiiiiiiiiiiiiniLi 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


o yj x 


nnnnnnnnnn 
iiiniiiiiiiiiiiiiii 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


351 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


401 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


451 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


501 


nnnnnnnnnn 


nnnGGCGTGG 


TCGGCTTTTT 


GGTGTTGTCT 


TACCTGACCA 


551 


TCGTCGGCAC 


ATCCAATGCC 


GTCAACCTCA 


CCGACGGCTT 


GGACGGCCTT 


601 


GCGACCTTCC 


CCGTCGTCCT 


CGTTGCCGCC 


GGCCTCGCCA 


TCTTCGCCTA 


651 


TGCCAGCGGC 


CACTCACAAT 


TTGCCCAATA 


CCTGCAATTA 


CCTTACGTTG 


701 


CCGGCGCAAA 


CGAAGTGGTG 


ATTTTCTGTA 


CCGCCATGTG 


CGGCGCGTGC 


751 


CTCGGTTTCT 


TGTGGTTTAA 


CGCCTATCCC 


GCGCAAGTCT 


TTATGGGCGA 


801 


TGTCGGTGCA 


TTGGCATTGG 


GTGCCGCGCT 


CGGTACCGTC 


GCCGTTATCG 


851 


TCCGCCAAGA 


GTTTGTCCTC 


GTCATTATGG 


GCGGATTATT 


TGTCGTAGAA 


901 


GCCGTATCCG 


TTATGCTTCA 


GGTTGGCTGG 


TATAAGAAAA 


CCAAAAAACG 


951 


CATCTTCCTG 


ATGGCGCCCA 


TCCATCACCA 


CTACGAACAA 


AAAGGCTGGA 


1001 


AAGAAACCCA 


AGTCGTCGTC 


CGCTTTTGGA 


TTATTACCAT 


CGTCTTGGTG 


1051 


TTGATCGGTT 


TGAGTACCCT 


CAAAATCCGC 


TGAAC C TAT G 


CCGTCTGAAC 


1101 


ATCTTTCAGA 


CGGCATTTGA 


ACGCGCAATA 


A 





1 MFLWIiAHFSN WLTGLNI FQY TT FRAVMAAL TALAFSLMF G PWTIRRLTAL 

51 KCGQAVRTDG PQTHLVKNGT P TMGGSLILT AITVSTLL WG NWANPYIWIL 

101 LGVLLATGAL GF YDDWRKW YKDPNGVSAK FKMVWQSS VA VIAGLALFYL 

151 AANSAN NILI VPFFKQI ALP LGWGFLVLS YLTIVG TSNA VNLTDGLDGL 

201 AAFPFVLVAA GLAIFA YVSG HYQFSQYLQL PYVAGANE VA IFCTAMCGAC 

251 LGFLW FNAYP AQVFMGD VGA LALGAALGTV AVIVR QE FVL VIMGGLFWE 

301 AVSVM LHVGW YKKTKKRIFL TAPIHHHYQL RCWKET QVW RFWIITIVW 

351 LIGLSTLKIR GNYAVRTPFR RHLNAQ* 



This corresponds to the amino acid sequence <SEQ ID 286; ORF 088>: 

m088 .pep 

1 MFLWLAHFSX WLTGLNXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 

51 XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 

101 XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 

151 XXXXXXXXXX XXXXXXXXXX XGWGFLVLS YLTIV GTSNA VNLTDGLDGL 

201 ATFPWLVAA GLAI FA YASG HSQFAQYLQL P YV AG ANE W IFCTAMCGAC 

251 LGFLW FNAYP AQVFMGD VGA LALGAALGTV AVIV RQE FVL VIMGGLFWE 

301 AVSVM LQVGW YKKTKKRIFL MAPIHHHYEQ KGW KETQ VW RFWIITIVLV 

351 LIGL STLKIR XTYAVXTSFR RHLNAQ* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 088 shows 91.7% identity over a 205 aa overlap with a predicted ORF (ORF 088.ng) 
from TV. gonorrhoeae: 

m088/g088 

10 20 30 

m088 . pep GWGFLVLSYLTIVGTSNAVNLTDGLDGLA 

I !l I II illl I! I M II III! Ill I I MM 

g0 88 IAGIJU^FYLAANSANNILIVPFFKQIALPL^WGFLVLSYLTIVGTSNAVNLTDGLDGLA 
150 160 170 180 190 200 
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40 50 60 70 80 90 

m088 . pep tfpwlvaaglaifayasghsqfaqylqlpyvaganewifctamcgaclgflwfnaypa 
:M I 1 1 1 I I I! M I h I I I MMIIMMMMMMMMiMiMMMMIMM 

g 0 8 8 aFPFVLVAAGIjAIFAYVSGHYQFSQYLQLPYVAGANEVAIFCTAMCGACLGFLWFNAYPA 
210 220 230 240 250 260 



100 110 120 130 140 150 

m088 pep QVFMGDVGALALGAALGTVAVI vrqefvlvimgglfweavsvmlqvgwykktkkri flm 

I 1 1 J 1 1 1 r I E 1 1 1 1 1 1 i 1 1 1 1 f I ( 1 1 1 1 1 1 1 1 E 1 1 1 1 1 ] E 1 1 1 ! 1 = 1 1 1 1 1 i 1 1 1 1 1 1 1 

g 0 8 8 QVFMGDVGAIiALGAALGTVAVI VRQEFVLVIMGGLFWEAVSVMLHVGWYKKTKKRI FLT 

270 280 290 300 310 320 



160 170 180 190 200 

m088 . pep AP I HHHYEQKGWKETQ WVRFW 1 1 T I VLVLI GLSTLKI RXT Y AVXTS FRRHLNAQX 

llllllh : IMIMMMMMMMIMMMIII MM ! MINIMI 

g 088 APIHHHYQLRCWKETQVWRFWIITIVWLIGLSTLKIRGNYAVRTPFRRHLNAQX 
330 340 350 360 370 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 287>: 

a088 . seq 

1 ATGTTTTTAT GGCTCGCACA TTTCAGCAAC TGGTTAACCG GTCTGAATAT 

51 TTTTCAATAC ACCACATTCC GCGCCGTCAT GGCGGCGTTG ACCGCCTTGG 

101 CGTTTTCCCT GATGTTCGGC CCGTGGACGA TACGCAGGCT GACCGCGCTC 

151 AAATGCGGGC AGGCAGTGCG TACCGACGGT CCGCAAACCC ACCTCGTCAA 

201 AAACGGCACG CCGACGATGG GCGGTTCGCT GATTCTGACC GCCATTACCG 

251 TGTCCACCCT GTTGTGGGGC AACTGGGCAA ACCCGTATAT CTGGATTCTC 

301 TTGGGCGTAT TGCTCGCCAC GGGCGCACTC GGTTTTTACG ACGACTGGCG 

351 CAAAGTCGTC TATAAAGACC CCAACGGCGT GTCCGCCAAA TTCAAAATGG 

4 01 TGTGGCAGTC AAGCGTTGCC ATTATCGCCG GTTTGGCATT GTTTTACCTT 

4 51 GCCGCCAATT CCGCCAACAA TATTTTGATT GTCCCGTTCT TCAAACAAAT 

501 CGCCCTGCCG CTGGGCGTGG TCGGCTTTTT GGTGTTGTCT TACCTGACCA 

551 TCGT CGGCAC ATCCAATGCC GTCAACCTCA CCGACGGCTT GGACGGCCTT 

601 GCGACCTTCC CCGTCGTCCT CGTTGCCGCC GGCCTCGCCA TCTTCGCCTA 

651 TGCCAGCGGC CACTCACAAT TTGCCCAATA CCTGCAATTA CCTTACGTTG 

7 01 CCGGCGCAAA CGAAGTGGTG ATTTTCTGTA CCGCCATGTG CGGCGCGTGC 

7 51 CTCGGTTTCT TGTGGTTTAA CGCCTATCCC GCGCAAGTCT TTATGGGCGA 
801 TGTCGGTGCA TTGGCATTGG GTGCCGCGCT CGGTACCGTC GCCGTCATCG 

8 51 TCCGCCAAGA GTTTGTCCTC GTCATTATGG GCGGATTATT TGTCGTAGAA 
901 GCCGTATCCG TTATGCTTCA GGTCGGCTGG TATAAGAAAA CCAAAAAACG 
951 CATCTTCCTG ATGGCGCCCA TCCATCACCA CTACGAACAA AAAGGCTGGA 

1001 AAGAAACCCA AGTCGTCGTC CGCTTTTGGA TTATTACCAT CGTCTTGGTG 

1051 TTGATCGGTT TGAGTACCCT CAAAATCCGC TGAACCTATG CCGTCTGAAC 

1101 ACCTTTCAGA CGGCATTTGA ACGCGCAATA A 

This corresponds to the amino acid sequence <SEQ ID 288; ORF 088. a>: 

a088 .pep 

1 MFLWLAHFSN WLTGLNIFQY TT FRAVMAAL TALAFSLMF G PWTIRRLTAL 

51 KCGQAVRTDG PQTHLVKNGT P TMGGSLILT AITVSTLL WG NWANPYIWIL 

101 LGVLLATGAL GF YDDWRKW YKDPNGVSAK FKMVWQSS VA IIAGLALFYL 

151 AANSANNILI VPFFKQIA LP LGWGFLVLS YLTIV GTSNA VNLTDGLDGL 

201 ATFPWLVAA GLAIFA YASG HSQFAQYLQL PYVAGANE W I FCTAMCGAC 

2 51 LGFLWFNAYP AQVFMGD VGA LALGAALGTV AVIV RQE FVL VIMGGLFVVE 

301 AVSV MLQVGW YKKTKKRIFL MAPIHHHYEQ KGWKETQ VW RFWIITIVLV 

351 LIGLSTLKIR *TYAV*TPFR RHLNAQ* 



m088/a088 99.5% identity over a 205 aa overlap 

150 160 170 180 190 200 

m088 . pep XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGWGFLVLSYLTIVGTSNAVNLTDGLDGLA 

I i It I I I I I I I M I i I I I I I I I I I I I I M I 
a 088 IAGLALFYLAANSANNILIVPFFKQIALPLGWGFLVLSYLTIVGTSNAVNLTDGLDGLA 
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ISO 160 170 180 190 200 

210 220 230 240 250 260 

m088 .pep TFPVVLVAAGLAIFAYASGHSQFAQYLQLPYVAGANEWIFCTAMCGACLGFLWFNAYPA 
1 | | I I I I II M I I I I I I I I I I I I M II I M 1 I I I M I M M I I i I I I I i I M I I I I I M I 
a 088 T F PVVLVAAGLAI FAYASGHSQ FAQYLQLPY VAGANE W I FCTAMCGACLG FLW FNAYPA 

210 220 .230 240 250 260 

270 280 290 300 310 320 

m088 . pep QVFMGDVGALALGAALGTVAVIVRQEFVLVIMGGLFWEAVSVMLQVGWYKKTKKRIFLM 
I | I I I II I I I I I I I M II I I I I I M I I I I I I I M I I I I I I I M I I I I I I ! I M I I I I M I 
a 088 QVFMGDVGALALGAALGTVAVIVRQEFVLVIMGGLFWEAVSVMLQVGWYKKTKKRIFLM 
270 280 290 300 310 320 

330 340 350 360 370 

m088.pep APIHHHYEQKGWKETQVVVRFWIITIVLVLIGLSTLKIRXTYAVXTSFRRHLNAQX 
I I I I I I I I I I I I M I I I I I I II I I I I ! I I I I I I M M I I M I I M I MINIMI 
a088 APIHHHYEQKGWKETQVWRFWIITIVLVLIGLSTLKIRXTYAVXTPFRRHLNAQX 

330 340 350 360 370 



The following partial DNA sequence was identified in TV. gonorrhoeae <SEQ ID 289>: 

g089 . seq 

1 ATGCCGCCCA AAATCACGAA GAGCGGGTTT TGCAAACCGG CAATCGCGGC 

51 GGCGGTCGCG CCGACATTCG TGCCTTTGCT GTCGTCGATG AATACCACGC 

101 CGTTTTTCTC GCCGATTTTT TCCACACGGT GCGGCAAGCC TTGGAAGGTT 

151 TTGACGTGTT CCAGCAATGC TTCGCGCGGC AAACCGACGG CCTCGCACAA 

2 01 AGCCACGGCA GCCATAACGT TGGCGGCGTT GTGCAAACCT TGCAGCGGGA 
251 TGTCTTGCGT AGAAATCAAA TCTTCATTGC CTTGTTTTAA ACAGCCCGTC 
301 CCGCGTTCCA ACCAAAAATC GGCTTCGTGT TCCAAGGAAA ACCGTTTCAC 

3 51 TTCACGCCCT GCCCGTTTCA TGGCGCGGCA GAACACGTCG TCCGCATTCA 

4 01 AAACC TGC AC TCCATCGCCA CGGAAAATCT CGGCTTTGGT ATGCGCGTAG 

This corresponds to the amino acid sequence <SEQ ID 290; ORF 089.ng>: 

g089 . pep 

1 MPPKITKSGF CKP AIAAAVA PTFVPLLSSM NTTPFFSPIF STRCGKPWKV 

51 LTCSSNASRG KPTASHKATA AITLAALCKP CSGMSCVEIK SSLPCFKQPV 

101 PRSNQKSASC SKENRFTSRP ARFMARQNTS SAFKTCTPSP RKISALVCA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 29 1>: 

m089 - seq 

1 ATGCCGCCCA AAAT CACkAw GAGCGGATTT TGCAAACCGG CAATCGCGGC 

51 GGCAGTCGCG CCGACATTCG TGCCTTTGCT GTCGTCGATA AACACCACGC 

101 CGTTTTTCTC GCCGATTTTT TCCACGCGGT GCGGCAGGCC TTGGAAGGTT 

151 TTGACGTGTT CGAGCAATGC TTCGCGCGAC AAACCGATGG CCTCACACAA 

2 01 AG C C AC kG C A GCCATGACGT TAGCGGCGTT GTGCAkACCT TGCAACGGwA 

2 51 TGTCTTGCGT GACAATCAAA TCTTCATTGC CTTGTTTCAG GCGGCCTGTC 

3 01 TCGCGTTCCA ACCAGAAATC AGCTTCGTGT T C CAACG AAA ACCATTTTAC 
351 CTCGCGCCCG GCACGCTTCA TCGCGCGGCA GAACG CAT CG TCCGCATTCA 

4 01 AAACCTGCAC GCCGTCGCCA CGGAAAATCT TGGCTTTGGT ATGCGCATAG 

This corresponds to the amino acid sequence <SEQ ID 292; ORF 089>: 

m089 - pep 

1 MPPKITXSGF CKP AIAAAVA PTFVPLLSSI NTTPFFSPIF STRCGRPWKV 

51 LTCSSNASRD KPMASHKATA AMTLAALCXP CNGMSCVTIK SSLPCFRRPV 

101 SRSNQKSASC SNENHFTSRP ARFIARQNAS SAFKTCTPSP RKILALVCA* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 089 shows 88.6% identity over a 149 aa overlap with a predicted ORF (ORF 089.ng) 
from N. gonorrhoeae: 

m089/g089 
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10 20 30 40 50 60 

m089 .pep MPPKI TXSGFCKPAI AAA VAPTFVPLLSS INTTPFFS P I FSTRCGRPWKVLTCSSNASRD 

MINI II MMIIII IMIIM MIlhMI illl MIMIIMMMII M II M 

g08 9 MPPKITKSGFCKPAIAAAVAPTFVPLLSSMNTTPFFSPI FSTRCGKPWKVLTCSSNASRG 

10 20 30 40 50 60 



70 80 90 100 110 120 

m08 9 .pep KPMASHKATAAMTLAALCXPCNGMSCVTIKSSLPCFRRPVSRSNQKSASCSNENHFTSRP 

II I MINIMUM I I hi MM I II II II h HI I II I I M I I h 1 1 M II I I 

g0 89 KPTASHKATAAITLAALCKPCSGMSCVEIKSSLPCFKQPVPRSNQKSASCSKENRFTSRP 

70 80 90 100 110 120 



130 140 150 

m089 . pep ARF I ARQNASS AFKTCT PS PRKI LALVCAX 

It I M I II = 1 1 1 1 I M I I 1 1 1 1 1 I 1 1 1 1 ! 

g08 9 AR FMARQNTSS AFKTCT PS PRKI SALVCAX 

130 140 150 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 293>: 

a089. seq 

1 ATGCCGCCTA AAATCACGAA GAGCGGATTT TGCAAACCGG CAATCGCGGC 

51 GGCGGTCGCA CCGACGTTCG TGCCTTTGCT GTCGTCGATG AACACCACGC 

101 CATTTTTCTC GCCGATTTTT TCCACGCGGT GCGGCAGGCC TTGAAAGGTT 

151 TTGACGTGTT CGAGCAATGC TTCGCGCGGC AAACCGACGG CTTCGCACAA 

201 GGCAACGGCA GCCATCACGT TAGTGGCGTT GTGCAAGCCT TGCAGCGGAA 

251 TATCTTGCGT GGCAATCAAA TCTTCATTGC CTTGTTTCAG GCGACCTGTC 

301 TCACGTTCCA ACCAAAAATC GGCTTCGTAT TCCAACGAAA ACCATTTCAC 

351 CTCGCGCCCG GCGCGCTTCA TCGCACGACA GAACGCATCG TCCGCATTCA 

4 01 AAACCTGCAC ACCGTCGCCA CGGAAAATCT TGGCTTTGGT ATGCGCGTAG 

This corresponds to the amino acid sequence <SEQ ID 294; ORF 089.a>: 

a089 .pep 

1 MPPKITKSGF CKP AIAAAVA PTFVPLLSSM NTTPFFSPIF STRCGRP*KV 

51 LTCSSNASRG KPTASHKATA AITLVALCKP CSGISCVAIK SSLPCFRRPV 

101 SRSNQKSASY SNENHFTSRP ARFIARQNAS SAFKTCTPSP RKILALVCA* 



m089/a089 91.9% identity over a 149 aa overlap 

10 20 30 40 50 60 

m089.pep MPPKITXSGFCKPAIAAAVAPTFVPLLSS INTTPFFS PI FSTRCGRPWKVLTCSSNASRD 

I I I I I I I I I M I I I I II I I I II M I M I : I I I I I I I I I I I I I I I M M I I I I I I I II 
a 08 9 MPPKITKSGFCKPAIAAAVAPTFVPLLSSMNTTPFFSPIFSTRCGRPXKVLTCSSNASRG 

10 20 30 40 50 60 



70 80 90 100 110 120 

m08 9.pep KPMASHKATAAMTLAALCXPCNGMSCVTIKSSLPCFRRPVSRSNQKSASCSNENHFTSRP 
|| | | | | | M I : I I : I I I 1 I : I : I I I : I 11 I I I I I I I I t II I I I I I II MINIUM 
a 08 9 KPTASHKATAAITLVALCKPCSGISCVAIKSSLPCFRRPVSRSNQKSASYSNENHFTSRP 

70 80 90 100 110 120 



130 140 150 

mO 8 9 . pep ARFI ARQN ASS AFKTCT PS PRK I LALVCAX 

I I I I I I I I I II I I I I I I I I I M I I I I I I I I 
a089 ARFI ARQN AS S AFKTCT P S PRK I LALVCAX 

130 140 150 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 295>: 

g090 . seq 

1 ATGCGCGTAG T CGAGCAAAT CGTCGTAGCG GTCGAGATGG TCTTCGGAAA 



WO 99/057280 



285 



PCT/US99/09346 



SI TGTTCATCAC CGTCGCCGCA GTCGGGCGCA GG CTTTCGGT GTTTTCCAGT 

101 TGGAAGCTGG AAAGCTCcca CACCCACACG TCCGCCTTTT TGCCTTCgcg 

151 ctgCAATtct gcctccaaga cgggcgtacc gatATTGCCC GCAATG Ac g g 

201 tatccagccc gcacttgatg CAGAGatagc ggaccaggct ggttaccgTG 

251 GTTttgccgt tgctgCcggt aatcgCaatc accttgtcgC CGCGGCGGtt 

301 cAcaaTGTCc gccaGCAATt ggATGTCGCC TAgCACGCGC . ccgccgTTT 

351 TGCttga 

This corresponds to the amino acid sequence <SEQ ID 296; ORF 090.ng>: 

g090 . pep 

1 MRWEQIWA VEMVFGNVHH RRRSRAQAFG VFQLEAGKLP HPHVRLFAFA 

51 LQFCLQDGRT DIARNDGIQP ALDAEIADQA GYRGFAVAAG NRNHLVAAAV 

101 HNVRQQLDVA XHAXRRFA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ED 297>: 

m090 . seq 

1 ATG CGC AT AG TCGAGCAAGT CGTCGTAGCG GTCGAGATGG TCTTCGGAAA 

51 TGTTCAGCAC CGTCGCCGCA GTCGGACGCA GGCTTTCGGT GTTTTCCAGT 

101 TGGAAGCTGG AAAGCTCCAA CACCCACACG TCCGCCTTTT TGCCTTCGCG 

151 CTGCCATTCC GCCTCCAAAA CCGGCGTGCC GATATTGCCC GCGATAACGG 

2 01 TATCCAGCCC GCACTTGATA CAGAGATAGC CGACCAGGCT CGTTACCGTG 

2 51 GTTTTGCCGT TGCTGCCGGT AATCGCAATT ACCTTGTCGT CCCGGCGGTT 

301 CACAATGTCC GCCAGCAATT CGATGTCGCC CAACACGCGT .CCGCCGTTT 

351 TGCTTGA 

This corresponds to the amino acid sequence <SEQ ID 298; ORF 090>: 

m0 90 . pep 

1 MRIVEQVWA VEMVFGNVQH RRRSRTQAFG VFQLEAGKLQ HPHVRLFAFA 

51 LP FRLQNRRA DIARDNGIQP ALDTEIADQA RYRGFAVAAG NRNYLWPAV 

101 HNVRQQFDVA QHAXRRFA* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N, gonorrhoeae 

ORF 090 shows 83.9% identity over a 1 18 aa overlap with a predicted ORF (ORF 090.ng) 
from N. gonorrhoeae: 

m090/g090 

10 20 30 40 50 60 

m090 . pep MRIVEQVWAVEMVFGNVQHRRRSRTQAFGVFQLEAGKLQHPHVRLFAFALPFRLQNRRA 

i hli-l i I MlllilhIMM , :| I I Mill I II I I lllllllllll I Ih I 

g090 MRWEQIWAVEMVFGNVHHRRRSRAQAFGVFQLEAGKLPHPHVRLFAFALQFCLQDGRT 

10 20 30 40 50 60 

70 80 90 100 110 119 

m090 . pep D I ARDNGI QPALDTE I ADQARYRGFAVAAGNRNYLWPAVHNVRQQFDVAQHAXRRFAX 

II I h:|| I I M hi II I I I IIMIIMIIIhlh lllllllhlll llllllll 
g 0 9 0 DI ARNDGIQPALDAE I ADQ AG YRG F AVAAGNRNH L V AAAVHNVRQQLD V AX HAX RR FAX 

70 80 90 100 110 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 299>: 

a090 . seq 

1 ATGCGCGTAG TCGAGCAAGT CGTCGTAGCG GTCGAGATGG TCTTCGGAAA 

51 TGTTCAGCAC TGTCGCCGCA GTCGGGCGCA GGCTTTCGGT GTTTTCCAGT 

101 TGGAAACTGG AAAGCTCCAA CACCCACACG TCCGCCTTTT TGCCTTCGCG 

151 CTGCAATTCC GCCTCCAAAA CCGGCGCGCC GATATTGCCC GCGATAACGG 

201 TATCCAGCCC ACACTTGATG CAGAGATAGC CGACCAGGCT CGTTACCGTG 

251 GTTTTGCCGT TGCTGCCGGT AATCGCAATC ACCTTGTCGC CGCGGCGGTT 

301 CACAATGTCC GCCAGCAATT CGATGTCGCC CAACACGCGT CCGCCGTTT 

351 CGCTTAA 

This corresponds to the amino acid sequence <SEQ ID 300; ORF 090. a>: 
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a090 -pep 

1 MRWEQWVA VEMVFGNVQH CRRSRAQAFG VFQLETGKLQ HPHVRLFAFA 

51 LQFRLQNRRA DIARDNGIQP TLDAEIADQA RYRG FAVAAG NRNHLVAAAV 

101 HNVRQQFDVA QHAXRRFA* 

m09/aO90 91.5% identity over a 1 17 aa overlap 

10 20 30 40 50 60 

m090 . pep MRIVEQVVVAVEMV FGNVQHRRRSRTQAFGVFQLEAGKLQHPHVRLFAFALPFRLQNRRA 

| | : M I I I I I I I I I I I M I I I I I I : I I I I I I I I I : II I I I I I I I I I I I I I I I I I I! I I 
a090 MRVVEQVVVAVEMVFGNVQHCRRSRAQAFGVFQLETGKLQHPHVRLFAFALQFRLQNRRA 

10 20 30 40 50 60 

70 80 90 100 110 119 

m0 90 . pep DIARDNGIQPALDTEIADQARYRGFAVAAGNRNYLWPAVHNVRQQFDVAQHAXRRFAX 
I I I I I I I I It : II : I I I I M I I I I I I I I M I I I : I I : I I I I I I I I I I I II I I I I I I I I 
a 090 DIARDNGIQPTLDAEIADQARYRG FAVAAGNRNHLVAAAVHNVRQQFDVAQHAXRRFAX 

70 80 90 100 110 

The following partial DNA sequence was identified in N. gonorrhoeae 

g090-l.seq This sequence contains multiple stop codons (not shown) 

This corresponds to the amino acid sequence < ORF 090-l.ng>: 

g090-l.pep (not shown) 

The following partial DNA sequence was identified in N. meningitidis <SEQ ED 2>: 

m090-l . seq 

1 ATGACGGCGT TTGCATTTCA GACGGCATCA CAAAGCCTTA AACGCTTCGA 

51 TAAACACTTC CGAACGGTGC GCGTAGCCTT TGAACATATC AAAGCTCGCG 

101 CAGGCGGGGC TGAGCAACAC AATATCGCCT GCTTCGGCTT GGGCATATGC 

151 CGTCTGAACG GCTTCTCCCA AAGTGGCGCA GTCGGTCATA TTCAAGCCGC 

201 AGCCGTCCAA ATCGCGGCGG ATTTGCGGCG CATCGACACC AATCAAGAAC 

251 ACGCCTTTTG CCTTGCCTAC CAGTGCATCG CGCAGGGGCG TGAAGTCCTG 

301 CCCTTTACCC ATGCCGCCCA AAATCACGAA GAGCGGATTT TGCAAACCGG 

351 CAATCGCGGC GGCAGTCGCG CCGACATTCG TGCCTTTGCT GTCGTCGATA 

4 01 AACACCACGC CGTTTTTCTC GCCGATTTTT TCCACGCGGT GCGGCAGGCC 

4 51 TTGGAAGGTT TTGACGTGTT CGAGCAATGC TTCGCGCGAC AAACCGATGG 

501 CCTCACACAA AGCCACGGCA GCCATGACGT TAGCGGCGTT GTGCAGACCT 

551 TGCAACGGAA TGTCTTGCGT GACAATCAAA TCTTCATTGC CTTGTTTCAG 

601 GCGGCCTGTC TCGCGTTCCA ACCAGAAATC AGCTTCGTGT TCCAACGAAA 

651 ACCATTTTAC CTCGCGCCCG GCACGCTTCA TCGCGCGGCA GAACGCATCG 

701 TCCGCATTCA AAACCTGCAC GCCGTCGCCA CGGAAAATCT TGGCTTTGGT 

751 ATGCGCATAG TCGAGCAAGT CGTCGTAGCG GTCGAGATGG TCTTCGGAAA 

801 TGTTCAGCAC CGTCGCCGCA GTCGGACGCA GGCTTTCGGT GTTTTCCAGT 

851 TGGAAGCTGG AAAGCTCCAA CACCCACACG TCCGCCTTTT TGCCTTCGCG 

901 CTGCCATTCC GCCTCCAAAA CCGGCGTGCC GATATTGCCC GCGATAACGG 

951 TATCCAGCCC GCACTTGATA CAGAGATAGC CGACCAGGCT CGTTACCGTG 

1001 GTTTTGCCGT TGCTGCCGGT AATCGCAATT ACCTTGTCGT CCCGGCGGTT 

1051 CACAATGTCC GCCAGCAATT CGATGTCGCC CAACACGCGT CCGCCGTTTT 

1101 GCTTGAACGC CTCAATATCC GGCTGCCGCT CGCTGATGCC GGGACTGAGA 

1151 GCCAGAATAT CGAAACCGTT GTCCAGCGCA TCTTTCAGAC GGCCCGTGTA 

12 01 AAACACCAAC CCGTCAAACA TCTTACCGAT TTGCGACACG CGTTCCGGCT 

12 51 TCAGCTCCGC ATCATACGCA GCAACCTCCG CGCCGTTTTT GCGCAGGTAG 

1301 GCAATCATGG AAATACCCGT ACCGCCGAGT CCGGCGACGA GGATTTTTTT 

1351 GTTTTGAAAA GTCATTTTGG TTTGTCCTAA 

This corresponds to the amino acid sequence <SEQ ID 3; ORF 090- 1>: 

m090-l .pep 

1 MTAFAFQTAS QSLKRFDKHF RTVRVAFEHI KARAGGAEQH NIACFGLGIC 

51 RLNGFSQSGA VGHIQAAAVQ IAADLRRIDT NQEHAFCLAY QCIAQGREVL 

101 PFTHAAQNHE ERILQTGNRG GSRADIRAFA WDKHHAVFL ADFFHAVRQA 

151 LEGFDV FEQC FARQTDGLTQ SHGSHDVSGV VQTLQRNVLR DNQIFIALFQ 

2 01 AACLAFQPEI SFVFQRKPFY LAPGTLHRAA ERI VRIQNLH AVATENLGFG 
251 MRIVEQVVVA VEMVFGNVQH RRRSRTQAFG VFQLEAGKLQ HPHVRLFAFA 
301 LPFRLQNRRA DIARDNGIQP ALDTEIADQA RYRG FAVAAG NRNYLVVPAV 

3 51 HNVRQQFDVA QHASAVLLER LNIRLPLADA GTESQNIETV VQRI FQTARV 
401 KHQPVKHLTD LRHAFRLQLR I IRSNLRAVF AQVGNHGNTR TAESGDEDFF 
451 VLKSHFGLS* 
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The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 303>: 

g091 . seq 

1 ATGGAAATAC CCGTGCCGCC AAGTCCGGCG ACGAGGATTT TTTTGTTTGA 

51 AAGTCATTTT GGTTTTGTCC TAAAACAAAT CATATTGGGC AGGAGACGTC 

101 CGCCCTTGCC CAAGCCGCTT T CAGACGG CA TCGCGAGCCG ATTAATAACC 

151 CGCCTTCAGG CGTTGGTCAT TGTCGCAGCT GTTTTGGTCT CCGTTTTGAC 

2 01 AAGCCTTGCC AAGCCATTGT TGAGCGAGCG CAAGGTCTTG GCGCACGCCG 
251 CGTCCATCGT AATACATCAA GCCCAAATTG TATTGGGCTT GGGCATCCCC 

3 01 TTGTTCTGA 

This corresponds to the amino acid sequence <SEQ ID 304; ORF 091.ng>: 

g091 . pep 

1 MEIPVPPSPA TRIFLFESHF GFVLKQI ILG RRRPPLPKPL SDGIASRLIT 
51 RLQ ALVIVAA VLVSVLTSLA KPLLSERKVL AHAAS IVIHQ AQIVLGLGIP 
101 LF* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 305>: 

m091 . seq 

1 ATGGAAATAC CCGTACCGCC GAGTCCGGCG ACGAGGATTT TTTTGTTTGA 

51 AAAGTCATTT TGGTTTGTCC TAAAACAAAT CATATTGAGC AGGAGATGTC 

101 CGCCCCTGCC CAAGCCGCTT TCAGACGGCA TCGCGAGCTG TTCAATAACC 

151 CGCCTTCAGG CGTTGGTCAT TGTCGCAGCC GTCTTGGTCT CCGTTTTGAC 

201 AAGCCTTGCC AAACCATTCT TGTGCAAGGG CGCGGTCTTG GCGCACGCCG 

2 51 CGTCTTTCGG CATACATCAC GCCCAAATTG TTTTGGGCTT GGGCTACCCC 

301 CTGCGC. . . 

This corresponds to the amino acid sequence <SEQ ID 306; ORF 09 1>: 

m091 . pep 

1 MEIPVPPSPA TRIFLFEKSF WFVLKQIILS RRCPPLPKPL SDGIASCSIT 
51 RLQ ALVIVAA VLVSVLTSLA KPFLCKGAVL AHAASFG IHH AQIVLGLGYP 
101 LR. 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 091 shows 84.2% identity over a 101 aa overlap with a predicted ORF (ORF 091.ng) 
from N. gonorrhoeae: 

m091/g091 



10 20 30 40 50 60 

m091 .pep MEIPVPPSPATRIFLFEKSFWFVLKQIILSRRCPPLPKPLSDGIASCSITRLQALVIVAA 

Illlllllllllllllh I IIIIMIhM IIIIMIIIIIII llllllllllll 

g091 ME I PVPPS PATRI FLFESHFGFVLKQI I LGRRRPPLPKPLSDGI ASRLI TRLQALVI VAA 

10 20 30 40 50 60 



70 80 90 100 

m0 91 .pep VLVSVLTSLAKPFLCKGAVLAHAASFGIHHAQIVLGLGYPLR 

Illllllllllhl : llllllh IhllllllM II 
g0 91 VLVSVLTSLAKPLLSERKVLAHAASIVIHQAQIVLGLGIPLFX 

70 80 90 100 



The following partial DNA sequence was identified in N. meningiditis <SEQ ID 307>: 

a091 . seq 

1 ATGGAAATAC CCGTGCCGCC AAGTCCGGCG ACGAGGATTT TTTTGTTTTG ' 

51 GAAATCATTT TGGTTTGTCC TAAAACAAAT CATATTGAGC AGGGGATGTC 

101 TGATCCTGCT CAAGCCGCTT TCAGACGGCA TCGCGAGCTG TTCAATAACC 

151 CGCTTTCAGG CGTTGGTCAT TGTCGCAGCT GTCTTGGTAT CCGTTTTGAC 

201 AAGCCTTGCC AAGCCATTCT TGTGCAAGGG CGCGGTCTTG GCGCACGCCG 

2 51 CGTCTTTCGG CATACATCAC GCCCAAATTG TTTTGGGC 
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This corresponds to the amino acid sequence <SEQ ID 308; ORF 091.a>: 

a091 .pep 

1 MEIPVPPSPA TRI FLFWKSF WFVLKQIILS RGCLILLKPL SDGIASCSIT 
51 RFQ ALVIVAA VLVSVLTSLA KPFLCKGAVL ' AHAASFGIHH AQIVLG 

m o9i/a09i 93.8% identity over a 96 aa overlap 

10 20 30 40 50 60 

m091 .pep MEIPVPPSPATRIFLFEKSFWFVLKQIILSRRCPPLPKPLSDGIASCSITRLQALVIVAA 
I I I I I I ! I I I M I II I I I II 11 I M I I I I I I I I I M I I I I t 11 M I : I I I M I I I 
a 0 91 MEIPVPPSPATRIFLFWKSFWFVLKQIILSRGCLILLKPLSDGIASCS ITRFQALVIVAA 

10 20 30 40 50 60 

70 80 90 100 

m091 . pep VLVSVLTSLAKPFLCKGAVLAHAASFGIHHAQIVLGLGYPLR 

I I I I I I i I I t I I ! I I II I I I I I I M I I I I M I i i I I 
a 0 9 1 VLVSVLT S LAK P FLCKGAVLAHAAS FG I HHAQIVLG 

70 80 90 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 309>: 

g092 . seq 

1 ATGTTTTTTA TTTCAATCCG CTATATATTT GTCAGAAAAC TATGGTGCGC 

51 AAACGGTCAG ACCTTTAAAA TAACGCCTTT ACGCACTAAA AACCAACCGG 

101 AACGCAACAT TATGATGAAA AATCGAGTAA GCAACAT CCA TTTTGTCGGT 

151 ATCGGCGGCG T C GG CAT GAG CGGTATCGCC GAAGTCTTGC AC AAT TTGGG 

201 CTTTAAAGTT TCCGGTTCGG ATCAGGCGCG AAATGCCGCT ACCGAGCATT 

251 TGAGCAGCCT GGGCATTCAA GTTTATCCCG GCCATACCGC AG AACACGT T 

3 01 AACGGTgcgg ATGTCGTCGT TGCCTCTACC GCCGTCAAGA AAGAAaatcC 

3 51 CGAAGTtgtc gcTGCGTTGG AGCGGCAAAT TCCCGTTATT CCGCGCGCCT 

4 01 TGATGCTGGC AGAG CTGATG CGCTTCCGTG ACGgcatcgc cattgccggT 

4 51 ACGCACGGCA AAACCACGAC CACCAGCCTG ACCGCCTCCA TCCTCGGCGC 

5 01 GGCAGGACTC GACCCCACTT TCGTTATCGG CGGCAAACTC AACGCCGCAG 

5 51 GCACCAACGC CCGCTTGGGC AAAGGCGAAT ACATCGTTGC CGAAGCCGAC 

6 01 GAATCCGATG CCTCTTTCCT ACATCTGACC CCGATTATGT CCGTCGTTAC 
651 CAATATCGAC GAAGACCATA TGGATACCTA CGGGCACAGC GTCGAAAAAC 
701 TGCATCAGGC GTTTATCGAT TTCATCCACC GTATGCCCTT CTACGGCAAA 
751 GCCTTTTTGT GTGTTGACAG CGAACACGTC CGCGCGATTT TGCCCAAAGT 
8 01 GAGCAAACCT TATGCTACTT ACGGTTTGGA CGATACCGCC GACATCTACG 
8 51 CCACCGACAT CGAAAACGTC GGCGCGCAAA TGAAATTCAC CGTCCATGTT 
901 CAAATGAAAG GACATGAGCA GGGGT CGTTT GAAGT CGTGC TGAATATGCC 
951 CGGCAGACAC AACGTGCTGA ACGCATTGGC AGCCATCGGc gtggcGCTgg 

1001 aagtcGgCGC ATcggttgAA GCGAt cCAAA AaggCTTGCT CGGCTTTGAA 

1051 GGCGTCGGCC GCCGCTTCCA AAAATAcggc gacatCAagt tgccaaacgg 

1101 cggGaccgCT TTgctGGTGG ACGATT Ac g g ACACCACCCC GTCGAAATGG 

1151 CGGcaaccct tgccgcTGCA CGCGGCGCGT ATCCGGAAAA acgtTTGGTG 

1201 CtcgCCTTCC AGCCGCACCG CTATACCCGC ACGCG CGATT TGTTTGAAGA 

1251 CTTTACCAAA GTACTCAATA CCGTTGatgC GCTGGTACTG ACCGAAGTTT 

13 01 AtgccgccgG CGAAGAGCCG GTTGCCGCCG CCGactcCCG CGCCTTGGCG 

13 51 CGTGCTATCC GCGTATTGGG CAAACTTGAG CCGATTTACT GCGAAAatgt 

14 01 cgccgACCTG CCGCAAATGC TGATGAATGT TTTACAGGAT Ggcgatgttg 
1451 tgttgAATAT GggTgcggga agcatcaacc gcgttccttc cgcgctgttg 
1501 gaattgtcga AACAGAtttg A 

This corresponds to the amino acid sequence <SEQ ED 310; ORF 092.ng>: 

g092 . pep 

1 MFFISIRYIF VRKLWCANGQ TFKITPLRTK NQPERNIMMK NRVSNIHFVG 

51 IGGVGMSGIA EVLHNLGFKV SGSDQARNAA TEHLSSLGIQ VYPGHTAEHV 

101 NGADVWAST AVKKENPEW AALERQIPVI PRALMLAELM RFRDGIAIAG 

151 THGKTTTTSL TAS I LGAAGL DPTFVIGGKL NAAGTNARLG KGEYIVAEAD 

201 ESDAS FLHLT PIMSWTNID EDHMDTYGHS VEKLHQAFID FIHRMPFYGK 
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251 AFLCVDSEHV RAILPKVSKP YATYGLDDTA DIYATDIENV GAQMKFTVHV 

3 01 QMKGHEQGSF EWLNMPGRH NVLNALAAIG VALEVGASVE AI QKGLLGFE 
351 GVGRRFQKYG DIKLPNGGTA LLVDDYGHHP VEMAATLAAA RGAYPEKRLV 
401 LAFQPHRYTR TRDLFEDFTK VLNTVDALVL TEVYAAGEEP VAAADSRALA 

4 51 RAIRVLGKLE PIYCENVADL PQMLMNVLQD G DWLNMGAG SINRVPSALL 
501 ELSKOI* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 31 1>: 

m092 . seq 

1 ATGTTTTTTA TTTCAATCCG C T AT AT ATTT GTCAGAAAAC TATGGCGCGC 

51 AAACGGTCAG CCCTTTAAAA TAACGCCTTT ACGCATCGAA AATCCACCGG 

101 AACGCAACAT TATGATGAAA AATCGAGTTA CCAACATCCA TTTTGTCGGT 

151 ATCGGCGGCG TCGGCATGAG CGGCATCGCC GAAGTCTTGC ACAATTTGGG 

201 CTTTAAAGTT TCCGGTTCGG ATCAgGCGCG AAATGCCGCT ACCGAGCATT 

251 TGGGCAGCCT GGGCATTCAA GTTTATCCCG GCCATACCGC CGAACACGTT 

3 01 AACGGTGCGG ATGTCGTCGT TACCTCTACC GCCGTCAAAA AAGAAAATCC 

3 51 CGAAGTTGTC GCTGCGTTGG AGCAGCAAAT TCCCGTTATT CCGCGCGCCC 

4 01 TGATGTTGGC GGAGTTGATG CGCTTCCGTG ACGGCATCGC CATTGCCGGC 
4 51 ACGCACGGCA AAACCACGAC CACCAGCCTG ACCGCCTCCA TCCTCGGCGC 
501 GGCAGGACTT GACCCGACTT TCGTTATCGG CGGCAAACTC AACGCCGCAG 
551 GCACTAACGC CCGCTTGGGC AAAGGCGAAT ACATCGTTGC CGAAGCCGAC 
601 GAGTCGGATG CATCCTTTCT GCACCTGACA CCGATTATGT CCGTCGTTAC 
651 CAATATCGAC G AAG AC CAT A TGGATACCTA CGGGCACAGC GTCGAAAAAC 
701 TGCATCAGGC GTTTATCGAT TTCATCCACC GTATGCCCTT C T ACGG C AAA 
751 GCCTTTTTGT GTATTGACAG CGAACACGTC CGCGCGATTT TGCCCAAAGT 
801 GAGCAAACCT TATGCTACTT ACGGTTTGGA CGATACCGCC GACATCTACG 
851 CCACCGACAT CGAAAACGTC GGCG CGC AAA TGAAATTCAC CGTCCATGTT 
901 CAAATGAAAG GACATGAGCA GGGGTCGTTT GAAGTCGTGC TGAATATGCC 
951 CGGCAGACAC AACGTGCTGA ACGCATTGGC AGCCATCGGC GTGGCGCTGG 

1001 AAGTCGGCGC ATCGGTTGAA GCGATCCAAA AAGGCTTGCT CGGCTTTGAA 

1051 GGCGTCGGCC GCCGCTTCCA AAAATACGGC GACAT C AAGT TGCCAAACGG 

1101 CGGGACCGCG CTCTTGGTGG ACGACTACGG ACACCACCCC GTCGAAATGG 

1151 CGGCGACCCT TGCCGCCGCA CGCGGCGCGT ATCTGGAAAA ACGTTTGGTA 

1201 CTCGCCTTCC AGCCGCACCG CTATACCCGC ACGCGCGATT TGTTTGAAGA 

1251 CTTT AC CAAA GTCCTCAATA CCGTTGACGC GCTGGTGCTG ACCGAAGTTT 

1301 ATGCCGCCGG TGAAGAGCCG ATTGCCGCCG CCGATTCCCG CGCTCTTGCC 

13 51 CGCGCCATCC GCGTGTTGGG CAAACTCGAG CCGATTTACT GCGAAAACGT 

14 01 TGCCGATCTG CCCGAAATGC TGTTGAACGT TTTGCAGGAC GGCGACATCG 
1451 TGTTGAATAT GGGCGCGGGA AGCATCAACC GCGTCCCCGC CGCGCTGCTG 
1501 GCATTGTCGA AACAGATTTG A 

This corresponds to the amino acid sequence <SEQ ID 312; ORF 092>: 

m092 . pep 

1 MFFISIRYIF VRKLWRANGQ PFKITPLRIE NPPERNIMMK NRVTNIHFVG 

51 IGGVGMSGIA EVLHNLGFKV SGSDQARNAA TEHLGSLGIQ VYPGHTAEHV 

101 NGADVWTST AVKKENPEW AALEQQIPVI PRALMLAELM RFRDGIAIAG 

151 THGKTTTTSL TAS I LGAAGL DPTFVIGGKL NAAGTNARLG KGEYIVAEAD 

201 ESDASFLHLT PIMSWTNID EDHMDTYGHS VEKLHQAFID FIHRMPFYGK 

251 AFLCIDSEHV RAILPKVSKP YATYGLDDTA DIYATDIENV GAQMKFTVHV 

3 01 QMKGHEQGSF EWLNMPGRH NVLNA LAAIG VALEVGASVE AI QKGLLGFE 

351 GVGRRFQKYG DIKLPNGGTA LLVDDYGHHP VEMAATLAAA RGAYLE KRLV 

401 LAFQPHRYTR TRDLFEDFTK VLNTVDALVL TEVYAAGEEP I AAADSRALA 

451 RAIRVLGKLE PIYCENVADL PEMLLNVLQD GD I VLNMG AG SINRVPAALL 

501 ALSKQI * 

Computer analysis of this amino acid sequence gave the following results: 

Homology with a predicted ORF from N. gonorrhoeae 

ORF 092 shows 96.6% identity over a 506 aa overlap with a predicted ORF (ORF 092. ng) 
from N. gonorrhoeae: 

m092/g092 

10 20 30 40 50 60 

m092 - pep MFFISIRYIFVRKLWRANGQPFKITPLRIENPPERNIMMKNRVTNIHFVGIGGVGMSGIA 
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lllllllllllllll III! IMIIM I II Ml M MM M MIM Mill I MM 

g092 MFFISIRYIFVRKLWCANGQTFKITPLRTKNQPERNIMMKNRVSNIHFVGIGGVGMSGIA 

10 20 30 40 50 60 

70 80 90 100 110 120 

m092 . pep EVLHNLGFKVSGSDQARNAATEHLGSLGIQVYPGHTAEHVNGADVWTSTAVKKENPEW 

I I IMMMIMI MM MMMMMll IMM M II Mill IMMMIIIM I II II 

g092 E VliHNLG F KVS G S DQ ARNAAT E H L S S LG I Q V Y P GHT AE HVNG AD VW AS T A V KKEN P E W 

70 80 90 100 110 120 



130 140 150 160 170 180 

m092 . pep AALEOQI PVI PRALMLAELMRFRDGI AI AGTHGKTTTTSLTAS I LGAAGLDPTFVI GGKL 

M MMM IIIIIMI MIMI I MMMM I II I II IM'M I MIMM MM M I M 

g092 AALERQI PVI PRALMLAELMRFRDGIAIAGTHGKTTTTSLTAS I LGAAGLDPTFVI GGKL 

130 140 150 160 170 180 



190 200 210 220 230 240 

m092 . pep NAAGTNARLGKGEYIVAEADESDAS FLHLTPIMSWTNIDEDHMDTYGHSVEKLHQAFID 

MM I I I I I I I ! I 1 ! II M I I I! I I I 1 1 I I I ; 1 1 I I I I 1 1 

g092 N AAGTN ARLG KGE Y I V AE AD E S DAS FLH L T P I MSWTN IDE DHMD T YGH S VE KLHQ A FID 

190 200 210 220 230 240 



250 260 270 280 290 300 

m092 . pep F I HRMP F YGKAFLCI DS EHVRAI LPKVS KP YATYGLDDTAD I YATD I ENVGAQMKFTVHV 

I I MM MIMM IM I I II I II II MM Ml Ml IMIIM I I MMMIIMI M M I 

g092 F I HRMP FYGKAFLCVDS EHVRAI LPICVSKP YATYGLDDTAD I YATD I ENVGAQMKFTVHV 

250 260 270 280 290 300 



310 320 330 340 350 360 

mO 92 . pep QMKGHEQGSFEVVLNMPGRJDTVLNALAAIGVALEVGASVEAIQKGLLGFEGVGRRFQKYG 

I I IMMMMM MM M I MMMI Mill! I M IMIIM II Ml I IMIIM !! M 

gO 92 QMKGHEQGSFEVVLNMPGEIHNVLNALAAIGVALEVGASVEAIQKGLLGFEGVGRRFQKYG 

310 320 330 340 350 360 



370 380 390 400 410 420 

mO 9 2 . pep DI KIjPNGGTALLVDDYGHHPVEMAATIAAARGAYLEKRLVLAFQPHRYTRTRDLFEDFTK 

I I IMMMMIIMM III III MMM MIM I I I 1 1 M I I 1 1 ! I I I 1 1 1 I I II I I I 

gO 9 2 DI KLPNGGTALLVDDYGHHPVEMAATLAAARGAYPEKRLVLAFQPHRYTRTRDLFEDFTK 

370 380 390 400 410 420 



430 440 450 460 470 480 

m092 . pep Vl^TVDALVLTEVYAAGEEPIAAADSRALARAIRVLGKLEPIYCENVADLPEMLLNVDQD 

i IMIM IMIIMM I IMMIMIMM Mill M IIIIIMI MIMMMM I Ml 

g092 VLNTVDALVLTEVYAAGEEPVAAADSRALARAIRVLGKLEPIYCE^ADLPQMLMNVLQD 

430 440 450 460 470 480 



490 500 
m092 .pep GD I VLNMGAGS I NRVP AALLALS KQ I X 

I Ml I I Mill M MM I I I MMM 

gO 9 2 GDWLNMGAGSINRVPSALLELSKQIX 

490 500 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 313>: 

a092 . seq 

1 ATGTTTTTTA TTTCAATCCG CTATATATTT GTCAGAAAAC TATGGCGCGC 

51 AAACGGTCAG CCCTTTAAAA TAACGCCTTT ACGCATCGAA AATCCACCGG 

101 AACGCAACAT TATGATGAAA AATCGAGTGA CCAACATCCA TTTTGTCGGT 

151 ATCGGCGGCG TCGGCATGAG CGGTATCGCC GAAGTCTTGC ACAATTTGGG 

201 TTTTAAAGTT TCCGGTTCGG ATCAGGCGCG AAATGCCGCT ACCGAGCATT 

251 TGGGCAGCCT GGGCATTCAA GTTTATCCCG GCCATACCGC AGAACACGTT 
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301 AACGGTGCGG ATGTCGTCGT 

3 51 CGAAGTTGTC GCTGCGTTGG 

4 01 TGATGTTGGC GGAGTTGATG 
4 51 ACGCACGGCA AAACCACGAC 
501 GGCAGGACTT GACCCGACTT 
551 GCACCAACGC CCGCTTGGGC 
601 GAGTCGGATG CATCCTTTCT 
651 CAATATCGAC G AAG AC CAT A 
7 01 TGCATCAGGC GTTTATCGAT 
7 51 GCCTTTTTGT GTATTGACAG 
801 GAGCAAACCT TATGCTACTT 
851 CCACCGACAT CGAAAACGTC 
901 CAAATGAAAG GACATGAGCA 
951 CGGCAGACAC AACGTGCTGA 

1001 AAGTCGGCGC ATCGGTTGAA 

1051 GGTGTCGGCC GCCGCTTCCA 

1101 TGGAACCGCG CTCTTGGTGG 

1151 CGGCGACCCT TTCCGCCGCA 

12 01 CTCGCCTTCC AGCCGCACCG 

1251 CTTTACCAAA GTCCTCAATA 

1301 ATGCCGCCGG TGAAGAGCCG 

1351 CGCGCCATCC GCGTGTTGGG 

1401 TGCCGATCTG CCCGAAATGC 

14 51 TGTTGAATAT GGGTGCGGGA 

1501 GAATTGTCGA AACAGATTTG 



TACCTCTACC GCCGTCAAAA AAGAAAATCC 
AGCAGCAAAT TCCCGTTATT CCGCGCGCCC 
CGCTTCCGTG ACGGCATCGC CATTGCCGGC 
CACCAGCCTG ACCGCCTCCA TCCTCGGCGC 
TCGTTATCGG CGGCAAACTC AACGCCGCAG 
AAAGGCGAAT ACATCGTTGC CGAAGCCGAC 
GCACCTGACA CCGATTATGT CCGTCGTTAC 
TGGATACCTA CGGGCACAGT GTTGAGAAGC 
TTCATCCACC GTATGCCCTT CTACGGCAAA 
CGAACACGTC CGCGCGATTT TGCCCAAAGT 
ACGGTTTGGA CGATACCGCC GACATCTACG 
GGCGCGCAAA TGAAATTCAC CGTCCATGTT 
GGGGTCGTTT GAAGTCGTGC TGAATATGCC 
ACGCATTGGC AGCCATCGGC GTGGCGCTGG 
GCGATCCAAA AAGGCTTGCT CGGCTTTGAA 
AAAATACGGC GACATCAAGT TGCCAAACGG 
ACGACTACGG ACACCACCCC GTCGAAATGG 
CGCGGCGCGT ATCCGGAAAA ACGTTTGGTA 
CTATACCCGC ACGCGCGATT TGTTTGAAGA 
CCGTTGACGC GCTGGTGCTG ACCGAAGTTT 
ATTGCCGCCG CTGATTCCCG CGCTCTTGCC 
CAAACTCGAG CCGATTTACT GCGAAAACGT 
TGTTGAACGT TTTGCAGGAC GGCGACATCG 
AGCATCAACC GCGTCCCCGC CGCGCTGCTG 
A 



This corresponds to the amino acid sequence <SEQ ID 314; ORF 092. a>: 

a092.pep 

MFFISIRYIF VRKLWRANGQ PFKITPLRIE NPPERNIMMK NRVTNIHFVG 

IGGVGMSGIA EVLHNLGFKV SGSDQARNAA TEHLGSLGIQ VYPGHTAEHV 

NGADWVTST AVKKENPEW AALEQQIPVI PRALMLAELM RFRDGIAIAG 

THGKTTTTSL TASILGAAGL DPTFVIGGKL NAAGTNARLG KGEYIVAEAD 

ESDASFLHLT PIMSWTNID EDHMDTYGHS VEKLHQAFID FIHRMPFYGK 

251 AFLCIDSEHV RAILPKVSKP YATYGLDDTA DIYATDIENV GAQMKFTVHV 

301 QMKGHEQGSF EWLNMPGRH NVLNA LAAIG VALEVGASVE AI QKGLLGFE 

351 GVGRRFQKYG DIKLPNGGTA LLVDDYGHHP VEMAATLSAA RGAYPEKRLV 

LAFQPHRYTR TRDLFEDFTK VLNTVDALVL TEVYAAGEEP IAAADSRALA 

RAIRVLGKLE PIYCENVADL PEMLLNVLQD GDIVLNMGAG SINRVPAALL 
ELSKQI* 



1 

51 
101 
151 
201 



401 
451 
501 



m092/a092 99.4% identity over a 506 aa overlap 

10 20 30 40 50 60 

m092. pep MFFISIRYI FVRKLWRANGQPFK IT PLR I EN PPERNIMMKNRVTN I HFVG IGGVGMSGIA 

I M 9 I M I ! I I I I I I I 1 I M I I M I I I I I II I I M ! I I II I I I I I I ! 1 I I I I I I t I I I I t 
a092 MFFISIRYI FVRKLWRANGQPFKITPLRIENPPERNIMMKNRVTNIHFVGIGGVGMSGIA 

10 20 30 40 50 60 



70 80 90 100 110 120 

mO 92 . pep EVLHNLGFKVSGSDQARNAATEHLGSLGIQVYPGHTAEHVNGADVWTSTAVKKENPEW 
I I I II I I I I M I I I I I I I I I 1 I I I I I II I II I M I I 1 I I I I I I M M I I I I I I I I I I II I 
a092 EVLHNLGFKVSGSDQARNAATEHLGSLGIQVYPGHTAEHVNGADWVTSTAVKKENPEW 

70 80 90 100 110 120 



130 140 150 160 170 180 

m092 .pep AALEQQIPVIPRALMLAELMRFRDGIAIAGTHGKTTTTSLTASILGAAGLDPTFVIGGKL 
I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I If I II I I I I I I I I I I 
a092 AALEQQIPVIPRALMLAELMRFRDGIAIAGTHGKTTTTSLTASILGAAGLDPTFVIGGKL 

130 140 150 160 170 180 



190 200 210 220 230 240 

m092 .pep NAAGTNARLGKGEYIVAEADESDASFLHLTPIMSWTNIDEDHMDTYGHSVEKLHQAFID 
I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 1 I I ! I I I I I I I I I I I I I I I I I I I I I I I 
a 0 9 2 NAAGTN ARLGKGE Y I VAEADE S DAS FLH LTPIM S VVTN I DE DHMDT YGH S VEKLHQAFI D 
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190 200 210 220 230 240 

250 260 270 280 290 300 

m092.pep FIHRMPFYGKAFLCI DSEHVRAILPKVSKPYATYGLDDTADI YATDIENVGAQMKFTVHV 

I I I I I I I | M i I I i ! I M I I I I I M It 1 I i I I M I I! I I I M 1 I M I I I I 1 I I I I I M 11 
a0 92 FIHRMPFYGKAFLCI DSEHVRAILPKVSKPYATYGLDDTADI YATDIENVGAQMKFTVHV 

250 260 270 280 290 300 

310 320 330 340 350 360 

m0 92 . pep QMKGHEQGSFEWLNMPGRHNVLNALAAIGVALEVGASVEAIQKGLLGFEGVGRRFQKYG 
I I I I I M I I M I I I I I I ! I t I ! I I 1 I I M I I I I I II M I ! I I I I I M I II I I I I I II I I I 
a0 92 QMKGHEQGSFEVVLNMPGRHNVLNALAAIGVALEVGASVEAIQKGLLGFEGVGRRFQKYG 

310 320 330 340 350 360 

370 380 390 400 410 420 

m092 . pep DIKLPNGGTALLVDDYGHHPVEMAATLAAARGAYLEKRLVLAFQPHRYTRTRDLFEDFTK 
I I I I I I I I ! M i I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I 
a 0 92 DIKLPNGGTALLVDDYGHHPVEMAATLSAARGAYPEKRLVLAFQPHRYTRTRDLFEDFTK 

370 380 390 400 410 420 

430 440 450 460 470 480 

m0 92 . pep VLNTVDALVLTEVYAAGEEPIAAADSRAIARAIRVLGKLEPIYCENVADLPEMLLNVLQD 
I I I I I I I ! I I II I I I I I I I I I I I I I I I I M I I I I I I.I M I I I M I I I I I I I I M I It E I I 
a 0 92 VLNTVDALVLTEVYAAGEE PI AAADSRALARAI RVLGKLE P I YCENVADLPEMLLNVLQD 

430 440 450 460 470 480 

490 500 
m0 92 .pep GDI VLNMG AG SIN RV P AAL LAL S KQ I X 

I I I I I I II 1 I I I I I I I I I I I MINI 
a092 GDI VLNMG AG SINRVPAALLELSKQIX 

490 500 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 315>: 

g093 . seq 

1 aTGCAGAATt ttgGCAAAGT ggccgtATTG ATGGGtggtT TTTCCAGCGA 

51 ACGAGAaatC tcgcTGGACA GCgGTACCGC CATTTTGAAC GCCTTAAAAA 

101 GCAAAGGCAT AGACGCATAC GCCTTCGACC CTAAGGAAAC GCCGTTATCC 

151 GAACTGAAGG AGCGGGGCTT T C AGACGGC A TTCAACATCC TTCACGGTAC 

201 TTACGGCGAA GACGGGGCTG TTCAGGGTGC ATTGGAACTG TTGGGCATTC 

251 CCTATACCGG CAGCGGTGTC GCCGCCTCCG CCATCGGCAT GGACAAATAC 

3 01 CGCTGCAAAC TGATTTGGCA GGCATTGGGA TTACCCGTTC CCGAGTTCGC 

3 51 CGTACTGTAC GATGATACCG ATTTCGATGC CGTCGAAGAA AAATTGGGTC 

4 01 TGCCGATGTT TGTGAAGCCG GCGGCCGAAG GCAGCAGCgt cggcgtggta 
451 aAAGTCAAAG AAAaaggccg TCTGAAAAGC GTTtacgaag aatTGAaaCA 
501 CCTTcagggg cgaAAtcatt gccgAacgTT TTATCGG CGG CGGCGAATAT 
551 TCCTGCCCCG TCCTGAACGG CAAAGGGCTG CCCGGCATAC ACATCATCCC 
601 CGCAACCGAG TTTTACGAct acgaagccaa GtacaaCCGA GACGAcacca 
651 tttaTCAATG TCCTTCGGAA GATTTGACCG AAGCCGAAGA AAGCCTGATG 
701 CGCGAACTGG CGGTTCGCGG CGCACAGGCA ATCGGTGCGG AAGGCTGCGT 
751 GCGCGTCGAT TTCCTCAAAG ATACCGACGG CAAACTCTAT CTGTTGGAAA 
801 TCAACACCCT GCCCGGTATG ACCGGC C ATA G 

This corresponds to the amino acid sequence <SEQ ID 316; ORF 093. ng>: 

g093 .pep 

1 MQNFGKVAVL MGGFSSEREI SLDSGTAILN ALKS KG I DAY AFDPKETPLS 

51 ELKERGFQTA FNILHGTYGE DGAVQGALEL LGIPYTGSGV AASAIGMDKY 

101 RCKLIWQALG LPVPEFAVLY DDTDFDAVEE KLGLPMFVKP AAEGSSVGW 

151 KVKE KGRLKS VYEELKHLQG RNHCRTFYRR RRIFLPRPER QRAARHTHHP 

201 RNRVLRLRSQ VQPRRHHLSM SFGRFDRSRR KPDARTGGSR RTGNR CGRLR 

251 ARRFPQRYRR QTLSVGNQHP ARYDRP* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 31 7>: 

m093 . seq 
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1 ATGCAGAATT TTGGCAAAGT GGCCGTATTG ATGGGCGGTT TTTCCAGCGA 

51 ACGAGAAATC TCGCTGGACA GCGGCACCGC CATTTTGAAT GCTTTAAAAA 

101 G C AAAGG CAT AG ACG CAT AC GCCTTCGATC CTAAAGAAAC CCCATTGTCT 

151 GAATTGAAGG CACAAGGTTT TCAGACGGCA TTCAACATCC TTCACGGTAC 

201 TTACGGCrAA GACGGGGCGG TTCAGGGTGC ATTGGAACTG TTGGGCATTC 

251 CCTATACCGG CAGCGGTGTC GCCGCATCCG CCATCGGCAT GGACAAATAC 

301 CGCTGCAAAC TGATTTGGCA GGCATTGGGA TTGCCCGTTC CCGAGTTCGC 

3 51 CGTCCTGCAC GACGACACTG ATTTCGATGC CGTCGAAGAA AAATTGGGCC 

4 01 TGCCGATGTT TGTGAAACCG GCGGCCGAAG GCAGCAGCGT AGGCGTGGTA 
4 51 AAAGTCAAAG GAAAAGGCCG TCTGAAAAGC GTTTACGAAG AATTGAAACA 
501 CCTTCAGGG. CGAAATCATT GCCGAACGTT TTATCGGCGG CGGCGAATAT 
551 TCCTGCCCCG TCCTGAACGG CAAAGGGCTG CCCGGCATAC ACATCATTCC 
6 01 CGCAACCGAG TTTTACGACT ACGAAGCCAA GTACAACCGC G A CGAC AC C A 
6 51 TTTATCAATG TCCTTCGGAA GATTTGACCG AAG CCGAAG A AAGCCTGATG 
701 CGCGAACTGG CGGTTCGCGG CGCGCAGGCA ATCGGTGCGG AAGGCTGCGT 
751 GCGCGTCGAT TTCCTCAAAG ATACCGACGG CAAACTCTAT CTGTTGGAAA 
8 01 TCAACACCCT GCCCGGTATG ACG AGC CAT A G 

This corresponds to the amino acid sequence <SEQ ID 318; ORF 093>: 

mO 9 3 . pep 

1 MQNFGKVAVL MGGFSSEREI SLDSGTAILN ALKS KG I DAY AFDPKETPLS 

51 ELKAQGFQTA FNI LHGTYGX DGAVQGALEL LGIPYTGSGV AASAIGMDKY 

101 RCKLIWQALG LPVPEFAVLH DDTDFDAVEE KLGLPMFVKP AAEGSSVGW 

151 KVKGKGRLKS VYEELKHLQX RNHCRTFYRR RRIFLPRPER QRAARHTHHS 

2 01 RNRVLRLRSQ VQPRRHHLSM SFGRFDRSRR KPDARTGGSR RAGNRCGRLR 

251 ARRFPQRYRR QTLSVGNQHP ARYDEP* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 093 shows 96.7% identity over a 276 aa overlap with a predicted ORF (ORF 093. ng) 
from N. gonorrhoeae: 

m093/g093 

10 20 30 40 50 60 

m093 . pep MQN FG KV AVLMGG FSSEREISLDS GT A I LNALKS KG I DA Y AFD P KE T P L S E L KAQG FQ T A 

I I I I I 1 1 I I I 1 1 I 1 1 1 1 1 1 1 1 1 1 I M I [ 1 ! 1 1 1 1 1 1 I ! 1 ! I 1 I I I I i 1 1 I I ! I :|MII 

g093 MQNFGKVAVLMGGFSSEREISLDSGTAILNALKSKGIDAYAFDPKETPLSELKERGFQTA 

10 20 30 40 50 60 

70 80 90 100 110 120 

m093 . pep FNILHGTYGXDGAVQGALELLGIPYTGSGVAASAIGMDKYRCKLIWQALGLPVPEFAVLH 

Illllllll 1 1 1 E 1 1 1 1 1 1 1 1 1 1 E 1 I E I 1 1 1 1 1 1 1 I I I 1 1 1 I I I i 1 1 1 I I 1 1 I i i I I I ^ 

g093 FNILHGTYGEDGAVQGALELLGIPYTGSGVAASAIGMDKYRCKLIWQALGLPVPEFAVLY 

70 80 90 100 110 120 

130 140 150 160 170 180 

m093 . pep DDTDFDAVEEKLGLPMFVKPAAEGSSVGWKVKGKGRLKSVYEELKHLQXRNHCRTFYRR 

M I I I I M M I Mill Mil I IN I I M ! I 1 I I I I I I M I I II I I I llllllllll 

g093 DDTDFDAVEEKLGLPMFVKPAAEGSSVGWKVKEKGRLKSVYEELKHLQGRNHCRTFYRR 

130 140 150 160 170 180 

190 200 210 220 230 240 

m093 . pep RRIFLPRPERQRAARHTHHSRNRVLRLRSQVQPRRHHLSMSFGRFDRSRRKPDARTGGSR 

II I M MM MM IMMI I I I I 1 1 1 I I 1 1 I I I 1 1 I I 1 1 I 1 1 I 1 1 1 1 1 I II I M I I I I I 

g093 RRIFLPRPERQRAARHTHHPRNRVLRLRSQVQPRRHHLSMSFGRFDRSRRKPDARTGGSR 

190 200 210 220 230 240 

250 260 270 

m093 .pep RAGNRCGRLRARRFPQRYRRQTLSVGNQHPARYDEPX 

M I I Ml I I I MIMIIIIM ilMI MIMI I Ml I 

g093 RTGNRCGRLRARRFPQRYRRQTLSVGNQHPARYDRPX 

250 260 270 
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The following partial DNA sequence was identified in N. meningitidis <SEQ ID 319>: 

a093 . seq 

1 ATGCAGAATT TTGGCAAAGT GGCCGTATTG ATGGGCGGTT TTTCCAGCGA 

51 ACGAGAAATC TCGCTGGACA GCGGCACCGC CATTTTGAAT GCTTTAAAAA 

101 GCAAAGGCAT AGACGCATAC GCCTTCGATC CCAAGGAAAC CCCATTGTCT 

151 GAATTGAAGG CACAAGGTTT TCAGACGGCA TTCAACATCC TTCACGGTAC 

2 01 TTACGGCGAA GACGGGGCTG TTCAGGGTGC ATTGGAACTG TTGGGCATTC 

2 51 CCTATACCGG CAGCGGTGTC GCCGCATCCG CCATCGGCAT GGACAAATAC 

301 CGCTGCAAAC TGATTTGGCA GGCATTGGGA TTGCCCGTTC CCGAGTTCGC 

351 CGTCCTGCAC GACGACACTG ATTTCGATGC CGTCGAAGAA AAATTGGGCC 

4 01 TGCCGATGTT TGTGAAACCG GCGGCCGAAG GCAGCAGCGT AGGCGTGGTA 

4 51 AAAGT CAAAG GAAAAGGCCG TCTGAAAAGC GTTTACGAAG AATTGAAACA 

501 CTTTCAGGG . CGAAATCATT GCCGAACGGT TTATCGGCGG CGGCGAATAT 

551 TCCTGCCCTG TGTTGAACGG CAAAGGCCTG CCCGGCATAC ACATCATCCC 

601 CGCGACCGAG TTTTATGACT ACGAAGCCAA GTACAACCGC AACGACACCA 

651 TTTATCAATG TCCTTCGGAA GATCTGACCG AAGCCGAAGA AAGCCTGATG 

701 CGCGAACTGG CGGTTCGCGG CGCGCAGGCA ATCGGTGCGG AAGGCTGCGT 

7 51 GCGCGTCGAT TTCCTCAAAG ATACCGACGG CAAACTCTAT CTGTTGGAAA 

801 TCAACACCCT GCCCGGTATG ACCGGCCATA G 

This corresponds to the amino acid sequence <SEQ ID 320; ORF 093. a>: 

a093 .pep 

1 MQN FGKVAVL MGGFSSEREI SLDSGTAILN ALKSKGIDAY AFDPKETPLS 

51 ELKAQGFQTA FNILHGTYGE DGAVQGALEL LGIPYTGSGV AASAIGMDKY 

101 RCKLIWQALG LPVPEFAVLH DDTDFDAVEE KLGLPMFVKP AAEGSSVGW 

151 KVKGKGRLKS VYEELKH FQX RNHCRTVYRR RRIFLPCVER QRPARHTHHP 

201 RDRVL*LRSQ VQPQRHHLSM SFGRSDRSRR KPDARTGGSR RAGNRCGRLR 

251 ARRFPQRYRR QTLSVGNQHP ARYDRP* 

m093/a093 95.7% identity over a 276 aa overlap 

10 20 30 40 50 60 

m093.pep MQNFGKVAVLMGGFSSEREISLDSGTAILNALKSKGIDAYAFDPKETPLSELKAQGFQTA 
I I 1 I I I I I I I I I I I I M t I I I I II I I I I II I I M M I I I I II I I I I I I I I I I I II I I I ! I 
a0 93 MQN FGKVAVLMGGFS SERE I SLDS GT AI LNALKSKG I DAYAFDPKET PLSELKAQG FQT A 

10 20 30 40 50 60 

70 80 90 100 110 120 

m093 . pep FNILHGTYGXDGAVQGALELLGIPYTGSGVAASAIGMDKYRCKLIWQALGLPVPEFAVLH 
I I I I I I I I I I II I I I I I I I I I I M I M I I I I I I I I I I II M I II I I I M I I I I I I I I II 
a0 93 FNILHGTYGEDGAVQGALELLGIPYTGSGVAASAIGMDKYRCKLIWQALGLPVPEFAVLH 

70 80 90 100 110 120 

130 140 150 160 170 180 

m093 . pep DDTDFDAVEEKLGLPMFVKPAAEGSSVGWKVKGKGRLKSVYEELKHLQXRNHCRTFYRR 
I I I I I I M I M I I I I I I I I I I I M I II I I I I I I I I I II I 1 I I I I I I I : I I I I I I I I I I I 
a 093 DDTDFDAVEEKLGLPMFVKPAAEGSSVGWKVKGKGRLKSVYEELKHFQXRNHCRTVYRR 

130 140 150 160 170 180 

190 200 210 220 230 240 

m093 . pep RRIFLPRPERQRAARHTHHSRNRVLRLRSQVQPRRHHLSMSFGRFDRSRRKPDARTGGSR 
I I I I I I I I I I MINI I : I M I I I I I M : I I I I I I II I I I I I I I I I I I I I I I I I 
a 093 RRIFLPCVERQRPARHTHHPRDRVLXLRSQVQPQRHHLSMSFGRSDRSRRKPDARTGGSR 

190 200 210 220 230 240 

250 260 270 

m093 . pep RAGNRCGRLRARRFPQRYRRQTLSVGNQHPARYDEPX 
I I I I I I I II II I I I I I I I I I I II I I I I I I II I I I : I I 
a093 RAGNRCGRLRARRFPQRYRRQTLSVGNQHPARYDRPX 

250 260 270 
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The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 32 1>: 

g094 . seq 

1 ATGTATTCGC CTTTGCCCAA GCGGGCGTTG GTGCCTGCGG CGTTGAGTTT 

51 GCCGCCGATA ACGAAAGTGG GGTCGAGTCC TGCCGCGCCG AGGATGGAGG 

101 CGGTCAGGCT GGTGGTCGTG GTTTTGCCGT GCGTAccggc aatggcgatg 

151 cCGTCACGGA AGCGCATCAG CTCTGCCAGC ATCAAGGCGC GCGGAATAAC 

2 01 GGGAATTTGC CGCTCCAACG CAgcgacaAC TTCGGgattT TCTTTCTTGA 

2 51 CGGCGGTAGA GGCAACGACG ACATccgcAC CGTTAACGTG TTCTGCGGTA 

3 01 TGGCCGGGAT AA 

This corresponds to the amino acid sequence <SEQ ID 322; ORF 094.ng>: 

g094 .pep 

1 MYSPLPKRAL VPAALSLPPI TKVGSSPAAP RMEAVRLVW VLPCVPAMAM 
51 PSRKRISSAS IKARGITGIC RSNAATTSGF SFLTAVEATT TSAPLTCSAV 
101 WPG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 323>: 

m094 . seq 

1 ATGTATTCGC CTTTGCCCAA GCGGGCGTTA GTGCCTGCGG CGTTGAGTTT 

51 GCCGCCGATA ACGAAAGTCG GGTCAAGTCC TGCCGCGCCG AGGATGGAGG 

101 CGGTCAGGCT GGTGGTCGTG GTTTTGCCGT GCGTGCCGGC AATGGCGATG 

151 CCGTCACGGA AGCGCATCAA CTCCGCCAAC ATCAGGGCGC GCGGAATAAC 

201 GGGAATTTGC TGCTCCAACG CAGCGACAAC TTCGGGATTT TCTTTTTTGA 

251 CGGCGGTAGA GGTAACGACG ACATCCGCAC CGTTAACGTG TTCGGCGGTA 

301 TGGCCGGGAT AA 

This corresponds to the amino acid sequence <SEQ ID 324; ORF 094>: 

m094 -pep 

1 MYSPLPKRAL VPAALSLPPI TKVGSSPAAP RMEAVRLVW VLPCVPAMAM 
51 PSRKRINSAN IRARGITGIC CSNAATTSGF SFLTAVEVTT TSAPLTCSAV 
101 WPG* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 094 shows 95.1% identity over a 103 aa overlap with a predicted ORF (ORF 094.ng) 
from N. gonorrhoeae: 

m094/g094 



m094 . pep 
g094 



10 20 30 40 50 60 

MYSPLPKRALVPAALSLPPITKVGSSPAAPRMEAVRLVVVVLPCVPAMAMPSRKRINSAN 

1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 (! i 1 1 1 1 1 1 ! 1 3 1 1 1 1 1 1 1 1 1 1 E 1 1 1 1 1 1 1 1 1 i i 1 1 1 1 1 1 1 = I E = 

MYSPLPKRALVPAALSLPPITKVGSSPAAPRMEAVRLWWLPCVPAMAMPSRKRISSAS 
10 20 30 40 50 60 



70 80 90 100 

m0 94 . pep I RARG I TG I C CS NAATT S G F S F LT AVEVTTT S AP LT C S AVW PGX 

hllllllll I I il I I :l II : II I I hMM II I ;M I III II 

g0 94 I KARGITGI CRS NAATT SGFSFLTAVEATTTSAPLTCS AVW PGX 

70 80 90 100 



The following partial DNA sequence was identified in TV. meningitidis <SEQ ED 325>: 

a094 . seq 

1 ATGTATTCGC CTTTGCCCAA GCGGGCGTTG GTGCCTGCGG CGTTGAGTTT 

51 GCCGCCGATA ACGAAAGTCG GGTCAAGTCC TGCCGCGCCG AGGATGGAGG 

101 CGGTCAGGCT GGTGGTCGTG GTTTTGCCGT GCGTGCCGGC AATGGCGATG 

151 CCGTCACGGA AGCGCATCAA CTCCGCCAAC ATCAGGGCGC GCGGAATAAC 

201 GGGAATTTGC TGCTCCAACG CAGCGACAAC TTCGGGATTT TCTTTTTTGA 

251 CGGCGGTAGA GGTAACGACG ACATCCGCAC CGTTAACGTG TTCTGCGGTA 

301 TGGCCGGGAT AA 

This corresponds to the amino acid sequence <SEQ ID 326; ORF 094. a>: 



WO 99/057280 



296 



PCT/US99/09346 



a094 .pep 

1 MYSPLPKRAL VPAALSLFPI TKVGSSPAAP RMEAVRLWV VLPCVPAMA M 

51 PSRKRINSAN IRARGITGIC CSNAATTSGF SFLTAVEVTT TSAPLTCSAV 

101 WPG* 

m094/a094 100.0% identity over a 103 aa overlap 

10 20 30 40 50 60 

m094 .pep MYSPLPKRALVPAALSLPPITKVGSSPAAPRMEAVRLWWLPCVPAMAMPSRKRINSAN 
I I II I M I! I II I t I I I I I f I I I I I I t I I I II I I I I I t I I I i I I 1 I I II M I i I I M I I I 
a0 94 MYSPLPKRALVPAALSLPPITKVGSSPAAPRMEAVRLWWLPCVPAMAMPSRKRINSAN 

10 20 30 40 50 60 

70 80 90 100 

m094 .pep IRARGITGICCSNAATTSGFSFLTAVEVTTTSAPLTCSAVWPGX 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I- 1 I I I I I i I I I I I 
a094 IRARGITGICCSNAATTSGFSFLTAVEVTTTSAPLTCSAVWPGX 

70 80 90 100 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 327>: 

g095 . seq 

1 ATGTCCTTTC ATTTGAACAT GGACGGTGAA TTTCATTTGC GCGCCGACGT 

51 TTTCGATGTC GGTGGCGTAG ATGTCGGCGG TATCGTCCAA ACCGTAAGTA 

101 G C AT AAGGTT TGCTCACTTT GGGCAAAATC GCGCGGACGT GTTCGCTGTC 

151 AACACACAAA AAGGCTTTGC CGTAGAAGGG CATACGGTGG ATGAAATCGA 

201 TAAACGCCTG ATG CAGTTTT TCGACGCTGT GCCCGTAGGT ATCCATATGG 

251 TCTTCGTCGA TATTGGTAAC GACGGACATA ATCGGGGTCA GTGTAGGAAA 

3 01 GAGG CAT CGG ATCGTCGGCT TCGGCAACGA TGTATTCGCC TTTGCCCAAG 

3 51 CGGGCGTTGG TGCCTGCGGC GTTGA 

This corresponds to the amino acid sequence <SEQ ID 328; ORF 095. ng>: 

g0 95 .pep 

1 MS FHLNMDGE FHLRADVFDV GGVDVGGIVQ TVSSIRFAHF GQNRADVFAV 

51 NTQKGFAVEG HTVDE I DK RL MQFFDAVPVG IHMVFVD IGN DGHNRGQCRK 

101 E AS DRRLRQR CIRLCPSGRW CLRR* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ED 329>: 

m095 . seq 

1 ATGTCCTTTC ATTTGAACAT GGACGGTGAA TTTCATTTGC GCGCCGACGT 

51 TTTCGATGTC GGTGGCGTAG ATGTCGGCGG TATCGTCCAA ACCGTAAGTA 

101 G CAT AAGGT T TGCTCACTTT GGGCAAAATC GCGCGGACGT GTTCGCTGTC 

151 AATACACAAA AAGGCTTTGC CGTAGAAGGG CATACGGTGG ATGAAATCGA 

201 TAAACGCCTG ATGCAGTTTT TCGACGCTGT GCCCGTAGGT ATCCATATGG 

251 TCTTCGTCGA TATTGGTAAC GACGGACATA ATCGGTGTCA GTGCAGAAAG 

3 01 GATGCATCCG ACCGTCGGCT TCGGCAACGA TGTATTCGCC TTTGCCCAAG 

351 CGGGCGTTAG TGCCTGCGGC GTTGA 

This corresponds to the amino acid sequence <SEQ ED 330; ORF 095>: 

m09 5 .pep 

1 MS FHLNMDGE FHLRADVFDV GGVDVGGIVQ TVSSIRFAHF GQNRADVFAV 

51 NTQKGFAVEG HTVDE I DKR L MQFFDAVPVG IHMVFVD IGN DGHNRCQCRK 

101 DAS DRRLRQR CIRLCPSGRX CLRR* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 095 shows 97.6% identity over a 124 aa overlap with a predicted ORF (ORF 095.ng) 
from TV. gonorrhoeae: 

m095/g095 

10 20 30 40 50 60 

mO 95 . pep MS FHLNMDG E FHLRAD V FDVGGVDVGG I VQTVS SIR F AH FGQNRAD V F AVNTQ KG F AVEG 

I t I I 1 1 1 I M I I r I I [ I 1 1 1 I I M M 1 1 1 1 1 1 1 1 1 M I 1 1 f I 1 1 i ! 1 1 I 1 1 E I I I 1 i I i i 
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g 0 9 5 MS FHLNMDGE FHLRADVFDVGGVDVGG I VQTVS S I RFAH FGQNRADV FAVNTQKGFAVEG 

10 20 30 40 50 60 

70 80 90 100 110 120 

m095 . pep HTVDEIDKRIJ^QFFDAVPVGIHMVFVDIGNDGHNRCQCRKDASDRRLRQRCIRLCPSGRX 

IIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIII I I I i = I I I I I f I I I i t I I I 1 I t t 
g0 9 5 HTVDEIDKRLMQFFDAVPVGIHMVFVDIGNDGHNRGQCRKEASDRRLRQRCIRLCPSGRW 

70 80 90 100 110 120 



m0 95.pep CLRRX 
I I II I 

g095 CLRRX 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 33 1>: 

a095.seq 

1 ATGTCCTTTC ATTTGAACAT GGACGGTGAA TTTCATTTGC GCGCCGACGT 

51 TTTCGATGTC GGTGGCGTAG ATGTCGGCGG TATCGTCCAA ACCGTAAGTA 

101 GCATAAGGTT TGCTCACTTT GGGCAAAATC GCGCGGACGT GTTCGCTGTC 

151 AATACACAAA AAGGCTTTGC CGTAGAAGGG CATACGGTGG ATGAAATCGA 

201 TAAACGCCTG ATGCAGCTTC TCAACACTGT GCCCGTAGGT AT C CAT AT G G 

251 TCTTCGTCGA TATTGGTAAC GACGGACATA ATCGGTGTCA GTGCAGAAAG 

301 GAT G CATC CG ACCGTCGGCT TCGGCAACGA TGTATTCGCC TTTGCCCAAG 

351 CGGGCGTTGG TGCCTGCGGC GTTGA 

This corresponds to the amino acid sequence <SEQ ID 332; ORF 095. a>: 

a0 95.pep 

1 MS FHLNMDGE FHLRADVFDV GGVDVGGIVQ TVSSIRFAHF GQNRADV FAV 

51 NTQKGFAVEG HTVDEIDKRL MQLLNTVPVG IHMVFVDIGN DGHNRCQCRK 

101 DASDRRLRQR CIRLCPSGRW CLRR* 

m095/a095 96.0% identity in 124 aa overlap 

10 20 30 40 50 60 

m095 . pep M S FHLNMDGE FHLRADVFDVGGVDVGG I VQTVS SIRFAHFGQNRADV FAVNTQKGFAVEG 

I I t I I I I I I I I I I ! I I I I I I I I I I I I I I I I I II I II I I I I I I I 1 M I I I I I I I I I I I I M 
a095 MS FHLNMDGE FHLRADVFDVGGVDVGG I VQTVS SIRFAHFGQNRADV FAVNTQKGFAVEG 

10 20 30 40 50 60 

70 80 90 100 110 120 

m095 . pep HTVDEIDKRLMQFFDAVPVGIHMVFVDIGNDGHNRCQCRKDASDRRLRQRCIRLCPSGRX 

I I I I I I I I I I I I :::: I I I t I I I I I I 1 I I I I I I I I I I I I I I I I I I I I M I M I I I I I I I 
a0 95 HTVDEIDKRLMQLLNTVPVGIHMVFVDIGNDGHNRCQCRKDASDRRLRQRCIRLCPSGRW 

70 80 90 100 110 120 

m095.pep CLRRX 
I I I I I 

a095 CLRRX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 

333>: 

g096 . seq 

1 ATGGCCGGTC ATACCGGGCA GGGTGTTGAT TTCCAACAGA TAGAGTTTGC 

51 CGTCGGTATC TTTGAGGAAA TCGACGCGCA CGCAGCCTTC CGCACCGATT 

101 GCCTGTGCGC CGCGAACCGC CAGTTCGCGC ATCAGGCTTT CTTCGGCTTC 

151 GGTCAAATCT TCCGAAGGAC ATTGAtaaat ggtgTCGTCT CGGttgtaCt 
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201 tggcttcgta gTCGTAAAAC TCGGTTGCGG GGATGATGTG TATGCCGGGC 

2 51 AGCCCTTTGC CGTTCAGGAC GGGGCAGGAA TATTCGCCGC CGCCGATAAA 

301 AcgtTcggca atgaTTtcgc ccctgAAGGT GttTCAattc ttcgtaAACG 

351 CTTTTCAGAc ggccttTTTC TTTGA 



This corresponds to the amino acid sequence <SEQ ID 334; ORF 096.ng>: 

g096 .pep 

1 MAGHTGQGVD FQQIEFAVGI FEEIDAHAAF RTDCLCAANR QFAHQAFFGF 

51 GOIFRRT LIN GVVSWLGFV VVKL GCGDDV YAGQPFAVQD GAG I FAAADK 

101 TFGNDFAPEG VSILRKRFSD GLFL* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ED 335>: 



m096 . seq 

1 ATGGCTCGTC ATACCGGGCA 

51 CGTCGGTATC TTTGAGGAAA 

101 GCCTGCGCGC CGCGAACCGC 

151 GGTCAAATCT TCCGAAGGAC 

201 TGGCTTCGTA GTCGTAAAAC 

251 AGCCCTTTGC CGTTCAGGAC 

301 ACGTTCGGCA ATGATTTCGC 

351 CTTTTCAGAC GGCCTTTTCC 



GGGTGTTGAT TTCCAACAGA TAGAGTTTGC 
TCGACGCGCA CGCAGCCTTC CGCACCGATT 
CAGTTCGCGC ATCAGGCTTT CTTCGGCTTC 
ATTGATAAAT GGTGTCGTCG CGGTTGTACT 
TCGGTTGCGG GAATGATGTG TATGCCGGGC 
GGGGCAGGAA TATTCGCCGC CGCCGATAAA 
CC.TGAAGGT GTTTCAATTC TTCGTAAACG 
TTTGA 



This corresponds to the amino acid sequence <SEQ ID 336; ORF 096>: 

m0 9 6 . pep 

1 MARHTGQGVD FQQIEFAVGI FEEIDAHAAF RTDCLRAANR QFAHQAFFGF 

51 GQIFRRT LIN GVVAVVLGFV VVKL GCGNDV YAGQPFAVQD GAG I FAAADK 

101 " TFGNDFAXEG VSILRKRFSD GLFL* 



m096/g096 96.0% identity in 124 aa overlap 



10 20 30 40 50 60 

m096 . pep MARHTGQGVDFQQIEFAVGIFEEIDAHAAFRTDCLRAANRQFAHQAFFGFGQIFRRTLIN 
II I I I 1 I I I I I I M I I I I II 1 I I I I I I I I II I It I I M I I I I I I I I I I I I I I I I I M I 
g 0 96 MAGHT GQGVD FQQIEFAVGI FEE I DAHAAFRTDCLCAANRQFAHQAFFGFGQIFRRT LIN 

10 20 30 40 50 60 



70 80 90 100 110 120 

m0 96.pep G W A WLG FWVKLGCGN DV Y AGQP FAVQDGAG I FAAADKT FGN D FAXEG V S I LRKRFS D 

M i : I M I M II II II I : I I I I I I I I I II I I I I I I I I I I II I I i I I I I I I M I I I I I M 
g 096 GWSWLGFVWKLGCGDDVYAGQPFAVQDGAGI FAAADKT FGNDFAPEGVSILRKRFSD 

70 80 90 100 110 120 



m096.pep GLFLX 
I I I I I 



g096 GLFLX 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 337>: 
a096 . seq 

1 ATGGCCGGTC ATACCGGGCA GGGTGTTGAT TTCCAACAGA TAGAGTTTGC 

51 CGTCGGTATC TTTGAGGAAA TCGACGCGCA CGCAGCCTTC CGCACCGATT 

101 GCCTGCGCGC CGCGAACCGC CAGTTCGCGC ATCAGGCTTT CTTCGGCTTC 

151 GGTCAGATCT TCCGAAGGAC ATTGATAAAT GGTGTCGTTG CGGTTGTACT 

201 TGGCTTCGTA GTCATAAAAC TCGGTCGCGG GGATGATGTG TATGCCGGGC 

2 51 AGGCCTTTGC CGTTCAACAC AGGGCAGGAA TATTCGCCGC CGCCGATAAA 

301 CCGTTCGGCA ATGATTTCGC CCT . GAAAGT GTTTCAATTC TTCGTAAACG 

351 CTTTTCAGAC GGCCTTTTCC TTTGA 

This corresponds to the amino acid sequence <SEQ ED 338; ORF 096.ng>: 
a096.pep 

1 MAGHTGQGVD FQQIEFAVGI FEEIDAHAAF RTDCLRAANR QFAHQAFFGF 
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51 GQI FRRT LIN GWAWLGFV VIKL GRGDDV YAGQAFAVQH RAG I FAAADK 
101 PFGNDFAXES VSILRKRFSD GLFL* 

m096/a096 92.7% identity in 124 aa overlap 



10 20 30 40 50 60 

mO 9 6 . pep MARHTGQGVDFQQIEFAVGI FEEI DAHAAFRTDCLRAANRQFAHQAFFGFGQI FRRTLIN 

II I M I It I I I I I M I II I I I I 1 1 I I I I ! I 1 M 1 I I II ! I M I ! I M I II I I 1 I I I I I I 
a09 6 MAGHTGQGVDFQQIEFAVG I FEE I DAHAAFRTDCLRAANRQFAHQAFFGFGQI FRRTLIN 

10 20 30 40 50 60 



70 80 90 100 110 120 

m096 .pep GVVAVVLGFVVVKLGCGNDVYAGQPFAVQDGAGIFAAADKTFGNDFAXEGVSILRKRFSD 
I I I I M I I II I : i M 1:111111 Mil I I I I I I I II ! M I I I I I : I 1 I I I I I I I I 
a0 9 6 GVVAWLGFVV I KLGRGDDVYAGQAFAVQHRAG I FAAADK PFGNDFAXES VSILRKRFSD 

70 80 90 100 110 120 



m09 6.pep GLFLX 
I I I I I 



a0 9 6 GLFLX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 339>: 

g097 . seq 

1 ATGGATATTT CAAAACAAAC ATTGCTGGAT AGGGTTTTTA AC CTGAAGG C 

51 AAACGGTACG ACGGTACGTA CCGAGTTGAT GGCGGGTTTG ACGACCTTTT 

101 TGACGATGTG CTACAT CGTT ATCGTCAATC CCCTGATTTT GGGCGAGACC 

151 GGAATGGATA TGGGGGC GGT ATTCGTCGCT ACCTGTATCG CATCCGCCAT 

2 01 CGGCTGTTTT GTCATGGGTT TTATCGGCAA CTATCCGATT GCGCTTGCCC 

2 51 CGGGGATGGG GCTGAATGCC TATTTCACCT TTGCCGTCGT TAAGGGTATG 
301 GGCGTGCCTT GGCAGGTGGC GTTGGGTGCG GTGTTCATTT CCGGTCTGAT 

3 51 TTTCATCCTG TTC AG CTTTT TTAAAGTCAG GGAAATGCTG GTCAACGCAC 

4 01 TGCCTATGGG TTTGAAAATG TCGATTGCCG CCGGTATCGG TTTGTTTTTG 

4 51 GCACTGATTT CCCTGAAAGG CGCAGGCATT ATCGTTGCCA AT CCGGC AAC 

5 01 CTTGGTCGGC TTGGGCGATA TTCATCAGCC C AG CGCACTG TTGGCATTGT 

5 51 TCGGTTTTGT CATGGTGGTC GTATTGGGGT ATTTCCGCGT TCAAGGCGCA 

6 01 ATCAT CAT C A CCATTCTGAC GATTACCGTC ATTGCCAGCC TGATGGGTTT 
651 GAACGAGTTT CACGGCGTGG TCGGCGAAGT ACCGGGCATT GCGCCGACCT 
701 TTATGCAGAT GGATTTTAAA GGTCTGTTTA CCGTCAGCAT GGTCAGCGTG 
751 ATTTTCGTCT TCTTCTTGGT CGATTTGTTC GACAGTACCG GAACGCTGGT 
801 CGGCGTATCC CACCGTGCCG GACTGCTGGT GGACGGTAAG CTGCCCCGCC 
851 TGAAACGCGC ACTGCTTGCA GACTCTACCG C CAT TGTGG C AGGTGCGGCT 
901 TTGGGTACTT CTTCAACCAC GCCTTATGTG GAAAGCGCGG CGGGCGTATC 
951 GG CAGGCGG A CGGACCGGCC TGACGGCGGT TACCGTCGGC GTATTGATGC 

1001 TGGCGTG TCT GATGTTCTCC CCATTGGCGA AAAGTGTTCC GGTATTTGCC 

1051 ACCGCGCCCG CACTGCTTTA TGTCGGCACG CAGATGCTCC GCAGTGCGAG 

1101 GGACATTGAT TGGGACGATA TGACTGAAGC CGCGCCCGCG TTCCTGACCA 

1151 TTGTCTTCAT GCCGTTTACC TATTCGATTG CAGACGGCAT CGCCTTCGGC 

1201 TT CAT CAG CT ATGCCGTGGT CAAACTTTTG TGTCGCCGGA CTGGGGACGT 

12 51 GCCGCCTATG GTATGGGTTG TTGCCGTATT GTGGGCATTG AAATTCTGGT 

13 01 ATTTGGGCTG A 

This corresponds to the amino acid sequence <SEQ ID 340; ORF 097.ng>: 

g097 .pep 

1 MDISKQTLLD RVFNLKANGT TVRTE LMAGL TTFLTMCYIV IVN PLILGET 

51 GMDMG AV FVA TCIASAIGCF VM GFIGNYPI ALAPGMGLNA YFTFAWKGM 

101 GVPWQ VALGA VFISGLIFIL FS FFKVREML VNALPMGLKM SIAAGIGLFL 

151 ALISLKGAGI IV ANPATLVG LGDIHQPS AL LALFGFVMVV VLGYF RVQGA 

201 IIITILTITV IASLMGLN EF HGWGEVPGI APTFMQMDFK GLFTVSMVSV 

2 51 IFVFFLV DLF DSTGTLVGVS HRAGLLVDGK LPRLKRALLA DSTAIVAGAA 
301 LGTSSTTPYV ESAAGVSAGG RT GLTAVTVG VLMIACLMF S PLAKSVPVFA 

3 51 TAPALLYVGT QMLRSARDID WDDMTEAAPA FLTIVFMPFT YSIADGIAFG 
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4 01 FISYAWKLL CRRTGDVPPM VWWAVLWAL KFWYLG * 

The following partial DNA sequence was identified in TV. meningitidis <SEQ ID 34 1>: 

m097.seq 

1 ATGGACACTT CAAAACAAAC ACTGTTGGAC 

51 AAACGGTACk ACGGTGCGTA CCGAGTTGAT 

101 TGACGATGTG CTACATCGTT ATCGTCAACC 

151 GGCATGGATA TGGGGG CGGT ATTCGTCGCT 

201 CGGCTGTTTT GTTATGGGTT TTGTCGGCAA 

251 CGGGGATGGG GCTGAATGCC TATTTCACCT 

301 GGCGTGCCTT GGCAGGTTGC GTTGGGTGCG 

3 51 TTTTATCCTG TTCAGCTTTT TTAAAGTCAG 

4 01 TGCCTATGGG TTTGAAAATG TCGATTGCTG 
4 51 GCACTGATTT CCCTGAAAGG CGCAGGCATT 
501 CTTGGTCGGT TTGGGCGATA TTCATCAGCC 
551 TCGGTTTTGC TATGGTGGTC GTATTGGGAC 
601 AT CAT CAT CA CCATCTTGAC CATTACCGTC 
651 GAATGAATTT CACGGCATCA TCGGCGAAGT 
701 TTATGCAGAT GGATTT TGAA GGCCTGTTTA 
751 ATTTTCGTCT TCTTCTTGGT CGATCTATTT 
801 CGGCATATCC CACCGTGCCG GGCTGCTGGT 
851 TGAAACGCGC ACTGCTTGCA GACTCTACCG 
901 TTGGGTACTT CTTCCACCAC GCCTTATGTG 
951 GGCAGGCGGA CGGACCGGCC TGACGGCGGT 

1001 TCGCCTGCCT GATGTTTTCA CCTTTGGCGA 

1051 ACCGCGCCCG CCCTGCTTTA TGTCGGCACG 

1101 GGATATTGAT TGGGACGATA TGACGGAAGC 

1151 TTGTTTTCAT GCCGTTTACT TATTCGATTG 

1201 T T CAT CAGTT ATGCCGTGGT TAAACTTTTA 

1251 TCCGCCTATG GTATGGATTG TTGCCGTATT 

1301 ATTTGGGCTG A 

This corresponds to the amino acid sequence <SEQ ID 342; ORF 097>: 

m097 .pep 

1 MDTSKQTLLD GIFKLKANGT TVRTELMAGL TTFLTMCY1V IVN PXILGET 

51 GMDMG AVFVA TCIASAIGCF VM GFVGNYPI ALAPGMGLNA YFTFAWKGM 

101 GVPWQ VALGA VFISGLIFIL FS FFKVREML VNALPMGLKM SIAAG IGLFL 

151 ALISLKGAGI IVA NPATLVG LGDIHQPS AL LALFGFAMW VLGHF RVQGA 

201 IIITILTITV I AS LMGL NE F HGIIGEVPSI APTFMQMDFE GLFTVSMVSV 

251 I FVFFLVD LF DSTGTLVGIS HRAGLLVDGK LPRLKRALLA DSTAIVAGAA 

301 LGTSSTTPYV ESAAGVSAGG RT GLTAVTVG VLMLACLMF S PLAKSVPAFA 

3 51 TAPALLYVGT QMLRSARDID WDDMTEAAPA FLTIVFMPFT YSIADGIAFG 

401 FISYAWKLL CRRTKDVPPM VWIVAVLWAL KFWYLG * 

Computer analysis of this amino acid sequence gave the following results: 

Homology with a predicted ORF from TV. gonorrhoeae 

ORF 097 shows 96.3% identity over a 436 aa overlap with a predicted ORF (ORF 097.ng) 
from N. gonorrhoeae: 

m097/g097 

m097 . pep 
g097 



m097 . pep 
9097 

70 80 90 100 110 120 

130 140 150 160 170 180 



GGGATTTTTA 
GGCGGGTTTG 
CTCyGATTTT 
ACCTGTATCG 
CTATCCGATT 
TTGCCGTCGT 
GTGTTCATCT 
GGAAATGCTG 
CCGGTATCGG 
ATCGTTGCCA 
GTCCGCGTTG 
ATTTCCGCGT 
ATTGCCAGCC 
ACCGAGCATT 
CCGTCAGCAT 
GACAGTACCG 
GGACGGTAAG 
CCATTGT GGC 
GAAAGCGCGG 
TACCGTCGGC 
AAAGTGTTCC 
CAGATGCTCC 
CGCACCTGCG 
CAGACGGCAT 
TGCCGCCGCA 
GTGGGCACTG 



AGCTGAAGGC 
ACAACTTTTT 
GGGCGAGACC 
CGTCTGCCAT 
GCACTCGCAC 
TAAGGGTATG 
CCGGTCTGAT 
GTCAACGCAC 
TTTGTTTTTG 
ATCCGGCAAC 
TTGGCATTGT 
TCAAGGCGCA 
TGATGGGTTT 
GCGCCGACTT 
GGTCAGTGTG 
GAACGCTGGT 
CTGCCCCGCC 
AGGTGCGGCT 
CGGGCGTATC 
GTATTGATGC 
CGCTTTTGCC 
GCAGTGCGAG 
TTCCTGACCA 
CGCTTTCGGC 
C CAAAGACGT 
AAATTCTGGT 



10 20 30 40 50 60 

^TSKQTLLDGIFKLKANGTTVRTELMAGLTTFLTMCYIVIVNPXILGETGMDMGAVFVA 

II I j 1 1 i 1 1 MM 1 1 I MM II III I II Ml 1 1 II MM 1 1 1 M M 11 i 1 1 1 1 I 1 1 1 

MDIS KQTLLDRVFNLKANGTTVRTELMAGLTT FLTMC Y I VI VNPL I LGETGMDMGAVFVA 
10 20 30 40 50 60 

70 80 90 100 110 120 

TC I ASAI GCFVMGFVGNY P I ALAPGMGLNAY FTFAWKGMGVPWQVALGAVFI SGLI FI L 

i i 1 1 i 1 1 1 1 1 1 1 1 ] = I i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 i 1 1 1 1 1 

TCIASAIGCFVMGFIGNYPIALAPGMGLNAYFTFAWKGMGVPWQVALGAVFISGLIFIL 
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m097 -pep FS FFKVREMLVNALPMGLKMS I AAGIGLFLALI SLKGAGI I VANPATLVGLGDIHQPSAL 

IIIIIIIIIIIIIIIIIIIIIIIIMIIIIIMIIIIMIIIIMIMIIIIIIMIIM 

gO 97 FSFFKVREMLVNALPMGLKMSIAAGIGLFLALISLKGAGI I VANPATLVGLGDIHQPSAL 

130 140 150 160 170 180 



190 2 00 210 220 230 24 0 

m0 97 .pep LALFGFAMVWLGHFRVQGAI I ITILTITVIASLMGLNEFHGI IGEVPSI APTFMQMDFE 

1 I j i I M I I I I 1 I U I M I - I I I I I j I I I i 1 I I i 1 1 : : ! ! I I : i ! I I I I I I I 1 ^ 

gO 9 7 LALFGFVMVWLGYFRVQGAI 1 1 TILTITVI ASLMGLNEFHGWGEVPGI APTFMQMDFK 

190 200 210 220 230 240 



250 260 270 280 290 300 

m097 . pep GLFTVSMVSVI FVFFLVDLFDSTGTLVGI SHRAGLLVDGKLPRLKRALLADSTAI VAGAA 

1 1 ! I I 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 E 1 1 fi 1 1 = M I I ! I I E i 1 1 1 1 E 1 1 1 1 1 1 1 1 II I ! I I I 1 1 

g097 GLFTVSMVSVI FVFFLVDLFDSTGTLVGVSHRAGLLVDGKLPRLKRALLADSTAI VAGAA 

250 260 270 280 290 300 



310 320 330 340 350 360 

mO 9 7 . pep LGTSSTTP WESAAGVSAGGRTGLTAVTVGVLMLACLMFS P LAICS VPAFATAP ALL YVGT 

IIIIIIIIIIIIIIIIIIIIIMIIIIMIMMIIIIIMIIIIIhllllllllllll 

g0 97 LGT S S TT P YVE S AAG VS AGGRTG LT AVTVG VLMLAC LM F S P LAKS V P V F AT AP AL L YVG T 

310 320 330 340 350 360 



370 380 390 400 410 420 

m097 . pep QMLRS ARD I DWDDMTE AAPAFLT I VFMP FTYS I ADGI AFGFI S YAWKLLCRRTKDVP PM 

MIMIMIIIIillllllllMIMIIIIIIIMIIIIIIMIIMMMIII Mill 

g097 QMLRS ARD I DWDDMTE AAPAFLT I VFMP FTYS I ADGI AFGFI S YAW KLLCRRTGDV PPM 

370 380 390 400 410 420 



430 

m097 . pep VWIVAVLWALKFWYLGX 
I I : I I I! I I E I I I I I M 
g0 97 VWWAVLWALKFWYLGX 

430 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 343> 
a097 . seq 

1 ATGGACACTT CAAAACAAAC ACTGTTGGAC GGGATTTTTA AGCTGAAGGC 

51 AAACGGTACG ACGGTGCGTA CCGAGTTGAT GGCGGGTTTG ACAACTTTTT 

101 TGACGATGTG CTACATCGTT ATCGTCAACC CTCTGATTTT GGGCGAGACC 

151 GGCATGGATA TGGGGGCGGT ATTCGTCGCT ACCTGTATCG CGTCTGCCAT 

201 CGGCTGTTTT GTTATGGGTT TTGTCGGCAA CTATCCGATT GCACTCGCAC 

251 CGGGGATGGG GCTGAATGCC TATTTCACCT TTGCCGTCGT TAAGGGTATG 

301 GGCGTGCCTT GGCAGGTTGC GTTGGGTGCG GTGTTCATCT CCGGTCTGAT 

351 TTTCATCCTG TTCAGCTTTT TTAAAGTCAG GGAAATGCTG GTCAACGCAC 

4 01 TGCCTATGGG TTTGAAAATG TCGATTGCTG CCGGTATCGG TTTGTTTTTG 

4 51 GCACTGATTT CCCTGAAAGG CGCAGGCATT ATCGTTGCCA ATCCGGCAAC 

501 CTTGGTCGGC TTGGGCGATA TTCATCAGCC GTCCGCGTTG TTGGCACTGT 

551 TCGGTTTTGC CATGGTGGTC GTATTGGGAC ATTTCCGCGT TCAAGGCGCA 

601 AT CAT CATC A CCATTTTGAC GATTACCGTC ATTGCCAGCC TGATGGGTTT 

651 GAACGAATTT CACGGCATCA TCGGCGAAGT GCCGAGCATT GCGCCGACTT 

701 TTATGCAGAT GGATTTTAAA GGGTTGTTTA CCGTCAGCAT GGTCAGCGTG 

7 51 ATTTTCGTCT TTTTCCTAGT CGATCTGTTC GACAGTACCG GAACACTGGT 

801 CGGTGTATCG CATCGTGCCG GACTGCTGGT GGACGGTAAG CTGCCCCGCC 

851 TGAAACGCGC ACTGCTTGCA GACTCTACCG CTATTGTGGC AGGTGCGGCT 

901 TTGGGTACTT CTTCAACCAC GCCTTATGTG GAAAGTGCGG CGGGCGTATC 

951 GGCAGGCGGG CGGACAGGTC TGACGGCGGT TACCGTCGGC GTATTGATGC 

1001 TCGCCTGCCT GATGTTTTCA CCTTTGGCGA AAAGTGTTCC CGCTTTTGCC 

1051 ACCGCGCCCG CCCTGCTTTA TGTCGGCACG CAGATGCTCC GCAGTGCGAG 

1101 GGACATCGAT TGGGACGATA TGACGGAAGC CGCACCCGCA TTCCTGACCA 

1151 TTGTCTTCAT GCCGTTTACC TATTCGATTG CAGACGGCAT CGCTTTCGGC 

1201 TTCATCAGTT ATGCCGTGGT TAAACTTTTA TGCCGCCGCA CCAAAGACGT 

1251 TCCGCCTATG GTATGGATTG TTGCCGTATT GTGGGCACTG AAATTCTGGT 
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1301 ATTTGGGCTG A 

This corresponds to the amino acid sequence <SEQ ID 344; ORF 097. a>: 

a0?7.pep 

1 MDTSKQTLLD GIFKLKANGT TVRTE LMAGL TTFLTMCYIV IV NPLILGET 

51 GMDMG AVFVA TCIASAIGCF VM GFVGNYPI ALAPGMGLNA YFTFAWKGM 

101 GVPWQ VALGA VFISGLIFIL FS FFKVREML VNALPMGLKM SIAAGIGLFL 

151 ALISLKGAGI IV ANPATLVG LGPIHQPS AL LALFGFAMVV VLGHF RVQGA 

201 IIITILTITV IASLMGL NEF HGIIGEVPSI APTFMQMDFK GLFTVSMVSV 

251 IFVFFLV DLF DSTGTLVGVS HRAGLLVDGK LPRLKRALLA DSTAIVAGAA 

301 LGTSSTTPYV ESAAGVSAGG RT GLTAVTVG VLMLACLMF S PLAKSVPAFA 

351 TAPALLYVGT QMLRSARDID WDDMTEAAPA FLTIVFMPFT YSIADGIAFG 

4 01 FISYAWKLL CRRTKDVPPM VWIVAVLWAL KFWYLG* 



m097/a097 99.3% identity in 436 aa overlap 

10 20 30 40 50 60 

m097 .pep M DTSKQTLLDG I FKLKANGTTVRTELMAGLTTFLTMCY I VI VNPXILGET GMDMGAVFVA 

I II I I I I I I I I I i I I I I M I M I I I I I 1 I I I I I I I I 1 I I I I I I I I I M I I I I I I I f I I I 
a097 MDTSKQTLLDGIFKLKANGTTVRTELMAGLTTFLTMCYIVIVNPLILGETGMDMGAVFVA 

10 20 30 40 50 60 

70 80 90 100 110 120 

m097 . pep TCIASAIGCFVMGFVGNYPIALAPGMGLNAYFTFAWKGMGVPWQVALGAVFISGLIFIL 
I I I I I M I t I I 1 I M I I I I I I I I I I M I I I I I I 1 I I I I I I I I I M I I M I! I I I I I M I I 
a097 TCIASAIGCFVMGFVGNYPIALAPGMGLNAY FTFAWKGMGVPWQVALGAVFISGLIFIL 

70 80 90 100 110 120 

130 140 150 160 170 180 

m097 . pep FSFFKVREMLVNALPMGLKMSIAAGIGLFLALISLKGAGIIVANPATLVGLGDIHQPSAL 
I I I M I I I II I I I M I I I I I I 1 I I I I I I I I I I I I I I I I 1 I I I I ! I I I M I I I I I I II 1 I I 
a 0 97 FSFFKVREMLVNALPMGLKMSIAAGIGLFLALISLKGAGIIVAN PATLVGLGDIHQPSAL 

130 140 150 160 170 180 

190 200 210 220 230 240 

m097 .pep LALFGFAMWVLGHFRVQGAIIITILTITVIASLMGLNEFHGIIGEVPSIAPTFMQMDFE 
I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I J I I I I M I I I I I I I I I I I I I I I I I I I M : 
a097 LALFGFAMVWLGHFRVQGAI I ITILTITVIASLMGLNEFHGIIGEVP SI APTFMQMDFK 

190 200 210 220 230 240 

250 260 270 280 290 300 

m097 . pep GLFTVSMVSVI FVFFLV DLFDSTGTLVG I SHRAGLLVDGKLPRLKRALLADSTAIVAGAA 

I I I II I It I I I I I I I I I I I I I I I I I ! I I : I I t I I I I I I I I I I I I I I I I I I I I I I I I I I M 
a097 GLFTVSMVSVI FVFFLVDLFDSTGTLVGVSHRAGLLVDGKLPRLKRALLADSTAIVAGAA 

250 260 270 280 290 300 

310 320 330 340 350 360 

m097 . pep LGTSSTTPYVESAAGVSAGGRTGLTAVTVGVLMLACLMFSPLAKSVPAFATAPALLYVGT 
I I I I I I I I! I I I I I I I I I I I II I I I I M I I I I I I I I I M I M I I I I I I I I I ! I I I I I I I I 
a097 LGTSSTTPYVESAAGVSAGGRTGLTAVTVGVLMLACLMFSPLAKSVPAFATAPALLYVGT 

310 320 330 340 350 360 

370 380 390 400 410 420 

m097 . pep QMLRSARDIDWDDMTEAAPAFLTIVFMPFTYSIADGIAFGFISYAWKLLCRRTKDVPPM 
I I i I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I II 
a0 97 QMLRSARDIDWDDMTEAAPAFLTIVFMPFTYSIADGIAFGFISYAWKLLCRRTKDVPPM 

370 380 390 400 410 420 

430 

m097 . pep VWIVAVLWALKFWYLGX 
1 I I I M I I I I I I I I I I I 
a0 97 VWIVAVLWALKFWYLGX 

430 
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The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 345>: 

g098.seq 

1 ATGACCGCCG ACGGTCTCTT CGTCGCTTTC AACTTCAATA CGTTTGCCGT 

51 TGTGCGAATA TTGAT AC C AG TACAGCAGGA TGCTGCCCAG GCTGGCGATC 

101 AGTTTGTCGG CGATGTCGCG CGCTTCGCTG TCGGGATGGC TTTCGCGTTC 

151 GGGATGAACG CAGCCGAGCA TGGACACGCC GGTACGCATC ACGTCCATCG 

201 GATGGGTATG TGCAGGCAGG CTTTCCAAAA CTTTAATCAC ACGGATAGGC 

2 51 AGGCCGCG C A TGGATTTGAG CTTGGTTTTA TAAGCGGCCA GCTCGAATTT 
301 GTTGGGCAGA TGGCCGTGAA TCAGCAAGTG TGCGACTTCT TCAAACTCGC 

3 51 ATTTTTGTGC CAAATTAGAA TGTCGTAA 

This corresponds to the amino acid sequence <SEQ ID 346; ORF 098. ng>: 

g098.pep 

1 MTADGLFVAF NFNTFAWRI LIPVQQDAAQ AGDQFVGDVA RFAVGMAFAF 

51 GMNAAEHGHA GTHHVHRMGM CRQAFQNFNH TDRQAAHGFE LGFISGQLEF 

101 VGQMAVNQQV CDFFKLAFLC QIRMS* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 347>: 

m098 . seq 

1 ATGACCGCCG ATGGTCTCTT CGTCGCTTTC AACCTCAATG CGTTTGCCGT 

51 TGTGCGAATA TTGATACCAG TACAAGAGGA TGCTGCCGAG GCTGGCGATC 

101 AGTTTGTCGG CGATGTCGCG CGCTTCACTT TCCGGATGGC TTTCACGTTC 

151 AGGATGAACG CAGCCGAGCA TGGATACGCC GGTACGCATT ACGTCCATCG 

2 01 GATGGGTATG TGCAGGCAGG CTTTCCAAAA CTTTAATCAC ACGGATAGGC 

2 51 AGGCCGCGCA TGGATTTGAG CTTGGTTTTA TAAGCGGCCA GCTCGAATTT 

3 01 GTTGGGCAGA TGGCCGTGAA TCAGCAGGTG GGCGACTTCT TCAAACTCGC 
3 51 ATTTTTGTGC CAAATCAGAA TGTCGTAA 

This corresponds to the amino acid sequence <SEQ ID 348; ORF 098>: 

m098 -pep 

1 MTADGLFVAF NLNAFAWRI LIPVQEDAAE AGDQFVGDVA RFTFRMAFTF 

51 RMNAAQHGYA GTHYVHRMGM CRQAFQNFNH TDRQAAHGFE LGFISGQLEF 

101 VGQMAVNQQV GDFFKLAFLC QIRMS* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 098 shows 89.6% identity over a 125 aa overlap with a predicted ORF (ORF 098.ng) 
from N. gonorrhoeae: 

m098/g098 

10 20 30 40 50 60 

MTADGLFVAFNLNAFAWRI LI PVQEDAAEAGDQFVGDVARFTFRMAFTFRMNAAQHGYA 

IIIIIMIIIhhllMIMMIhllhllllllllllll: 111 = 1 1111-11 = 1 

MTADGL FV AFN FNT F AWR I L I PVQQDAAQAGDQFVGDVARFAVGMAFAFGMNAAEHGHA 
10 20 30 40 50 60 

70 80 90 100 110 120 

GTHYVHRMGMCRQAFQNFNHTDRQAAHGFELGFISGQLEFVGQMAVNQQVGDFFKLAFLC 

MhlllllMIIIMIIIIIMIIIIIIIIIIIIIIIIMMMMIII MINIMI 

GTHHVHRMGMCRQAFQNFNHTDRQAAHG FE LG F I S GQLE FVGQMAVNQQVCDF F KLAFLC 
70 80 90 100 110 120 



m098 .pep 
g098 

m098 . pep 
g098 



m098.pep QIRMSX 

MIMI 

g098 QIRMSX 

The following partial DNA sequence was identified in TV. meningitidis <SEQ ID 349> : 

a098 . seq 

1 ATGACCGCCG ATGGTCTCTT CGTCGCTTTC AACCTCAATG CGTTTGCCGT 
51 TGTGCGAATA TTGATACCAG TACAAGAGGA TGCTGCCGAG GCTGGCGATC 
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101 AGTTTGTCGG CGATGTCGCG CGCTTCACTT TCCGGATGGC TTTCACGTTC 

151 AGGATGAACG CAGCCCAGCA TGGATACGCC GGTACGCATT ACGTCCATCG 

2 01 GATGGGTATG TGCAGGCAGG CTTTCCAAAA CTTTAATCAC ACGGATAGGC 

2 51 AGGCCGCGCA TGGATTTGAG CTTGGTTTTA TAAGCGGCCA GCTCGAATTT 

301 GTTGGGCAGA TGGCCGTGAA TCAGCAGGTG GGCGACTTCT TCAAACTCGC 

351 ATTTTTGTGC CAAATCAGAA TGTCGTAA 

This corresponds to the amino acid sequence <SEQ ID 350; ORF 098. a>: 
a098.pep 

1 MTADGLFVAF NLNAFA WRI LIPVQEDAAE AG DQFVGDVA RFTFRMAFTF 

51 RMNAAQHGYA GTHYVHRMGM CRQAFQNFNH TDRQAAHGFE LGFISGQLEF 

101 VGQMAVNQQV GDFFKLAFLC QIRMS* 



m098/a098 



100.0% identity in 125 aa overlap 



10 20 30 40 50 60 

m098 . pep MTADGLFVAFNLNAFAWRILIPVQEDAAEAGDQFVGDVARFTFRMAFTFRMNAAQHGYA 
I M I I I 1 I M I I I I I I M I I I I I I I I I I M I I I II M I I II I I II I I I I I 1 I I I I I I I I I 
a098 MTADGLFVAFNLNAFAWRI LIPVQEDAAE AG DQFVGDVARFT FRMAFTFRMNAAQHGYA 

10 20 30 40 50 60 

70 80 90 100 110 120 

m098 . pep GTHYVHRMGMCRQAFQNFNHTDRQAAHGFELGFISGQLEFVGQMAVNQQVGDFFKLAFLC 
I I I I F I I I 1 I I I M I I II I I I I I I I t I I I I I I I I I I I I I M I I I M I 1 I I I I I I I I I I M 
a098 GTHYVHRMGMCRQAFQNFNHTDRQAAHGFELGFISGQLEFVGQMAVNQQVGDFFKLAFLC 

70 80 90 100 110 120 



m098 .pep 



a098 



QIRMSX 
I I I I I I 
QIRMSX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ED 35 1>: 

g099 . seq 

1 ATGCTGGGAC GCGCGTCCAT GATGCGCCTG CCCGATATTG TCGGCGTGGA 

51 GCTGACGGGC AAACGGCAGG CGGGCATTAC TGCCACAGAC ATCGTGTTGG 

101 CACTGACCGA ATTCTTGCGT AAAGAGCGCG TGGTCGGGGC GTTTGTCGAA 

151 TTTTTCGGCG AGGGCGCGAG AAGCCTGTCT ATCGGCGACC GCGCGACCAT 

201 TTCCAACATG ACGCCGGAGT TCGGCGCGAC TGCCGCCATG TTCGCCATCG 

251 ACGCGCAAAC TATTGATTAT TTGAAACTGA CCGGACGTGA CGACGCGCAG 

3 01 GTGAAATTGG TGGAAAC CTA CGCCAAAACC G CAGGCTT AT GGGCAGGTGG 

3 51 CTTGAAAACC GCCGTTTATC CGCGCGTTTT GAAATTTGAT TTGAGCAGCG 

4 01 TAACGCGCAA TATGGCAGGC CCGAGCAACC CGCACGCGCG TTTTGCCACC 
4 51 GCCGATTTGG CGG CGAAAGG GCTGGCGAAG CCTTACGAAG AGCCTTCAGA 
501 CGGCCAAATG CCTGACGGTG CAGTGATTAT TGCCGCGATT ACTTCGTGTA 
551 CCAATACTTC CAACCCGCGC AACGTTGTCG CCGCCGCACT GTTGGCACGC 
601 AATGCCAACC GCCTCGGCTT GAAACGCAAA CCTTGGGTGA AATCTTCGTT 
651 TGCCCCGGGT T C AAAAGT AG CCGGAATCTA TTTGAAAGAA GCAGGCTTGT 
701 TGCCCGAAAT GGAAAAACTC GGCTTCGGTA TCGTCGCCTT CGCATGTACC 
751 ACCTGTAACG GCATGAgcgG CGCGCTcgaC CCGAAAATCC AACAAGAAAT 
801 CATCGACCGC GAtttgtacg cCACCGCCGT ATTGTCAGGC AACCGCAACT 
851 TCGACGGCCG TATCCATCCG TATGCGAAAC AGGCTTTCCT CGCTTCGCCT 
901 CCTTTGGTCG TTGCCTACGC ATTGGCAGGT AGCATCCGTT TCGATATTGA 
951 AAACGACGTA CTCGGCGTTG CAGACGGCCG CGAAATCCGC CTGAAAGATA 

1001 TCTGGCCGAC AGACGAAGAA ATCGATGCCA TCGTTGCCGA ATATGTGAAA 

1051 CCGCAACAAT TCCGCGACAT TTATATCCCG ATGTCCGACA CCGGCACAGC 

1101 G C AAAAAGC A CCAAGCCCGC TGTACGACTG GCGACCGATG TCCACCTACA 

1151 TCCGCCGTCC GCCCTATTGG GAAGGCGCAC TGGCAGGGGA ACGTACATTA 

12 01 AGAGGTATGC GTCCGCCGGC GATTTTGCCC GACAACATCA CCACCGACCA 

12 51 CATCTCgcca tCCAATGCGA TTTTGGCCGG cagTGCcgca ggtgaATATT 
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13 01 TGGCGAAAAT GGGTTTGCCT GAAGAagaCT TCAACTCTTA CGCAACCCAC 

13 51 CGCGGCGACC ACTTGAC CGC CCAACGCGCA ACCTTCGCCA ATCCGAAACT 

14 01 GTTTAACGAA ATGGTGAGAA ACGAAGACGG CAGCGTACGC CAAGGTtcgt 
14 51 tggcacgcgT tgaacCAGAA GGCCAAACCA TGCGCATGTG GGAAGCCATC 
1501 GAAACCTATA TGAACCGCAA ACAGCCGCTT AT CAT CAT TG CCGGTGCGGA 
1551 CTATGGTCAA GGCTCAAGCC GCGACTGGGC GGCGAAGGGC GTGCGGCTGG 
1601 CGGGTGTGGA AGCCATCGCC GCCGAAGGTT TCGAGCGCAT CCACCGCACC 
16 51 AACCTCATCG GCATGGGCGT CTTGCCGCTG CAATTCAAAC CCGGCACCAA 
1701 CCGCCATACC CTGCAACTGG ACGGTACGGA AACCTACGAC GTTGTCGGCG 
1751 AACGCACACC GCGCTGCGGC CTGACCCTCG TGATTCACCG TAAAAACGGA 
1801 GAAACCGTCG AAGTTCCGGT TACCTGCCGC CCCGATACCG CAGAAGAAGC 
1851 ATTGGTATAT GAAGCCGGCG GCGTATTGCA ACGGTTTGCA CAGGACTTTT 
1901 TGGAAGGGAA CGCGGCTTAG 

This corresponds to the amino acid sequence <SEQ ID 352; ORF 099.ng>: 

g099 -pep 

1 MLiGRASMMRL PDIVGVELTG KRQAG I TATD IVLALTEFLR KERWGAFVE 

51 FFGEGARSLS IGDRATISNM TPEFGATAAM FAIDAQTIDY LKLTGRDDAQ 

101 VKLVETYAKT AGLWAGGLKT AVYPRVLKFD LSSVTRNMAG PSNPHARFAT 

151 ADLAAKGLAK PYEEPSDGQM PDGAVIIAAI TSCTNTSNPR NWAAALLAR 

201 NANRLGLKRK PWVKSSFAPG SKVAGIYLKE AGLLPEMEKL G FG I VAF ACT 

251 TCNGMSGALD PKIQQEIIDR DLYATAVLSG NRNFDGRIHP YAKQA FIASP 

3 01 P L W A YALAG SI RFDIENDV LGVADGREIR LKDIWPTDEE IDAIVAEYVK 

3 51 PQQFRDIYIP MSDTGTAQKA PSPLYDWRPM STYIRRPPYW EGALAGERTL 
401 RGMRPPAILP DNITTDHISP SNAILAGSAA GEYLAKMGLP EEDFNSYATH 
451 RGDHLTAQRA TFANPKLFNE MVRNEDGSVR QGSLARVEPE GQTMRMWEAI 
5 01 ETYMNRKQPL I I I AGADYGQ GSSRDWAAKG VRLAGVEAIA AEGFERIHRT 
551 NLIGMGVLPL QFKPGTNRHT LQLDGTETYD WGERTPRCG LTLVIHRKNG 
601 ETVEVPVTCR PDTAEEALVY EAGGVLQRFA QDFLEGNAA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 353>: 

m099 . seq 

1 ATG CTGGG AC GCG CGTCCAT GATGCGCCTG CCCGATATTG TCGGCGTTGA 

51 GCTGAACGGC AAACGGCAGG CGGGCATTAC GGCGACGGAT ATTGTGTTGG 

101 CACTGACCGA GTTTCTGCGC AAAGAACGCG TGGTCGGGGC GTTTGTCGAA 

151 TTCTTCGGCG AGGGCG CGAG AAGCCTGTCT ATCGGCGACC GCGCGACCAT 

2 01 TTCCAACATG ACGCCGGAGT TCGGCGCGAC TGCCGCGATG TTCGCTATTG 

251 ATGAGCAAAC CATTGATTAT TTGAAACTGA CCGGACGCGA CGACGCGCAG 

301 GTGAAATTGG TGGAAACCTA CGCCAAAACC GCAGGCTTGT GGGCAGATGC 

351 CTTGAAAACC GCCGTTTAT C CTCGCGTTTT GAAATTTGAT TTGAGCAGCG 

401 TAACGCGCAA TATGGCAGGC CCAAGTAACC CGCATGCCCG TTTTGCGACC 

4 51 GCCGATTTGG CGGCGAAAGG GCTGGCGAAG CCTTACGAAG AGCCTTCGGA 
501 CGGCCAAATG CCCGACGGCT CGGTCATCAT CGCCGCGATT ACCAGTTGCA 
551 CCAACACTTC CAACCCGCGC AACGTTGT TG CCGCCGCGCT CTTGGCACGC 
601 AATGCCAACC GTCTCGGCTT GAAACG CAAA CCTTGGGTGA AATCTTCGTT 
651 TGCCCCGGGT T CAAAAGT AG CCGAAATCTA TTTGAAAGAA GCGGGCCTGT 
701 TGCCCGAAAT GGAAAAACTC GGCTTCGGTA TCGTCGCCTT CGCCTGCACC 
751 ACCTGCAACG GCATGAGTGG CGCGCTGGAT CCGAAAATCC AGAAAGAAAT 
801 CATCGACCGC GATTTGTACG CCACCGCCGT ATTATCAGGC AACCGCAACT 
851 TCGACGGCCG TATCCACCCG TATGCGAAAC AGGCTTTCCT CGCTTCGCCT 
901 CCGTTGGTCG TTGCCTACGC GCTGGCAGGC AGTATCCGTT T CG AT AT TG A 
951 AAACGACGTA CTCGGCGTTG CAGACGGCAA GGAAATCCGC CTGAAAGACA 

1001 TTTGGCCTGC CGATGAAGAA ATCGATGCCG TCGTTGCCGA ATATGTGAAA 

1051 CCGCAGCAGT TCCGCGATGT GTATGTACCG ATGTT CG AC A CCGGCACAGC 

1101 GCAAAAAGCA CCCAGTCCGC TGTACGATTG GCGTCCGATG TCCACCTACA 

1151 TCCGCCGTCC GCCTTACTGG GAAGGCGCGC TGGCAGGGGA ACGCACATTA 

1201 AGAGGTATGC GTCCGCTGGC GATTTTGCCC GACAACATCA CCACCGACCA 

12 51 CCTCTCGCCG TCCAATGCGA TTTTGGCCGT CAGTGCCGCA GGCGAGTATT 

13 01 TGGCGAAAAT GGGTTTGCCT GAAGAAGACT TCAACTCTTA CGCAACCCAC 

13 51 CGCGGCGACC ACTTGACCGC CCAACGCGCT ACCTTCGCCA ATCCGAAACT 

14 01 GTTTAACGAA ATGGTGAAAA ACGAAGACGG CAGCGTGCGC CAAGGCTCGT 
14 51 TCGCCCGCGT CGAACCCGAA GGCGAAACCA TGCGCATGTG GGAAGCCATC 
1501 GAAACCTATA TGAACCGCAA ACAGCCGCTC AT CATCATTG CCGGTGCGGA 
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15 51 CTATGGTCAA GGCTCAAGCC GCGACTGGGC TGC AAAAGG C GTACGCCTCG 

16 01 CCGGCGTAGA AGCGATTGTT GCCGAAGGCT TCGAGCGTAT CCACCGCACC 
16 51 AACCTTATCG GCATGGGCGT GTTGCCGCTG CAGTTCAAAC CCGACACCAA 
1701 CCGCCATACC CTGCAACTGG ACGGTACGGA AAC C T AC G AC GTGGTCGGCG 
1751 AACGCACACC GCGCTGCGAC CTGACCCTCG TGATTCACCG TAAAAACGGC 
18 01 GAAACCGTTG AAGTTCCCGT TACCTGCTGC CTCGATACTG CAGAAGAAGT 
1851 ATTGGTATAT GAAGCCGGCG GCGTGTTGCA ACGGTTTGCA CAGGATTTTT 
1901 TGGAAGGGAA CGCGGCTTAG 

This corresponds to the amino acid sequence <SEQ ID 354; ORF 099>: 

m099 . pep 

1 MLGRASMMRL PDIVGVELNG KRQAGITATD I VLALTE FLR KERWGAFVE 

51 FFGEGARSLS IGDRATISNM TPEFGATAAM FAIDEQTIDY LKLTGRDDAQ 

101 VKLVETYAKT AGLWADALKT AVYPRVLKFD LSSVTRNMAG PSNPHARFAT 

151 ADLAAKGLAK PYEEPSDGQM PDGSVIIAAI TSCTNTSNPR NWAAALLAR 

201 NANRLGLKRK PWVKSSFAPG SKVAEIYLKE AGLLPEMEKL GFGIVAFACT 

251 TCNGMSGALD PKIQKEIIDR DLYATAVLSG NRNFDGRIHP YAKQAFLASP 

3 01 PLWAYALAG SI RFDIENDV LGVADGKEIR LKDIWPADEE IDAWAEYVK 

351 PQQFRDVYVP MFDTGTAQKA PSPLYDWRPM STYIRRPPYW EGALAGERTL 

401 RGMRPLAILP DNITTDHLSP SNA I LAVS AA GEYLAKMGLP EEDFNSYATH 

451 RGDHLTAQRA TFANPKLFNE MVKNEDGSVR OGSFARVEPE GETMRMWEAI 

501 ETYMNRKQPL IIIAGADYGQ GSSRDWAAKG VR LAGVE A I V AEGFERIHRT 

551 NLIGMGVLPL QFKPDTNRHT LQLDGTETYD WGERTPRCD LTLVIHRKNG 

601 ETVEVPVTCC LDTAEEVLVY EAGGVLQRFA QDFLEGNAA* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 099 shows 96.2% identity over a 639 aa overlap with a predicted ORF (ORF 099.ng) 
from N. gonorrhoeae: 

m099/g099 

10 20 30 40 50 60 

m099 . pep MLGRASMMRLPDIVGVELNGKRQAGITATD I VLALTE FLRKERWGAFVE FFGEGARSLS 

II I I I I I MM M III M MMIM II II I I Ml I I II IMM I I II I I I MM II M II 

g099 MLGRASMMRLPDIVGVELTGKRQAGITATD I VLALTEFLRKERWGAFVE FFGEGARSLS 

10 20 30 40 50 60 

70 80 90 100 110 120 

m099 - pep I GDRAT I SNMTPEFGATAAMFAIDEQT I DYLKLTGRDDAQVKLVETYAKT AGLWADALKT 

Ml I II I II II I I III II I I IMI III I Ml II II M Mill I Ml I I I MM I Mil 

9099 I GDRAT I SNMTPE FGATAAMF AI D AQT I DYLKLTGRDDAQVKLVE TYAKTAGLW AGGLKT 

70 80 90 100 110 120 

130 140 150 160 170 180 

m0 9 9 . pep AV Y PRVL KFD LS S VTRNMAG P SN PHAR F AT AD LAAKGLAKP YE EPS DGQM PD G S V 1 1 AAI 

! 1 1 1 1 1 1 M 1 1 1 1 1 1 1 I M 1 1 1 1 1 1 1 1 1 1 II 11 1 11 I M i 1 M I 1 1 [ I 1 1 1 1 1 = 1 1 1 1 1 1 

g099 A VYPRVLKFDLSSVTRNMAG PSNPHARFAT AD LAAKGLAKP YEEPSDGQMPDGAVI I AAI 

130 140 150 160 170 180 

190 200 210 220 230 240 

m099 - pep TS CTNTSNPRNWAAALLARNANRLGLKRKPWVKSSFAPGSKVAE I YLKE AGLLPEMEKL 

MM II M III II III I I I I Mill III I Ml II I I II MUM lllllllllllllll 

g0 99 TSCTNTSNPRNWAAALLARNANRLGLKRKPWVKSSFAPGSKVAG I YLKE AGLLPEMEKL 

190 200 210 220 230 240 

250 260 270 280 290 300 

m099 . pep GFGIVAFACTTCNGMSGALDPKIQKEIIDRDLYATAVLSGNRNFDGRIHPYAKQAFLASP 

INI M Ml III I II I I M MIMMMI I MM I II M I I I I MM I I I MM II III I 

9099 G FG I V AF AC TTCNGM S GALD P K I QQ E 1 1 D RD L Y AT AVLS GNRN FDGR I H P Y AKQ AF LAS P 

250 260 270 280 290 300 

310 320 330 340 350 360 
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m099 . pep PLWAYALAGSIRFDIENDVLGVADGKEIRLKDIWPADEEIDAVVAEYVKPQQFRDVyVP 

IIIMIIllllllllllll!lllllhlllll!ll|:||||lhl!lllllllllhhl 

g09 9 PLWAYALAGSI.RFDIENDVLGVADGREIRLKDIWPTDEEIDAIVAEYVKPQQFRDIYIP 

310 320 330 340 350 360 



370 380 390 400 410 420 

m099 - pep MFDTGTAQKAPSPLYDWRPMSTYIRRPPYWEGALAGERTLRGMRPLAILPDNITTDHLSP 

I IIIIMIIIIIIMIIIIIIiMlllllllllllllllillll lllllillllhll 

g09 9 MSDTGTAQKAPSPLYDWRPMSTYIRRPPYWEGALAGERTLRGMRPPAILPDNITTDHISP 

370 380 390 400 410 420 



430 440 450 460 470 480 

mO 9 9 . pep SNAI LAVSAAGEYLAKMGLPEEDFNSYATHRGDHLTAQRATFANPKLFNEMVKNEDGSVR 

1 1 1 I I I IIIIIIIIIIIIIIMIIIIIIIMIIIMIIIIIIIIIIMIIhlllllM 

g099 SNAILAGSAAGEYLAKMGLPEEDFNSYATHRGDHLiTAQRATFANPKIjFNEMVRNEDGSVR 

430 440 450 460 470 480 



490 500 510 520 530 540 

m099 . pep qgsfarvepegetmrmweaietymnrkqpliiiagadygqgssrdwaakgvrlagveaiv 

I! : I I i III h! !| II ; II !i I I I ! II I ! I I IN I I I IN II M II I I I Mil i I M I : 
g099 qgslarvepegqtmrmweaietywnrkqpliiiagadygqgssrdwaakgvrlagveaia 

490 500 510 520 530 540 



550 560 570 580 590 600 

m099 . pep aegferihrtnligmgvlplqfkpdtnrhtlqldgtetydwgertprcdltlvihrkng 

I II; M MM M MM M I Mill I I II I I I I I I I I I I I M I I I M ! I MIMIIMI 
g099 aegferihrtnligmgvlplqfkpgtnrhtlqldgtetydwgertprcgltlvihrkng 

550 560 570 580 590 , 600 



610 620 630 640 

mO 9 9 . pep etvevpvtccldtaeevlvyeaggvlqrfaqdflegnaax 
I M I I I II I MMM.MMMM'i iMIMMMM 

g 0 9 9 ETVEVPVTCRPDTAEEAL.VYEAGGVLQRFAQDFLEGNAAX 

610 620 630 640 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 355>: 

a099. seq 

1 atgctgggac gcgcgtccat gatgcgcctg cccgatattg tcggcgttga 

51 gctgaacggc aaacggaagg cgggcattac ggcgacggat attgtgttgg 

101 cactgaccga gtttctgcgc aaagaacgcg tggtcggggc gtttgtcgaa 

151 ttcttcggcg agggcgcgag aagcctgtct atcggcgacc gcgcgaccat 

201 ttccaacatg acgccggagt tcggcgcgac tgccgcgatg ttcgctattg 

2 51 atgagcaaac cattgattat ttgaaactga ccggacgcga cgacgcgcag 

301 gtgaaattgg tggaaaccta cgccaaaacc gcaggcttgt gggcagatgc 

351 cttgaaaacc gccgtttatc cgcgcgtttt gaaatttgat ttgagcagcg 

4 01 taacgcgcaa tatggcaggc ccgagcaacc cgcacgcgcg ttttgcgacc 

4 51 gccgatttgg ccggcaaagg cttggctaaa ccttacgaag agccttcaga 

501 cggccaaatg cctgacggtg cagtgattat tgccgcgatt acttcctgta 

551 ccaatacttc caatccgcgc aacgttgtcg ccgccgcgct gttggcacgc 

601 aatgccaacc gcctcggctt gcaacgcaaa ccttgggtga aatcttcgtt 

651 tgccccgggt tcaaaagtag ccgaaatcta tttgaaagaa gcagatctgc 

7 01 tgcccgaaat ggaaaaactc ggcttcggta tcgttgcctt cgcatgtacc 

7 51 acctgtaacg gcatgagcgg cgcgctggat ccgaaaatcc agaaagaaat 

801 catcgaccgc gatttgtacg ccaccgccgt attgtcaggc aaccgcaact 

851 ttgacggccg tatccatccg tatgcgaaac aggctttcct cgcttcgcct 

901 ccgttggtcg ttgcctacgc gctggcaggc agcatccgtt tcgatattga 

951 aaacgacgta ctcggcgttg cagacggcaa agaaatccgc ctgaaagaca 

1001 tttggcctac cgatgaagaa atcgatgcca tcgttgccga atatgtgaaa 

1051 ccgcagcaat ttcgcgacgt ttatatcccg atgttcgaca ccggcacagc 

1101 gcaaaaagca ccaagcccgc tgtacgactg gcgtccaatg tctacctata 

1151 tccgccgccc accttactgg gaaggcgcac tggcagggga acgcacatta 

1201 agcggtatgc gtccgctggc gattttgccc g ac aac at c a ccaccgacca 
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1251 TCTCTCGCCA TCCAATGCGA TTTTGGCAAG CAGTGCCGCA GGCGAATATT 

1301 TGGCAAAAAT GGGTTTGCCT GAAGAAGACT TCAACTCTTA CGCAACCCAC 

13 51 CGTGGCGACC ACTTGACCGC CCAACGCGCA ACCTTCGCCA ATCCGAAACT 

14 01 GTTTAACGAA ATGGTGAGAA ACGAAGACGG CAGCGTACGC CAAGGTTCGC 
14 51 TGGCACGCGT TGAACCCGAA GGCCAAACCA TGCGCATGTG GGAAGCCATC 
1501 GAAACCTATA TGAACCGCAA ACAGCCGCTC AT CAT CAT TG CCGGCGCGGA 
1551 CTACGGTCAA GGCTCAAGCC GCGACTGGGC TGCAAAAGGC GTACGCCTCG 
1601 CCGGCGTGGA AGCGATTGTT GCCGAAGGCT TCGAGCGTAT CCACCGCACC 
1651 AACTTGATCG GTATGGGCGT GTTGCCGCTG CAGTTCAAAC CGGGTACCAA 
17 01 CCGCCACACC CTGCAACTGG ACGGTACGGA AACCTACGAC GTTGTCGGCG 
17 51 AACGCACACC GCGCTGCGAC CTGACCCTTG TGATTCACCG TAAAAACGGC 
1801 GAGACCGTCG AAGTCCCCAT TACCTGCCGC CTCGATACCG CAGAAGAAGT 
1851 GTTGGTATAT GAAGCCGGTG GCGTATTGCA ACGGTTTGCA CAGGATTTTT 
1901 TGGAAGGGAA CGCGGCTTAG 

This corresponds to the amino acid sequence <SEQ ED 356; ORF 099. a>: 
a099 .pep 

1 MLGRASMMRL PDIVGVELNG KRKAG I TAT D IVLALTEFLR KERWGAFVE 

51 FFGEGARSLS IGDRATISNM TPEFGATAAM FAIDEQTIDY LKLTGRDDAQ 

101 VKLVETYAKT AGLWADALKT AVYPRVLKFD LSSVTRNMAG PSNPHARFAT 

151 ADLAGKG LAK PYEEPSDGQM PDGAVIIAAI TSCTNTSNPR NWAAALLAR 

2 01 NANRLGLQRK PWVKSSFAPG SKVAEIYLKE ADLLPEMEKL GFG I VAFACT 

251 TCNGMSGALD PKIQKEIIDR DLYATAVLSG NRNFDGRIHP YAKQA FLASP 

301 PLWAYALAG SI RFDIENDV LGVADGKEIR LKDIWPTDEE IDAIVAEYVK 

351 PQQFRDVYIP MFDTGTAQKA PSPLYDWRPM STYIRRPPYW EGALAGERTL 

4 01 SGMRPLAILP DNITTDHLSP SNAILASSAA GEYLAKMGLP EEDFNSYATH 

4 51 RGDHLTAQRA TFANPKLFNE MVRNEDGSVR QGSLARVEPE GQTMRMWEAI 

501 ETYMNRKQPL IIIAGADYGQ GSSRDWAAKG VRLAGVEAIV AEGFERIHRT 

551 NLIGMGVLPL QFKPGTNRHT LQLDGTETYD WGERTPRCD LTLVIHRKNG 

601 ETVEVPITCR LDTAEEVLVY EAGGVLQRFA QDFLEGNAA+ 



m099/a099 97.5% identity in 639 aa overlap 

10 20 30 40 50 60 

m09 9 . pep MLGRASMMRLPDIVGVELNGKRQAGITATDIVLALTEFLRKERVVGAFVE FFGEGARSLS 

I I II I 1 M I I I I I I II I i I I I I : I 1 I M I 1 I I I I I I I I I I M I I I I 1 I I I 1 M I M I M I 
a09 9 MLGRASMMRLPDIVGVELNGKRKAGITATD I VLALTEFLRKERVVGAFVE FFGEGARSLS 

10 20 30 40 50 60 



70 80 90 100 110 120 

m099 . pep IGDRATISNMTPEFGATAAMFAIDEQTIDYLKLTGRDDAQVKLVETYAKT AGLWADALKT 

I I M I I I I M I I M I I I I I I I 1 I I I I I I I I I I i I I I I I I I I I I I I 1 I I I I ! I M I I I I I I 
a09 9 I G DRAT I SNMTPEFGATAAMFAIDEQT I DYLKLTGRDDAQVKLVETYAKT AGLWADALKT 

70 80 90 100 110 120 



130 140 150 160 170 180 

m09 9 .pep AVYPRVLKFDLSSVTRNMAGPSNPHARFATADLAAKGLAKPYEEPSDGQMPDGSVIIAAI 
I I II I I I I t I I I I I I I I M I I I I II I I I I II I M : I I I I I II I 1 I I I I I I I t I : I II I I I 
a099 AVYPRVLKFDLSSVTRNMAGPSNPHARFATADLAGKGLAKPYEEPSDGQMPDGAVIIAAI 

130 140 150 160 170 180 



190 200 210 220 230 240 

m099 .pep TSCTNTSNPRNWAAALLARNANRLGLKRKPWVKSSFAPGSKVAEIYLKEAGLLPEMEKL 
I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I ! I II I I I I I I II 1 
a099 TSCTNTSNPRKWAAALLARNANRLGLQRKPWVKSSFAPGSKVAEIYLKEADLLPEMEKL 

190 200 210 220 230 240 



250 260 270 280 290 300 

m099 . pep GFGIVAFACTTCNGMSGALDPKIQKEI I DRDLYATAVLSGNRNFDGRIHPYAKQAFLASP 

I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 
a09 9 GFGIVAFACTTCNGMSGALDPKIQKEI I DRDLYATAVLSGNRN FDGR I HPYAKQAFLASP 

250 260 270 280 290 300 



m099 . pep 



310 320 330 340 350 360 

PLWAYALAGSIRFDIENDVLGVADGKEIRLKDIWPADEEIDAWAEYVKPQQFRDVYVP 
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I I I I I I I I I l I I M I I I I I I I I 1 I I II t I I M I I I I : I I I I I I : I t I I I 11 I II I II I : I 
a0 99 PLVVAYALAGSIRFDIENDVLGVADGKEIRLKDIWPTDEEIDAIVAEYVKPQQFRDVYIP 

310 320 330 340 350 360 

370 380 390 400 410 420 

m099 .pep MFDTGTAQKAPSPLYDWRPMSTYIRRPPYWEGALAGERTLRGMRPLAILPDNITTDHLSP 
I I I i I I M I I I I Ml I II I I I I I I I I M I I I I I I I I I I i I I II I I I I I I I M I I II I II 
a0 9 9 MFDTGTAQKAPSPLYDWRPMSTYIRRPPYWEGALAGERTLSGMRPLAILPDNITTDHLSP 

370 380 390 400 410 420 

430 440 450 460 470 480 

m0 9 9.pep SNAILAVSAAGEYLAKMGLPEEDFNSYATHRGDHLTAQRATFANPKLFNEMVKNEDGSVR 
I I I I I I I I I II I I I M I I I I M I I I I I I I M I I II M I I I I I I I I I I I I I I :! I I I I I I 
a099 SNAILASSAAGEYLAKMGLPEEDFNSYATHRGDHLTAQRATFANPKLFNEMVRNEDGSVR 

430 440 450 4 60 470 480 

490 500 510 520 530 540 

m099.pep QGSFARVEPEGETMRMWEAIETYMNRKQPLI I IAGADYGQGSSRDWAAKGVRLAGVEAIV 

I I I : I I I I I I I : I I I M I I I II I I I I I ! II II I I I I M I I I I I I I I I I I I I I I M I I I I I 
a0 99 QGSLARVEPEGQTMRMWEAIETYMNRKQPLII IAGADYGQGSSRDWAAKGVRLAGVEAIV 

490 500 510 520 530 540 

550 560 570 580 590 600 

m099 . pep AEGFERIHRTNLIGMGVLPLQFKPDTNRHTLQLDGTETYDWGERTPRCDLTLVIHRKNG 

II I M I I II I II I I I I I I I I I I I I M I I I I I II I M I I I I I I I I I M I 1 I M I I I I I I I 
a09 9 AEGFERIHRTNLIGMGVLPLQFKPGTNRHTLQLDGTETYDWGERTPRCDLTLVIHRKNG 

550 560 570 580 590 600 

610 620 630 640 

mO 9 9 . pep ETVEVPVTCCLDTAEEVLVYEAGGVLQRFAQDFLEGNAAX 
111111:11 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 
a0 99 ETVEVPITCRLDTAEEVLVYEAGGVLQRFAQDFLEGNAAX 

610 620 630 640 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 357>: 

gl02 . seq 

1 AtgtCCGCCA AAactccgtc gctcttcggc ggcgcgatga Ttatcgccgg 

51 gaaggttatc ggcgcAGgta tgttccccaa ccccaccgcc aacttggggg 

101 acgggttaat aggctcgctg attgtgctgc tgtacacctg gtttccattc 

151 tcctccggcg ccctcatgat tttggaagtc aacacccata acCCccgagg 

201 ggcaAGtttt gacaccATGg tcAAagacct gctcgGaCGc ggctggaaca 

251 tcatcaacgg catcgccgtc gctttggTCc tatacggctc gacctacgcg 

301 tacattttag tcggcggtga cctGACCGCC AAAGGCAtcg GCAgCGCAGT 

351 AGGCGGCAAA ATTTCgctca CCGTCGGACA actcgtcttc tTCGGCATCC 

401 TCGCCTTTTG CGTATGGGCA TCCGCACGCT TGGTCGACCG CTTTACCGGC 

4 51 GTCCTCATCG GCGGCATGGT ATTAACCTTT ATTTGGGCAA CCGGCGGCCT 
501 GGTTGCCGAT GCCAAACCGT CCGTCCTCTT CGACACCCAA GCCCCCGTCG 

5 51 GCACCGGCTA CTGGATTTAC GCCGCCACCG CCCTGCCCGT CTGCCTCGCT 
601 TCCTTCGGCT TCCACGGCAA CGTTTCCAGC CTGCTCAAAT ACTTTAAAGG 
651 CGACGcgcCc aaagtGgCGA aATCcatctg gGcaggtaca ttggTTGCCt 
701 tggtaattta cgtccTCTgg caaaccgcca tCcaaagcaa ccTGCcgcgc 
751 aacgagttcg cCCCcgtgat tgccgccgag aggcaactCT CCGTCCTgaa 
801 tgaaacccTG tccaaattcg cccaaaccgg cgatatggat aAaatattgt 
851 ccctatttcc ctacatggca atcgccacct cctttttagg cgTAACctta 
901 ggcctgtttg acaacatcgc cgacatcttc aaatggaacg acagtatgtc 
951 cgggcggggc accaaaaccg tcgcgctgaa cttcctgccg CCCCtgattt 

1001 cctggctgct cctccccacc ggcttcttta ccgccattgg tgcgtccggc 

1051 ctggcggcaa ccgtctggga ccaagGcatc atccccgcca tgctgctcta 

1101 cgtttccccc caaaaaattG gcGcaggcaa gacttataAa gtttaCGGCG 

1151 gcttgtggct gatgttagtc ttccttttcg gcatcgccaa catcgccgca 

1201 CAGGTATTGA GccaAatgGa ACtcgtCccc GTATTTAAAG GATAA 

This corresponds to the amino acid sequence <SEQ ID 358; ORF 102.ng>: 

gl02 -pep 
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1 MSAKTPSLFG GAMIIAGKVI GAGMFPNPTA NLGDGLIGSL IVLLYTWFPF 

51 SSGALMI LEV NTHNPRGASF DTMVKDLLGR GWNIING IAV ALVLYGSTYA 

101 YILV GGDLTA KGIGSAVGGK ISLTVGQLVF FGI LAFC VWA SARLVDRFTG 

151 V LIGGMVLTF IWATGGLV AD AKPSVLFDTQ APVGTGYWIY AATALPVCLA 

201 SFGFHGNVSS LLKYFKGDAP KVAKSIW AGT LVALVIYVLW QTAI QSNLPR 

251 NEFAPVI AAE RQLSVLNETL SKFAQTGDMD K ILSLFPYMA I ATSFLGV TL 

301 GLFDNIADIF KWNDSMSGRG TKTVALNFLP P LISWLLLPT GFFTAIGA SG 

351 LAATVWDQGI IPAMLLYVSP QKIGAGKTYK VYGGLW LMLV FLFGI ANIAA 

4 01 QVL SQMELVP VFKG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 359>: 

ml 02 . seq 

1 ATGCCCAACA AAACCCCTTC ACTGTTCGGC GGCGCGATGA TTATCGCCGG 

51 CACGGTCATC GGCGCAGGCA TGCTCGCCAA CCCGACCGCC ACATCCGGCG 

101 TATGGTTTAC CGGCTCGCTG GCCGTGTTGC TGTACACCTG GTTTTCTATG 

151 CTTTCCAGCG GCCTGATGAT TTTGGAAGTC AACACCCATT ATCCGCACGG 

201 CGCAAGTTTC GACACGATGG TCAAAGACCT GCTCGGACGC GGCTGGAACA 

251 TCATCAACGG CATCGCCGTC GCCTTCGTTT TATACCTGCT TACTTACGCT 

301 TATATCTTCG TCGGCGGCGA CCTGACCGCC AAAGGCTTAG GCAGCGCGGC 

351 AGGCGGCGAC GTTTCACTCA CCGTCGGACA ACTCGTCTTC TTCGGCATCC 

401 TCGCCTTTTG CGTATGGGCA TCCGCACGCT TGGTCGACCG CTTCACCGGC 

4 51 GTCCTTATCG GCGGCATGGT ATTGACCTTT ATTTGGGCGG CCGGCGGGCT 

501 GATTGCCGAT GCCAAGCCGT CCGTCCTCTT CGATACCCAA GCCCCCGCCG 

551 GCACAAACTA CTGGATTTAC GCCGCCACCG CCCTGCCCGT CTGCCTCGCT 

601 TCCTTCGGCT TCCACGGCAA CGTCTCCAGC CTGCTCAAAT ACTTTAAAGG 

651 CGACGCGCCC AAAGTGGCTA AATCCATCTG GACGGGCACA CTGATTGCGC 

701 TGGTAATTTA CGTCCTCTGG CAAACCGCCA TCCAAGGCAA CCTGCCGCGC 

751 AACGAGTTCG CCCCCGTCAT CGCCGCCGAA GGGCAAGTCT CCGTCCTCAT 

801 CGAAACCCTG TCCAAATTCG CCCAAACCGG CAATATGGAC AAAATATTGT 

851 CCCTGTTTTC CTATATGGCG ATCGCCACCT CGTTTTTAGG CGTAACGCTC 

901 GGACTCTTCG ACTACATCGC CGACATCTTC AAATGGAACG ACAGCATCTC 

951 CGGCCGCACC AAAACCGCCG CGCTGACCTT CCTGCCGCCC CTGATTTCCT 

1001 GCCTGCTCTT CCCCACCGGC TTCGTTACCG CCATCGGCTA CGTCGGCCTG 

1051 GCGGCAACCG TCTGGACAGG CATCATCCCC GCCATGCTGC TCTACCGTTC 

1101 GCGCAAAAAA TTCGGCGCAG GCAAAACCTA TAAAGTTTAC GGCGGCTTGT 

1151 GGCTGATGGT TTGGGTCTTC CTTTTCGGCA TCGTCAACAT CGCCGCACAG 

1201 GTATTGAGCC AAATGGAACT CGTCCCCGTA TTTAAAGGAT AA 



This corresponds to the amino acid sequence <SEQ ID 360; ORF 102>: 

ml 02 .pep. . 

1 MPNKTPSLFG GAMIIAGTVI GAGMLANPTA TSGVWFTGSL AVLLYTWFSM 

51 LSS GLMILEV NTHYPHGASF DTMVKDLLGR GWNII NGIAV AFVLYLLTYA 

101 YIFVGGDLTA KGLGSAAGGD VSLTVGQLVF FGI LAFC VWA SARLVDRFTG 

151 V LIGGMVLTF IWAAGGLI AD AKPSVLFDTQ APAGT NYWIY AATALPVCLA 

201 SFGFHGNVSS LLKYFKGDAP KVAKS IWTGT LIALVIYVLW QT AIQGNLPR 

251 NEFAPVI AAE GQVSVLIETL SKFAQTG NMP KILSLFSYMA I ATS FLGVTL 

301 GLFDYIADIF KWNDSISGRT KTAALTFLPP LISCLL FPTG FV TAIGYVGL 

351 AATVWTGII P AMLLYRSRKK FGAGKTYKV Y GGLWLMVWVF LFGIVN IAAQ 

401 VLSQMELVPV FKG* 



ml02/gl02 86.0% identity in 415 aa overlap 

10 20 30 40 50 60 

ml 02 .pep MPNKTPSLFGGAMI IAGTVIGAGMLANPTATSGVWFTGSLAVLLYTWFSMLSSGLMILEV 

I I I I I I I i I I I I I I I llllll: till: I : III 1 I I I I I I • I :: I I I I I I 
g 102 MSAKTPSLFGGAMIIAGKVIGAGMFPNPTANLGDGLIGSLIVLLYTWFPFSSGALMILEV 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml 02 . pep NTHYPHGASFDTMVKDLLGRGWNI INGI AVAFVLYLLTYAYI FVGGDLTAKGLGSAAGGD 

III I : I I I I I I I I I I I I I I I I I I I I I t I I I : I I I I I II I : I I I I I I I I I : t I i : I I 
gl02 NTHNPRGASFDTMVKDLLGRGWNIINGIAVALVLYGSTYAYILVGGDLTAKGIGSAVGGK 

70 80 90 100 110 120 



130 140 150 160 170 180 

rnl02 . pep VSLTVGQLVFFGI LAFCVWASARLVDRFTGVLIGGMVLTFIWAAGGLIADAKPSVLFDTQ 

: I I I I I I I I I I I I I i I I I I I I I I I I I M I I I I I I I I II I I I I I : I I I : I I I I I I I I I I M 
gl02 ISLTVGQLVFFGILAFCVWASARLVDRFTGVLIGGMVLTFIWATGGLVADAKPSVLFDTQ 

130 140 150 160 170 180 



ml02 . pep 



190 200 210 220 230 240 

APAGTNYWI YAATALPVCLASFG FHGNVSSLLKYFKGDAPKVAKSIWTGTLI ALVI YVLW 
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I I : I I : I I I I I I I I I ! I I I I I I I I I I I I I I I I I 1 I I I I I I i i I I I I I : I I I : I I I I I I I I 
gl02 APVGTGYWIYAATALPVCLASFGFHGNVSSLLKYFKGDAPKVAKSIWAGTLVALVIYVLW 
190 200 210 220 230 240 

250 260 270 280 290 300 

ml02 . pep QTAIQGNLPRNEFAPVI AAEGQVSVLIETLSKFAQTGNMDKILSLFSYMAI ATSFLGVTL 

I I I I I : I I I I i I I I I I I I I I I : ! I I I I I I I I t I I I : I I It I I I I I I ! I I I I I I I I I I 
g 102 QTAIQSNLPRNEFAPVIAAERQLSVLNETLSKFAQTGDMDKILSLFPYMAI ATSFLGVTL 

250 260 270 280 290 300 

310 320 330 340 350 

ml02 .pep GLFDYIADI FKWNDSISGR-TKTAALTFLPPLISCLLFPTGFVTAIGYVGLAATVWT-GI 

I I I I I I I I I I i I I I : I I I I I I : I I : I I I I 1 I i i I : I I I I I I I I I I I I I II I f 
g l02 GLFDNIADIFKWNDSMSGRGTKTVALNFLPPLISWLLLPTGFFTAIGASGLAATVWDQGI 

310 320 330 340 350 360 

360 370 380 390 400 410 

ml 0 2 . pep I PAMLLY RSRKK FGAGKT YKV YGGLWLMVWVFLFGI VN I AAQVLSQMELVP VFKGX 

I I I I I I I I : I : I I i I I I I I 1 I I I I M : I I I I I I : I I I I I I I I I I 1 I I I I I I i I 
gl02 I PAMLLYVSPQK I GAGKTYKVYGGLWLML-VFLFG I ANI AAQVLSQMELVP VFKGX 

370 380 390 400 410 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 36 1>: 

al02 . seq 

1 ATGCCCACCA AAACCCCTTC ACTGTTCGGC GGCGCGATGA TTATCGCCGG 

51 CACGNTCATC GGCGCAGGTA TGCTCGCCAA CCCGACCGCC ACATCCGGCG 

101 TATGGTTTAC CGGCTCGCTG GCCGTGTTGC TGTACACCTG GTTTTCCATG 

151 CTCTCCAGCG GCCTGATGAT TTTGGAAGTC AACACCCACT ACCCCCACGG 

2 01 CGCGANCTTC GACACCATGG TTAAAGACCT GCTCGGACGG AGCTGGAACA 

2 51 TCATCAACGG CATCGCCGTC GCCTTCGTTT TATACCTGCT TACTTACGCT 

301 TATATCTTCG TCGGCGGCGA CCTGACCGCC AAAGGCTTAG GCAGCGCGGC 

351 AGGCGGCAAT GTTTCACTCA CCGTCGGACA ACTCGTCTTC TTCGGCATTC 

4 01 TCGCCTTTTG CGTATGGGCA TCCGCACGCT TGGTCGACCG ATTCACCAGC 

4 51 GTCCTCATCG GCGGCATGGT ATTAACCTTT ATTTGGGCAA CCGGCGGCCT 

501 GATTGCCGAT GCCAAACTGC CCGTCCTCTT CGACACCCAA GCCCCTACCG 

551 GCACCAACTA CTGGATTTAT GTCGCCACCG CCCTGCCCGT CTGCCTTGCG 

601 TCATTCGGTT TCCACGGCAA CGTCTCCAGC CTGCTCAAAT ACTTTAAAGG 

651 CGACGCGCCC AAAGTGGCTA AATCCATCTG GACGGGCACA CTGATTGCGC 

701 TGGTAATTTA CGTCCTCTGG CAAACCGCCA TCCAANGCAA CCTGCCGCGC 

7 51 AACGAGTTCG CCCCCGTGAT TGCCGCCGAA GGGCAAGTCT CCGTCNTGAT 
801 TGAAACCCTG TCCAAATTCG CCCAAACCGG CAATATGGAC AAAATATTGT 

8 51 CCCTGTTTTC CTATATGGCG ATCGCCACCT CGTTTTTAGG CGTAACGCTC 
901 GGACTCTTCG ACTACATCGC CGACATCTTC AAATGGAACG ACAGCGTGTC 
951 CGGCCGCACC AAAACCGCCG CGCTGACCTT CCTGCCGCCT NTAATTTCCT 

1001 GCCTGCTCTT CCCCACCGGC TTTGTTACCG CCATCGGNTA CGTCGGCCTG 

1051 GCGGCAACCG TCTGGACAGG CATCATCCCC GCCATGCTGC TNTACCGTTC 

1101 GCGCAAAAAA TTCGGCGCAG GCAAAACCTA TAAAGTTTAC GGCGGCTTGT 

1151 GGCTGATGGT TTGGGTCTTC CTTTTCGGCA TCNTCAACAT CGCCGCACAN 

1201 GTATTGAGCC AAATGGAACT CGTCCCCGTA TTTAAAGGAT AA 
1202 

This corresponds to the amino acid sequence <SEQ ID 362; ORF 102.a>: 

al02 .pep 

1 MPTKTPSLFG GAMIIAGTXI GAGMLANPTA TSGVWFTGSL AVLLYTWFSM 

51 LSSGLMILEV NTHYPHGAXF DTMVKDLLGR SWNIINGIAV AFVLYLLTYA 

101 YIFVGGDLTA KGLGSAAGGN VSLTVGQLVF FGILAFCVWA SARLVDRFTS 

151 VLIGGMVLTF IWATGGLIAD AKLPVLFDTQ APTGTNYWIY VATALPVCLA 

201 SFGFHGNVSS LLKYFKGDAP KVAKSIWTGT LIALVIYVLW QTAIQXNLPR 

251 NEFAPVIAAE GQVSVXIETL SKFAQTGNMD KILSLFSYMA I ATSFLGVTL 

301 GLFDYIADIF KWNDSVSGRT KTAALT FLPP XISCLLFPTG FVTAIGYVGL 

351 AATVWTGIIP AMLLYRSRKK FGAGKT YKV Y GGLWLMVWVF LFGIXNIAAX 

401 VLSQMELVPV FKG* 



ml02 / al02 



95.9% identity in 413 aa overlap 
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10 20 30 40 50 60 

ml02 . pep MPNKTPSLFGGAMIIAGTVIGAGMLANPTATSGVWFTGSLAVLLYTWFSMLSSGLMILEV 
I I : I I I ! I I I I I I I I It I I I I I I I I I I I I! I I I M I I I I I I I I 11 I I i II I I i II I I I I 
a 102 MPTKTPSLFGGAMIIAGTXIGAGMLANPTATSGVWFTGSLAVLLYTWFSMLSSGLMILEV 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 02 .pep NTHYPHGASFDTMVKDLLGRGWNI INGIAVAFVLYLLTYAYIFVGGDLTAKGLGSAAGGD 

I M I I I I I I I I M M I I I I : I I I II I I I I I I I i 1 I I I I I I I M I I I I I I I I I I I I I I I : 
al02 NTHYPHGAXFDTMVKDLLGRSWNI INGIAVAFVLYLLTYAYIFVGGDLTAKGLGSAAGGN 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml02 . pep VSLTVGQLVFFGILAFCVWASARLVDRFTGVLIGGMVLTFIWAAGGLIADAKPSVLFDTQ 
I I I I I I I M I I I M I I I I I I I I I I I I I I I : I II I I I I I I M I I : I I I I I M I I I I I I I 
a 102 VSLTVGQLVFFGILAFCVWASARLVDRFTSVLIGGMVLTFIWATGGLIADAKLPVLFDTQ 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml02 . pep APAGTNYWIYAATALPVCLASFG FHGNVSSLLKYFKGDAPKVAKSIWTGTLIALVIYVLW 

I I : II I I I I I : M I I I I I I M I II I I I I I I I I E I I M I I I 1 M I I I I I I I II I M I I I I I 
al02 APTGTNYWIYVATALPVCIJVSFGFHGNVSSLLKYFKGDAPKVAKS IWTGTLIALVI YVLW 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml02 . pep QTAIQGNLPRNEFAPVIAMGQVSVLIETLSKFAQTGNMDKILSLFSYMAIATSFLGVTL 
I I I I I I I I I I I I I I I I I I I I I t I I I I I I I 11 I I 1 I I I I I i I I I I I I I I I I I I I I I I M 
al02 QTAIQXNLPRNEFAPVIAAEGQVSVXIETLSKFAQTGNMDKILSLFSYMAIATSFLGVTL 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml02 .pep GLFDYIADIFKWNDSISGRTKTAALTFLPPLISCLLFPTGFVTAIGYVGLAATVWTGIIP 
M I I I I I I I I I I 1 I I : I I I M I II I I M I I I I I I I M I I I I I I I I I I I I I M I I I I I I I 
al02 GLFDYIADIFKWNDSVSGRTKTAALTFLPPXISCLLFPTGFVTAIGYVGLAATVWTGIIP 

310 320 330 340 350 360 

370 380 390 400 410 

ml 02 . pep AM L L YR S RKK FG AG KT Y KV Y G G L W LMVWV F L FG I VN I AAQ V L S QM E L V P V FKGX 

I M M I I M I I M I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I 
a!02 AMLLYRSRKKFGAGKTYKVYGGLWLMVWVFLFGIXNIAAXVLSQMELVPVFKGX 

370 380 390 400 410 



The following partial DNA sequence was 



g!05 - seq 






1 


Atgtccgcag 


a a a CAT AC Ac 


51 


gGgtctgcct 


atgGTAACGC 


101 


TATACAACCG 


CTCGCCCGAC 


151 


AAAGTTTACG 


GCagcACCGC 


201 


CCTGATGGTT 


TCCGACTATG 


251 


GCGACGGATT 


GGCCGGCAAA 


301 


ACCGAAAACC 


TCGCCGTCAA 


351 


TGCCGAAGCA 


CCCGTTTCCG 


401 


TGCTGATTCT 


GTTCGGCGGC 


451 


ATATTTTCCC 


TTGTCGGCAA 


501 


AGGCTCGGGC 


GCGAAACTCG 


551 


AAGCGTACAG 


CGAAGCGATG 


601 


GACACCATCG 


TCGAAGC CAT 


651 


TCAAACAAAA 


AAATCACTAT 


701 


CACTCAAACA 


CGCTTCCAAA 


751 


CAGG CAGGCA 


ACACCCTGCC 


801 


CAAAGCAGTT 


GAAGCCGGCT 


851 


TGAAATTGGC 


AGAACAC TG A 



identified in TV. gonorrhoeae <SEQ ID 363>: 



acAAAtcggc 
GGCTCTTGGA 
AAAACTGCCC 
CGAACTCGTC 
CCGCCGTGTG 
ATCATCGTCA 
AGCACTTGTC 
GATCGGTCGG 
AGCGAAGCCG 
AAAAACCTTC 
TCTTGAACTC 
CTGATGGCGC 
CGGCGGCTCG 
GGGCAAACCG 
GAcctTAACC 
CGCCGTCGAA 
ACGGCGAACA 



tGGgtaggct 
CGGCGGCATC 
CCATCTCcgc 
CGCGCCTGCC 
CGACATCCTG 
ACATGAGCAC 
GAAGCCGCAG 
ACCCGCCACC 
TTTTAAACCC 
CATTTCGGCG 
GCTCTTAGGC 
GGCAGTTCGG 
GCAATGGACT 
TGAGTTCCCC 
TCGccgtcAA 
ACCGTTGCTG 
GGACGTTTCC 



taggGcaaat 
GAAGTCGGCG 
C AAAGG AG C A 
CCGTCATTTT 
AACGGAGTCC 
CATCTCCCCG 
GCGGACAGTT 
AACGGCACAC 
GCTGCAAAAA 
ATGTCGGCAA 
ATTTTCGGCG 
CATCGATACC 
CGCCTATGTT 
CCTGCCTTTG 
AGAGCTTGAA 
CCAGCTACCG 
GGCGTTTACC 
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This corresponds to the amino acid sequence <SEQ ID 364; ORF 105. ng>: 

gl05 -pep 



1 MSAETYTQIG WVGLGQMGLP MVTRLLDGGI EVGVYNRSPD KTAPISAKGA 

51 KVYGSTAELV RAC PVIFLMV SDYAAVCDIL NGVRDGLAGK IIVNMSTISP 

101 TEN LAVKAL.V EAAGGQFAEA PVSGSVGPAT NGTLLILFGG SEAVLNPLQK 

151 I FSLVGKKTF HFGDVGKGSG AKLVLNSLLG IFGEAYSEAM LMARQFGIDT 

201 DTIVEAIGGS AMDSPMFQTK KSLWANREFP PAFALKHASK DLNLAVKELE 

2 51 QAGNTLPAVE TV AAS YRKAV EAGYGEQDVS GVYLKLAEH 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 365>: 

ml05 . seq 



1 ATGTCCGCAA ACGAATACGC ACAAATCGGC TGGaTAGGCT TAGGGCAAAT 

51 GGGTCTGCCT ATGGTAACGC GGCTCTTGGA CGGCGGCATC GAAGTCGGCG 

101 TATACAACCG CTCGCCCGAC AAAACTGCCC CCATCTCCGC CAAAGGCGCA 

151 AAAGTTTACG GCAACACCGC CGAACTCGTC CGCGACTATC CCGTCATTTT 

2 01 CCTGATGGTT TCCGACTATG CCGCCGTGTG CGACATCCTG AACGGAGTCC 

2 51 GCGACGGATT GGCCGGCAAm ATCATCGTCA ACATGAG C AC CATCTCCCCG 

3 01 ACCGAAAaGC TCGCCGTCAA AGCACTTGTC GAAGCGCAGm GaCAGTTTGC 

3 51 CGAAGCACCC GTTTCCGGAT CGGTCGGGCC CGCCACCAAC GGCACGCTGC 

4 01 TGATTCTGTT CGGCGGCAGC GAAcCGtTTT AAACCCGCTG CAAAAAATAT 
4 51 TTTCCCTCGT CGGCAAAAAA ACCTTCCATT TCGGCGATGT CGGCAAAGGT 
501 TCGGGCGCGA AACTCGTCTT GAACTCGCTC TTGGGCATTT TCGGCGAaCG 
551 TAcAGCGAAs GmTg CTGATG GCGCGGCAGT TCGGCATCGA TACCGACACC 
6 01 ATCGTCGAAG CCATCGGsGA CTCGGCAATG GACTCGCCCA TGTTCCAAAC 

6 51 CAAAAAATCC CTGTGGGCAA ACCGCGAATT CCCGmCCGmC TTCGCCCTCA 

7 01 AACACGCCTC CAAAGACCTC AACCTCGCCG TCAAAGAGCT TGAACAGGCA 

7 51 GGCAACACCC TGCCCGCCGT CGAAACCGTT GCTGCCAGCT ACCGCAAAGC 
801 AGTCGAAGCC GGCTACGGGA CACAGGACGT TTCCGGCGTT TACCTGAAAC 

8 51 TGGCAGAACA CTGA 



This corresponds to the amino acid sequence <SEQ ID 366; ORF 105>: 

ml 05 -pep 

1 MSANEYAQIG WIGLGQMGLP MVTRLLDGG I EVGVYNRSPD KTAPISAKGA 



51 KVYGNTAELV RDY PVIFLMV SDYAAVCDIL NGVRDGLAGX IIVNMSTISP 

101 TEKLAVKALV EAQRQFAEAP VSGSVGPATN GTLLILFGGS EPFXTRCKKY 

151 FPSSAKKPSI SAMSAKVRAR NSSXTRSWAF SANVQRXXLM ARQFGIDTDT 

2 01 IVEAIGDSAM DSPMFQTKKS LWANREFPXX FALKHASKDL NLAVKELEQA 

251 GNTLPAVETV AAS YRKAVE A GYGTQDVSGV YLKLAEH 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 105 shows 79.9% identity over a 289 aa overlap with a predicted ORF (ORF 105.ng) 
from N. gonorrhoeae: 

ml05/gl05 



gl05 . pep 



10 20 30 40 50 60 

MSAETYTQIGWVGLrGQMGLPMVTRLLDGGI EVGVYNRSPDKTAPI SAKGAKVYGSTAELV 



ml05 




10 20 30 40 50 60 



gl05 . pep 



70 80 90 100 110 120 

RACPVIFLMVSDYAAVCDILNGVRDGLAGKI IVNMSTISPTENLAVKALVEAAGGQFAEA 



ml05 




70 80 90 100 110 



g!05 . pep 



130 140 150 160 170 180 

PVSGSVGPATNGTLLILFGGSEAVLNPLQKIFSLVGKKTFHFGDVGKGSGAKLVLNSLLG 



ml05 



PVSGSVGPATNGTLLILFGGSEPFXTRCKKXTPSSAKKP-SISAMSAKVRARNSSXTRSW 
0 130 140 150 160 170 





120 
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190 200 210 220 230 240 

gl05 .pep I FGEAYSEAMLMARQFG IDTDTI VEAI GGS AMDS PMFQTKKS LWANRE FPPAFALKHAS K 

I : : : I I I I I I I I i I II I I I I I I I I I I I I M I I I t I I I I I I I I I I I I I I I I I 

ml05 AFSANVQRXXLMARQFGIDTDTIVEAIGDSAMDSPMFQTKKSLWANREFPXXFALKHASK 
180 190 200 210 220 230 



gl05 . pep 
ml05 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 367>: 

al05 . seq 

1 ATGTCCGCAA ACGAATACAC ACAAATCGGC 

51 GGGTCTGCCT ATGGTAACGC GGCTCTTGGA 

101 TATACAACCG CTCGCCCGAC AAAACTGCCC 

151 AAAGTTTACG GCAACACCGC CGAACTCGTC 

201 CCTGATGGTT TCCGACTATG CCGCCGTGTG 

2 51 GCGACGGATT GGCCGGCAAA ATCATCGTCA 

301 ACCGAAAACC TCGCCGTCAA AGCACTTGTC 

351 TGCCGAAGCA CCCGTTTCCG GATCGGTCGG 

4 01 TGCTGATTCT GTTCGGCGGC AGCGAAGCCG 

4 51 ATATTTTCCC TCGTCGGCAA AAAAACCTTC 

501 AGGTTCGGGC GCGAAACTCG TCTTGAACTC 

551 AAGCGTACAG CGAAGCGATG CTGATGGCGC 

601 GACACCATCG TCGAAGCCAT CGGCGGCTCG 

651 CCAAACCAAA AAATCCCTGT GGGCAAACCG 

7 01 CCCTCAAACA CGCCTCCAAA GACCTCAACC 

7 51 CAGGCAGGCA ACACCCTGCC CGCCGTCGAA 

801 CAAAGCAGTC GAAGCCGGCT ACGGCGAACA 

851 TGAAATTGGC AGAACACTGA 

This corresponds to the amino acid sequence <SEQ ID 368; ORF 105.a>: 

al05 . pep 

1 MSANEYTQIG WIGLGQMGLP MVTRLLDGGI EVGVYNRSPD KTAPISAKGA 

51 KVYGNTAELV RDY PVIFLMV SDYAAVCDIL NGVRDGLAGK IIVNMSTISP 

101 TEN LAVKAL V EAAGGQFAEA PVSGSVGPAT NGTLLILFGG SEAVLNPLQK 

151 IFSLVGKKTF HFGDVGKGSG AKLVLNSLLG IFGEAYSEAM LMARQFGIDT 

201 DTIVEAIGGS AMDSPMFQTK KS LWANRE FP PAFALKHASK DLNLAVKELE 

2 51 QAGNTLPAVE TVAASYRKAV EAGYGEQDVS GVYLKLAEH* 



250 260 270 280 289 

D LN LAV KE L E Q AGNT L P AVE TV AAS YR KAVE AG YG E QD VS G VY LKLAE H 

MIIIMIIIIIIIIIIIIIIIIIIMIMIIIM lllllllllllll 

DLNLAVKELEQAGNTLPAVETVAASYRKAVEAGYGTQDVSGVYLKLAEH 
240 250 260 270 280 



TGGATAGGCT 
CGGCGGCATC 
CCATCTCCGC 
CGCGACTATC 
CG AC AT C C T G 
ACATGAGCAC 
GAAGCCGCAG 
GCCCGCCACC 
TTTTAAACCC 
CATTTCGGCG 
GCTCTTGGGC 
GGCAGTTCGG 
GCAATGGACT 
CGAATTCCCA 
TCGCCGTCAA 
ACCGTTGCTG 
GGACGTTTCC 



TAGGGCAAAT 
GAAGTCGGCG 
CAAAGGCGCA 
CCGTCATTTT 
AACGGAGTCC 
CATCTCCCCG 
GCGGACAGTT 
AACGGCACGC 
GCTGCAAAAA 
ATGTCGGCAA 
ATTTTCGGCG 
CATCGATACC 
CGCCCATGTT 
CCCGCCTTCG 
AGAGCTTGAA 
CCAGCTACCG 
GGCGTTTACC 



ml05/al05 96.5% identity in 289 aa overlap 

10 20 30 40 50 60 

ml05 . pep MSANEYAQIGWIGLGQMGLPMVTRLLDGGIEVGVYNRSPDKTAPISAKGAKVYGNTAELV 
I I M I t : I I I I I I I I M I t I I I I I I I I I I I ! I M I i I I I II I I I M It I I I I 1 I I I I I I I 
al05 MSANEYTQIGWIGLGQMGLPMVTRLLDGGIEVGVYNRSPDKTAPISAKGAKVYGNTAELV 

10 20 30 40 50 60 

70 80 90 100 110 119 

mlOS.pep RD Y PV I FLMVS DYAAVCD I LNGVRDGLAGKI I VNMST I S PTENLAVKALVEAAG -QFAE A 

I I I I M I i I I I I M I I I I I M I I I I I I I I I M M I I i I I I I I I I I I I I I I I I I I 1 I IE I 
a 105 RDYPV I FLMVS DYAAVCDI LNGVRDGLAGKI I VNMST I SPTENLAVKALVEAAGGQFAEA 

70 80 90 100 110 120 

120 130 140 150 160 17 0 17 9 

ml05 . pep PVSGSVGPATNGTLLILFGGSEAVLNPLQKIFSLVGKKT FHFGDVGKGSG AKLVLNSLLG 

I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I! I M I I I I M I II II M I 
al05 PVSGSVGPATNGTLLILFGGSEAVLNPLQKIFSLVGKKT FH FGDVGKGSG AKLVLNSLLG 

130 140 150 160 170 180 

180 190 200 210 220 230 
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IFGDV-QRXMLMARQFGIDTDTIVEAIGDSAMDSPMFQTKKSLWANREFPXAFALKHASK 
Ml:: : : I I I I I I I I I I I I M I I M I I I I M I t II II I I I I I If I II MINIMI 
IFGEAYSEAML^4ARQFGIDTDTIVEAIGGSAMDSPMFQTKKSLWANREFPPAFALKHASK 
190 200 210 220 230 240 

240 250 260 270 280 

DLNLAVKELEQAGNTLPAVETVAASYRKAVEAGYGEQDVSGVYLKLAEHX 
I I I I M I M II II I I M I I I I I I M M I I I I I M M I II I I I M I I I I M 
DLNLAVKELEQAGNTLPAVETVAASYRICAVEAGYGEQDVSGVYLKLAEHX 
250 260 270 280 290 



The following partial DNA sequence was identified in TV. gonorrhoeae <SEQ ID 369>: 

gl05-l . seq 

1 ATGTCCGCAG AAACATACAC ACAAATCGGC TGGGTAGGCT TAGGGCAAAT 

51 GGGTCTGCCT ATGGTAACGC GGCTCTTGGA CGGCGGCATC GAAGTCGGCG 

101 TATACAACCG CTCGCCCGAC AAAACTGCCC CCATCTCCGC CAAAGGAGCA 

151 AAAGTTTACG GCAGCACCGC CGAACTCGTC CGCGCCTGCC CCGTCATTTT 

201 CCTGATGGTT TCCGACTATG CCGCCGTGTG CGACATCCTG AACGGAGTCC 

251 GCGACGGATT GGCCGGCAAA ATCATCGTCA ACATGAGCAC CATCTCCCCG 

301 ACCGAAAACC TCGCCGTCAA AGCACTTGTC GAAGCCGCAG GCGGACAGTT 

351 TGCCGAAGCA CCCGTTTCCG GATCGGTCGG ACCCGCCACC AACGGCACAC 

4 01 TGCTGATTCT GTTCGGCGGC AGCGAAGCCG TTTTAAACCC GCTGCAAAAA 

4 51 ATATTTTCCC TTGTCGGCAA AAAAACCTTC CATTTCGGCG ATGTCGGCAA 

501 AGGCTCGGGC GCGAAACTCG TCTTGAACTC GCTCTTAGGC ATTTTCGGCG 

551 AAGCGTACAG CGAAGCGATG CTGATGGCGC GGCAGTTCGG CATCGATACC 

601 GACACCATCG TCGAAGCCAT CGGCGGCTCG GCAATGGACT CGCCTATGTT 

651 TCAAACAAAA AAATCACTAT GGGCAAACCG TGAGTTCCCC CCTGCCTTTG 

701 CACTCAAACA CGCTTCCAAA GACCTTAACC TCGCCGTCAA AGAGCTTGAA 

751 CAGGCAGGCA ACACCCTGCC CGCCGTCGAA ACCGTTGCTG CCAGCTACCG 

801 CAAAGCAGTT GAAGCCGGCT ACGGCGAACA GGACGTTTCC GGCGTTTACC 

8 51 TGAAATTGGC AGAACACTGA 

This corresponds to the amino acid sequence <SEQ ED 370; ORF 105-1 .ng>: 

gl05-l .pep 

1 MSAETYTQIG WVGLGQMGLP MVTRLLDGGI EVGVYNRSPD KTAPISAKGA 

51 KVYGSTAELV RACP VIFLMV SDYAAVCPIL NGVRDGLAGK IIVNMSTISP 

101 TENLAVKALV EAAGGQFAEA PVSGSVGPAT NGTLLILFGG SEAVLNPLQK 

151 IFSLVGKKTF HFGDVGKGSG AKLVLNSLLG IFGEAYSEAM LMARQFGIDT 

201 DTIVEAIGGS AMDSPMFQTK KSLWANREFP PAFALKHASK DLNLAVKELE 

251 QAGNTLPAVE TVAASYRKAV EAGYGEQDVS GVYLKLAEH* 

The following partial DNA sequence was identified in TV. meningitidis <SEQ ID 371 >: 

ml05-l . seq 

1 ATGTCCGCAA ACGAATACGC ACAAATCGGC TGGATAGGCT TAGGGCAAAT 

51 GGGTCTGCCT ATGGTAACGC GGCTCTTGGA CGGCGGCATC GAAGTCGGCG 

101 TATACAACCG CTCGCCCGAC AAAACTGCCC CCATCTCCGC CAAAGGCGCA 

151 AAAGTTTACG GCAACACCGC CGAACTCGTC CGCGACTATC CCGTCATTTT 

201 CCTGATGGTT TCCGACTATG CCGCCGTGTG CGACATCCTG AACGGAGTCC 

251 GCGACGGATT GGCCGGCAAA ATCATCGTCA ACATGAGCAC CATCTCCCCG 

301 ACCGAAAACC TCGCCGTCAA AGCACTTGTC GAAGCCGCAG GCGGACAGTT 

351 TGCCGAAGCA CCCGTTTCCG GATCGGTCGG GCCCGCCACC AACGGCACGC 

4 01 TGCTGATTCT GTTCGGCGGC AGCGAAGCcG TTTTAAACCC GCTGCAAAAA 

4 51 ATATTTTCCC TCGTCGGCAA AAAAACCTTC CATTTCGGCG ATGTCGGCAA 

501 AGGTTCGGGC GCGAAACTCG TCTTGAACTC GCTCTTGGGC ATTTTCGGCG 

551 AAGCGTACAG CGAAnCGATG CTGATGGCGC GGCAGTTCGG CATCGATACC 

601 GACACCATCG TCGAAGCCAT CGGsGACTCG GCAATGGACT CGCCCATGTT 

651 CCAAACCAAA AAATCCCTGT GGGCAAACCG CGAATTCCCG CCCGCCTTCG 

7 01 CCCTCAAACA CGCCTCCAAA GACCTCAACC TCGCCGTCAA AGAGCTTGAA 
751 CAGGCAGGCA ACACCCTGCC CGCCGTCGAA ACCGTTGCTG CCAGCTACCG 
801 CAAAGCAGTC GAAGCCGGCT ACGGCGAACA GGACGTTTCC GGCGTTTACC 

8 51 TGAAACTGGC AGAACACTGA 

This corresponds to the amino acid sequence <SEQ ID 372; ORF 105-1>: 

ml05-l .pep 

1 MSANEYAQIG WIGLGQMGLP MVTRLLDGGI EVGVYNRSPD KTAPISAKGA 

51 KVYGNTAELV RDYPVI FLMV SDYAAVCPIL NGVRDGLAGK IIVNMSTISP 



mlOS . pep 
al05 

ml05 . pep 
al05 
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101 TENLAVKALV EAAGGQFAEA PVSGSVGPAT NGTLLILFGG SEAVLNPLQK 

151 IFSLVGKKTF HFGDVGKGSG AKLVLNSLLG IFGEAYSEXM LMARQFGIDT 

2 01 DTI VEAIGDS AMDSPMFQTK KSLWANREFP PAFALKHASK DLNLAVKELE 

2 51 QAGNTLPAVE TVAASYRKAV EAGYGEQDVS GVYLKLAEH* 

ml05-l/gl05-l 96.9% identity in 289 aa overlap 

10 20 30 40 50 60 

ml05-l -pep MSANEYAQIGW IGLGQMGLPMVTRLLDGGI EVGVYNRSPDKTAPISAKGAKVYGNTAELV 
Mi: t : I i I I : I I I I I I t I I I I I i I I I I I I I I I M t ! I I j I I 1 I I I I 1 I I I I I : I I I I I 
gl05-l MSAETYTQI GWVGLGQMGLPMVTRLLDGGIEVGVYNRSPDKTAPI SAKGAKVYGSTAELV 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml05-l .pep RDYPVIFLMVSDYAAVCDILNGVRDGLAGKIIVNMSTISPTENLAVKALVEAAGGQFAEA 
I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I M I I I I I I I I 
g 1 0 5- 1 RACPVI FLMVSDYAAVCDI LNGVRDGLAGKI I VNMSTI S PTENLAVKALVEAAGGQFAEA 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml 05- 1 . pep PVSGSVGPATNGTLLILFGGSEAVLNPLQKI FSLVGKKTFHFGDVGKGSGAKLVLNSLLG 
I | | | I I I I I I I I I I I I I I I I I I I I I II I I i I I I I I I I I I I I I I I I I i I I I I I I I I I I I M 
gl05-l PVSGSVGPATNGTLLILFGGSEAVLNPLQKI FSLVGKKTFHFGDVGKGSGAKLVLNSLLG 

130 140 150 160 170 180 



190 200 210 220 230 240 

mlOS-l.pep I FGEAYSEXMLMARQFGIDTDT I VEAIGDS AMDS PMFQTKKSLWANREFP PAFALKHASK 
I | | | | I | I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I t I I I I I I I I I I I 
gl05-l I FGEAYSEAMLMARQFGIDTDT I VEAIGGS AMDS PMFQTKKSLWANREFP PAFALKHASK 

190 200 210 220 230 240 



250 260 270 280 290 

mlOS-l.pep DLNLAVKELEQAGNTLPAVETVAASYRKAVEAGYGEQDVSGVYLKLAEHX 
I I I I I I I I I 1 I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I 1 I I II I I I I I I 
g 105-1 DLNLAVKELEQAGNTLPAVETVAASYRKAVEAGYGEQDVSGVYLKLAEHX 
250 260 270 280 290 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 373>: 

al05-l . seq 

1 ATGTCCGCAA ACGAATACAC ACAAATCGGC TGGATAGGCT TAGGGCAAAT 

51 GGGTCTGCCT ATGGTAACGC GGCTCTTGGA CGGCGGCATC GAAGTCGGCG 

101 TATACAACCG CTCGCCCGAC AAAACTGCCC CCATCTCCGC CAAAGGCGCA 

151 AAAGTTTACG GCAACACCGC CGAACTCGTC CGCGACTATC CCGTCATTTT 

201 CCTGATGGTT TCCGACTATG CCGCCGTGTG CGACATCCTG AACGGAGTCC 

2 51 GCGACGGATT GGCCGGCAAA ATCATCGTCA ACATGAGCAC CATCTCCCCG 

301 ACCGAAAACC TCGCCGTCAA AGCACTTGTC GAAGCCGCAG GCGGACAGTT 

351 TGCCGAAGCA CCCGTTTCCG GATCGGTCGG GCCCGCCACC AACGGCACGC 

4 01 TGCTGATTCT GTTCGGCGGC AGCGAAGCCG TTTTAAACCC GCTGCAAAAA 

4 51 ATATTTTCCC TCGTCGGCAA AAAAACCTTC CATTTCGGCG ATGTCGGCAA 

501 AGGTTCGGGC GCGAAACTCG TCTTGAACTC GCTCTTGGGC ATTTTCGGCG 

551 AAGCGTACAG CGAAGCGATG CTGATGGCGC GGCAGTTCGG CATCGATACC 

601 GACACCATCG TCGAAGCCAT CGGCGGCTCG GCAATGGACT CGCCCATGTT 

651 CCAAACCAAA AAATCCCTGT GGGCAAACCG CGAATTCCCA CCCGCCTTCG 

701 CCCTCAAACA CGCCTCCAAA GACCTCAACC TCGCCGTCAA AGAGCTTGAA 

751 CAGGCAGGCA ACACCCTGCC CGCCGTCGAA ACCGTTGCTG CCAGCTACCG 

801 CAAAGCAGTC GAAGCCGGCT ACGGCGAACA GGACGTTTCC GGCGTTTACC 

851 TGAAATTGGC AGAACACTGA 

This corresponds to the amino acid sequence <SEQ ID 374; ORF 105-1. a>: 

al05-l.pep 

1 MSANEYTQIG WIGLGQMGLP MVTRLLDGGI EVGVYNRSPD KTAPISAKGA 

51 KVYGNTAELV RDY PVIFLMV SDYAAVCDIL NGVRDGLAGK I I VNMSTI SP 

101 TENLAVKALV EAAGGQFAEA PVSGSVGPAT NGTLLILFGG SEAVLNPLQK 

151 IFSLVGKKTF HFGDVGKGSG AKLVLNSLLG I FGEAYSEAM LMARQFGIDT 

201 DTI VEAIGGS AMDSPMFQTK KSLWANREFP PAFALKHASK DLNLAVKELE 

251 QAGNTLPAVE TVAASYRKAV EAGYGEQDVS GVYLKLAEH* 

al05-l/ml05-l 99.0% identity in 289 aa overlap 

10 20 30 40 50 60 

a 105-1 - pep MSANEYTQIGWIGLGQMGLPMVTRLLDGGIEVGVYNRSPDKTAPISAKGAKVYGNTAELV 
I I I I I I : I J I I J II I I I I I I M I I I I I I I I I I I I I | I I I I M I I I I I I I I I I I I I I I I I I 
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ml 0 5-1 MSANEYAQIGWIGLGQMGLPMVTRLLDGGI EVGVYNRSPDKTAPISAKGAKVYGNTAELV 

10 20 30 40 50 60 

70 80 90 100 . 110 120 

a 105-1 .pep RDYPVIFLMVSDYAAVCDILNGVRDGLAGKIIVNMSTI SPTENLAVKALVEAAGGQFAEA 
} I I I I I I I I I I I I I M I i f I I I I M I I I 1 M I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 
ml 05- 1 RDYPVI FLMVSDYAAVCDILNGVRDGLAGKI I VNMSTI SPTENLAVKALVEAAGGQFAEA 

70 80 90 100 110 120 

130 140 150 160 170 180 

a 105-1 . pep PVSGSVGPATNGTLLILFGGSEAVLNPLQKIFSLVGKKTFHFGDVGKGSGAKLVLNSLLG 
I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 
ml 05-1 PVSGSVGPATNGTLLILFGGSEAVLNPLQKIFSLVGKKTFHFGDVGKGSGAKLVLNSLLG 

130 140 150 160 170 180 

190 200 210 220 230 240 

al05-l .pep IFGEAYSEAMLMARQFGIDTDTIVEAIGGSAMDSPMFQTKKSLWANREFPPAFALKHASK 
| I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I 
ml05-l I FGEAYSEXMLMARQFGIDTDTI VEAIGDSAMDSPMFQTKKSLWANREFPPAFALKHASK 

190 200 210 220 230 240 

250 260 270 280 290 

al05-l . pep DLNLAVKELEQAGNTLPAVETVAASYRKAVEAGYGEQDVSGVYLKLAEHX 
I I I I M I I I I I I 1 I II I I 1 I I I I M M I I I I I I I I I I I I I M I I I I I I I I 
ml05-l DLNLAVKELEQAGNTLPAVETVAASYRKAVEAGYGEQDVSGVYLKLAEHX 
250 260 270 280 290 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 375>: 

gl07 . seq 

1 ATGGTATTAA CCTTTATTTG GGCAACCGGC GGCCTGGTTG CCGATGCCAA 

51 ACCGTCCGTC CTCTTCGACA CCCAAGCCCC CGTCGGCACC GGCTACTGGA 

101 TTTACGCCGC CACCGCCCTG CCCGTCTGCC TCGCTTCCTT CGGCTTCCAC 

151 GGCAACGTTT CCAGCCTGCT CAAATACTTT AAAGG CGACG cgcCcaaagt 

201 GgCGAaATCc atctggGcag gtacattggT TGCCttggta atttacgtcc 

251 TCTggcaaac cgccatCcaa agcaaccTGC cgcgcaacga gttcgcCCCc 

301 gtgattgccg ccgagaggca actCTCCGTC CTgaatgaaa cccTGtccaa 

351 attcgcccaa accggcgata tggataAaat attgtcccta tttccctaca 

401 tggcaatcgc cacctccttt ttaggcgTAA Ccttaggcct gtttgacaac 

4 51 atcgccggac atcttcaaat ggaacgacag tatgtccggg cggcaccaaa 

501 accgtcgcgc tga 
This corresponds to the amino acid sequence <SEQ ID 376; ORF 107.ng>: 

gl07 .pep 

1 MVLTFIWATG GLVADA KPSV LFDTQAPVGT GYWIYAATAL PVCLASFGFH 

51 GNVSSLLKYF KGDAPKVAKS I W AGTLVALV IYVLWQTAI Q SNLPRNEFAP 

101 VIAAERQLSV LNETLSKFAQ TGDMD KILSL FPYMAIATSF LGV TLGLFDN 

151 IAGHLQMERQ YVRAAPKPSR * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 377>: 

ml07 . seq 

1 ATGGTATTGA CCTTTATTTG GGCGGCCGGC GGGCTGATTG CCGATGCCAA 

51 GCCGTCCGTC CTCTTCGATA CCCAAGCCCC CGCCGGCACA AACTACTGGA 

101 TTTACGCCGs CACCGCCCTG CCCGTCTGCC TCGCTTCCTT CGGCTTCCAC 

151 GGCAACGTCT CCAGCCTGCT CAAATACTTT AAAGG CGACG CGCCCAAAGT 

201 GGCTAAATCC ATCTGGACGG GCACACTGAT TGCGCTGGTA ATTTACGTCC 

251 TCTGGCAAAC CGCCATCCAA GGCAACCTGC CGCGCAACGA GTTCGCCCCC 

3 01 GTCATCGCCG CCGAAGGGCA AGTCTCCGTC CTCATCGAAA CCCTGTCCAA 

3 51 ATTCGCCCAA ACCGGCAATA TGGACAAAAT ATTGTCCCTG TTTTCCTATA 

401 TGGCGATCGC CACCTCGTTT TTAGGCGTAA CGCTCGGACT CTTCGACTAC 

451 ATCGCCCATC TTCAAATGGA ACGACAGCAT CTCCGGgCCG CACCAAAACC 

501 GCCGCGCTGA 

This corresponds to the amino acid sequence <SEQ ID 378; ORF 107>: 

ml 07 . pep . . 

1 MVLTFIWAAG GLIADA KPSV LFDTQAPAGT NYWIYAXTAL PVCLASFGFH 

51 GNVSSLLKYF KGDAPKVAKS IWTGTLIALV IYVLWQTAIQ GNLPRNEFAP 
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101 VIAAEGQVSV LIETLSKFAQ TGNMD KILSL FSYMAIATSF LGV TLGLFDY 
151 IAHLQMERQH LRAAPKPPR* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 107 shows 89.4% identity over a 170 aa overlap with a predicted ORF (ORF 107.ng) 
from N. gonorrhoeae: 

ml07/gl07 

10 20 30 40 50 60 

MVLT F I WAAGGL I ADAKPS VLFDTQAP AGTNYW I YAXTAL P VCLAS FGFHGNVSSLLKYF 

I 1 1 I t M I = I I I = I I i It ! I M 1 1 1 1 I = I I = M I II 1 1 I II I I M I I I I I 1 1 I 1 1 I I II 

MVLTFIWATGGLVADAKPSVLFDTQAPVGTGYWIYAATALPVCLAS FGFHGNVSSLLKYF 
10 20 30 40 50 60 

70 80 90 100 110 120 

KGDAPKVAKSIWTGTLIALVIYVLWQTAIQGNLPRNEFAPVIAAEGQVSVLIETLSKFAQ 

I I I | | II I II I h I I I = I II I I I I I I I I I I : I I I I I I I M I I I I I hill MINIM 
KGDAPKVAKS I WAGTLVALVI YVLWQTAI QSNLPRNE FAPVI AAERQLS VLNETLS KFAQ 
70 80 90 100 110 120 



ml 07 . pep 
gl07 

ml 07 . pep 
gl07 



130 140 150 160 170 

ml07 . pep TGNMDKILSLFSYMAIATS FLGVTLGLFDYIA-HLQMERQHLRAAPKPPR 

MMIIIMM 1 1 I 1 1 I 1 1 1 1 1! I M I I II IIIIMhMIIIII I 

gl07 TGDMDKI LSDFPYMAI ATS FLGVTLGLFDNI AGHLQMERQYVRAAPKPSR 

130 140 150 160 170 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 379> : 

al07 . seq 

CCTTTATTTG GGCAACCGGC GGCCTGATTG CCGATGCCAA 
CTCTTCGACA CCCAAGCCCC TACCGGCACC AACTACTGGA 
CACCGCCCTG CCCGTCTGCC TTGCGTCATT CGGTTTCCAC 
CCAGCCTGCT CAAATACTTT AAAGGCGACG CGCCCAAAGT 
ATC.TGGACGG GCACACTGAT TGCGCTGGTA ATTTACGTCC 
CGCCATCCAA GGCAACCTGC CGCGCAACGA GTTCGCCCCC 
CCGAAGGGCA AGTCTCCGTC CTGATTGAAA CCCTGTCCAA 
ACCGGCAATA TGGACAAAAT ATTGTCCCTG TTTTCCTATA 
CACCT CGTTT TTAGGCGTAA CGCTCGGACT CTTCGACTAC 
TCTTCAAATG GAACGACAGC GTGTCCGGCC GCACCAAAAC 
ACCTTCCTGC CGCCTCTAAT TTCCTGCCTG CTCTTCCCCA 
TACCGCCATC GGCTACGTCG GCCTGGCGGC AACCGTCTGG 
TCCCCGCCAT GCTGCTCTAC CGTTCGCGCA AAAAATTCGG 
ACCTATAAAG TTTACGGCGG CTTGTGGCTG ATGGTTTGGG 
CGGCATCGTC AACATCGCCG CACAGGTATT GAGCCAAATG 
CCGTATTTAA AGGATAA 

This corresponds to the amino acid sequence <SEQ ID 380; ORF 107.a>: 
al07 .pep 

1 MVLTFIWATG GLIADAKLPV LFDTQAPTGT NYWIYVATAL P VCLAS FG FH 

51 GNVSSLLKYF KGDAPKVAKS IWTGTLIALV I YVLWQTAI Q GNLPRNEFAP 

101 VIAAEGQVSV LIETLSKFAQ TGNMDKILSL FSYMAIATSF LGVTLGLFDY 

151 IADIFKWNDS VSGRTKTAAL TFLPPLISCL LFPTGFVTAI GYVGLAATVW 

201 TGI IPAMLLY RSRKKFGAGK TYKVYGGLWL MVWVFLFGIV NIAAQVLSQM 

251 ELVPVFKG* 



1 


ATGGTATTAA 


51 


ACTGCCCGTC 


101 


TTTATGTCGC 


151 


GGCAACGTCT 


201 


GGCTAAATCC 


251 


TCTGGCAAAC 


301 


GTGATTGCCG 


351 


ATTCGCCCAA 


401 


TGGCGATCGC 


451 


ATCGCCGACA 


501 


CGCCGCGCTG 


551 


CCGGCTTTGT 


601 


ACAGGCATCA 


651 


CGCAGGCAAA 


701 


TCTTCCTTTT 


751 


GAACTCGTCC 



ml07/al07 94.8% identity in 154 aa overlap 

10 20 30 40 50 60 

ml07 . pep MVLT FI WAAGGL I AD AKPSVLFDTQAPAGTNYW I YAXTALPVC LAS FGFHGNVSSLLKYF 

M I I I I I I : I I I I I I I I I I It I I i t : I I I I M I : I I I II I I I I I I I I 1 M M I I I I I 
al07 MVLTFIWATGGLIADAKLPVLFDTQAPTGTNYWIYVATALPVCLASFG FHGNVSSLLKYF 

10 20 30 40 50 60 
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70 80 90 100 110 120 

ml07 .pep KGDAPKVAKSIWTGTLIALVIYVLWQTAIQGNLPRNEFAPVIAAEGQVSVLIETLSKFAQ 
f I I I I I I M I I II I I I I M I I I I I t I I. I i I I I I. I I M I I II I I I I I I I I I I I II I It I i I 
a 107 KGDAPKVAKSIWTGTLIALVIYVLWQTAIQGNLPRNEFAPVIAAEGQVSVLIETLSKFAQ 

70 80 90 100 110 120 

130 140 150 160 170 

ml07 .pep TGNMDKILSLFSYMAIATSFLGVTLGLFDYIAHLQMERQHLRAAPKPPRX 

I I I I II II II I I I I II I I M I I I I I I I I I I I I : 
a 107 TGNMDKILSLFSYMAIATSFLGVTLGLFDYIADI FKWNDSVSGRTKTAALTFLPPLISCL 

130 140 150 160 170 180 

al07 LFPTGFVTAIGYVGLAATVWTGI IPAMLLYRSRKKFGAGKTYKVYGGLWLMVWVFLFGIV 

190 200 210 220 230 240 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 381>: 

gl08 . seq 

1 ATGttgccgg gCTTCAACCG GATATTC Aa a cggTTTGCTC CAACACTCGG 

51 AAcggCGCAT AAAACGCCgc ccTTCGCGTT ATCCCGAACG GGGCGGCTAA 

101 TCAGATCCTA TCGCCATAAA AGGCGGGGTT TCAACCGAAA AGGAATTGAG 

151 ATGAATAAAA CCTTGTCTAT TTTGCCGGCG GCAATCTTAC TCGGCGGGTG 

201 CGCCGCCGGC GGCAACACAT TCGGCAGCTT AGACGGCGGC ACGGGTATGG 

251 GTGGCAGCAT CGTCAAAATG ACGGTAGAAA gccAATGCCG TGCGGAATTG 

301 GACAGGCGCA GCGAATGGCG TTTGACCGCG CTGGCGATGA GTGCCGAAAA 

3 51 ACAGGCGGAA TGGGAAAACA AGATTTGCGG CTGCGCTACC GAAGAAGCAC 

401 CTAACCAGCT GACCGGCAAC GATGTGATGC AGATGCTGAa ccagtCCacG 

451 CGCaatcagg cacTtgccgc CCtgaccgTC AAAacggtTT CcgcctgcTT 

501 CAaacgcctg tACCGCTAa 

This corresponds to the amino acid sequence <SEQ ID 382; ORF 108. ng>: 

gl08 .pep 

1 MLPGFNRIFK RFAPTLGTAH KTPPFALSRT GRLIRSYRHK RRGFNRKGIE 

51 MNKT LSILPA AILLGGCAAG G NTFGSLDGG TGMGGSIVKM TVESQCRAEL 

101 DRRSEWRLTA LAMSAEKQAE WENKI CGCAT EEAPNQLTGN DVMQMLNQST 

151 RNQALAALTV KTVSACFKRL YR* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 383>: 

ml08 . seq 

1 ATGTTGCCGG GCTTCAACCG GATATTCAAA CGGTTTGTTC CAACACTCGG 

51 AACGGCGCAT AAAACGCCGC CCTTCGCGTT ATCCCGAACG GGGCGGCTAA 

101 TCAGATTCTA TCGCCATAAA AGGCGGGGTT TCAACCGAAA AGGAATTGAG 

151 ATGAATAAAA CCTTGTCTAT TTTGCCGGTG GCAATCTTAC TCGGCGGCTG 

201 CGCCGCCGGA GGCGGTAACA CATTCGGCAG CTTAGACGGT GGCACAGGCA 

251 TGGGCGGCAG CATCGTCAAA ATGGCGGTTG GGAGCCAATG CCGTGCGGAA 

3 01 TTGGACAAAC GCAGCGAATG GCGTTTGACC GCGCTGGCGA TGAGTGCCGA 

3 51 AAAACAGGCG GAGTGGGAAA ACAAGATTTG CGCTTGCGTC GCCCAAGAAG 

401 CACCCGAACG GATGACCGGC AACGATGTGA TGCAGATGCT GGCTCCGTCC 

451 ACGCGCAATC AGGCACTTGC CGCCCTGACC GCCAAAACGG TTTCCGCCTG 

501 CTTCAAACAC CTGTACCGCT AA 

This corresponds to the amino acid sequence <SEQ ID 384; ORF 108>: 

ml 08 . pep 

1 MLPGFNRIFK RFVPTLGTAH KTPPFALSRT GRLIRFYRHK RRGFNRKGIE 

51 MNKT LSILPV AILLGGCAAG G GNTFGSLDG GTGMGGS I VK MAVGSQCRAE 

101 LDKRSEWRLT ALAMSAEKQA EWENKICACV AQEAPERMTG NDVMQMLAPS 

151 TRNQALAALT AKTVSACFKH LYR* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 108 shows 89.6% identity over a 173 aa overlap with a predicted ORF (ORF 108.ng) 
from N. gonorrhoeae: 

ml08/gl08 
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10 20 30 40 50 60 

ml08 .pep MLPGFNRIFKRFVPTLGTAHKTPPFALSRTGRLIRFYRHKRRGFNRKGIEMNKTLSILPV 

I II I I MM I Ihlll I MINI Ml MINI III I M 1 1 1 M I M I II Ml I I 1 1 I h 

gl0 8 MLPGFl^I FKRFAPTLGTAHICTPPFALSRTGRLI RSYRHKRRGFNRKGI EMNKTLS I LPA 

10 20 30 40 50 60 

70 80 90 100 110 120 

mlO 8 . pep AILLGGCAAGGGNTFGSLDGGTGMGGSIVKMAVGSQCRAEI^KRSEWRLTALiAMSAEKQA 

I I I M I M I I I 1 1 1 I I I 1 1 1 I I I I I I M 11 = I II I M II h 1 1 I M II 1 1 1 1 I I I M I 

gl08 AILLGGCAAGG-NTFGSLDGGTGMGGSIVKMTVESQCRAEI^RRSEWRLTALAMSAEKQA 

70 80 90 100 110 

130 140 150 160 170 

ml08 . pep E W E N K I CACV AQ E AP E RMTGND VMQMLAP S TRNQ ALAAL T AKTVS AC F KH L YRX 

I I M I M : I : : : I I I : : : I I I I I I I I I I M I I I I I M M I I I M I I M I II 
gl08 EWENKICGCATEEAPNQLTGNDVMQMLNQSTRNQALAAIjWKTVSACFKRLYRX 
120 130 140 150 160 170 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 385>: 

al08 . seq 

1 ATGTTGCCGG GCTTCAACCG GAT AT TC AAA CGGTTTGTTC CAACACTCGG 

51 AACGGCGCAT AAAACGCCGC CCTTCGCGTT ATCCCGAACG GGGCGGCTAA 

101 TCAGATTCTA T CGCCAT AAA AGGCGGGGTT TCAACCGAAA AGGAATTGAG 

151 ATGAATAAAA CCTTGTCTAT TTTGCCGGTG GCAATCTTAC TCGGCGGCTG 

201 CGCCGCCGGG GGCGGTAACA CATTCGGCAG CTTAGACGGC GGCACAGGTA 

251 TGGGCGGCAG CATCGTCAAA ATGGCGGTAG AAAGCCAATG CCGTGCGGAA 

301 TTGAACAAAC GCAGCGAATG GCGTTTGACC GCGCTGGCGA TGAGTGCCGA 

351 AAAACAGGCG GAATGGGAAA ACAAGATTTG CGCTTGCGTC GCCCAAGAAG 

4 01 CACCCAACCA GCTGACCGGC AACGATGTGA TGCAGATGCT GGATCCGTCC 

4 51 ACGCGCAATC AGGCACTTGC CGCCCTGACC GCCAAAACGG TTTCCGCCTG 

501 CTTCAAACAC CTGTACCGCT AA 

This corresponds to the amino acid sequence <SEQ ID 386; ORF 108.a>: 

al08 .pep 

1 MLPGFNRIFK RFVPTLGTAH KTPPFALSRT GRLIRFYRHK RRGFNRKGIE 

51 MNKT LSILPV AILLGGCAAG G GNTFGSLDG GTGMGGSIVK MAVESQCRAE 

101 LNKRSEWRLT alamsaekqa EWENKICACV AQEAPNQLTG NDVMQMLDPS 

151 TRNQALAALT AKTVSACFKH lyr* 



ml08/al08 96.5% identity in 173 aa overlap 

10 20 30 40 50 60 

ml08 .pep MLPGFNRIFKRFVPTLGTAHKTPPFALSRTGRLIRFYRHKRRGFNRKGIEMNKTLSILPV 
I I I I I M 1 M I I II I I I M I I It I M I I I I I I I I I I I II M I I 1 I I I I I I I I II M I I i I 
al08 MLPGFNRIFKRFVPTLGTAHKTPPFALSRTGRLIRFYRHKRRGFNRKGIEMNKTLSILPV 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 08 . pep AILLGGCAAGGGNTFGSLDGGTGMGGSIVKMAVGSQCRAELDKRSEWRLTALAMSAEKQA 
M 1 I I I I I I I I I I M I I I I I I I I I I I I I I I I II I I I I I I I : II I I I I I I M I M I I I I I 
al08 AILLGGCAAGGGNTFGSLDGGTGMGGSIVKMAVESOCRAELNKRSEWRLTALAMSAEKQA 

70 80 90 100 110 120 

130 140 150 160 170 

ml08 . pep EWENK I CAC VAQE APERMTGN D VMQMLAPS TRNQAL AALTAKT V S AC FKHL YRX 

I I I I I I I I I I II I I I ::: I I I I I I I I I I I i I I 1! I I I I I I I I I I I I I I I I I M 
al08 EWENKICACVAQEAPNQLTGN DVMQML DPS TRNQALAALT AKTV SAC FKHLYRX 

130 140 150 160 170 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 387>: 
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gl09 . seq 

1 ATGTATTATC GCCGGGTTGT GGGGCTATCC GATGGACTTG GCGATTTGGC 

51 AGCCGGTATT GATCGTAGGC GTATGCTTAC CGCTTTTGGA AGCGGGCATG 

101 GAAATGACGC GCAAAGGCAA AACCACCCAA TCCGCCGCCA TCGTGGTGTT 

151 CTCTTCCGTC TGGTCAATCC GGTTTTCGGC TGGGCGTTGA CGATGCTGTT 

201 GGATAATTTG GGCTTAATCG GCTGCAAAGA ACGCAGCGCG CAATTAGGTT 

251 TTGTCGGACG AGTAT TGAT A CCCGCAGTAG GTTTCTTAAT CTTGTGTGTG 

3 01 GCG AT GGGTG CGGT CGGG AT GCTGCCCGGT ATCCCTCCGT TTTTGGAGCA 

3 51 GTTCAAATCT TTGGGCTAG 

This corresponds to the amino acid sequence <SEQ ID 388; ORF 109.ng>: 

gl09 -pep 

1 MYYRRWGLS DGLGDLAAG I DRRRMLTAFG SGHGNDAQRQ NHPIRRHRGV 

51 LFRLVNPVFG WALTMLLDNL GLIGCKERSA QLGFVGR VLI PAVGFLILCV 

101 AMGA VGMLPG IPPFLEQFKS LG 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 389>: 

ml09 . seq 

1 - ATGTATTATC GCCGGGTTAT GGGGCTATCC GATGGACTTG GCGATTTGGC 

51 AGCCGGTATT GAGCGTAGCC TTGGTCGTAG GCGTATACTT ACCGCTTTTG 

101 GAAGCGGGCA TGGAAATGAC GCGCAAAGGC AAAACCACCC AATCCGCCGC 

151 CATCGTGGTG TTCTCTTCCG CCTTGTCAAT CCGGTTTTCG GCTGGGCGTT 

201 GACGATGCTG TTGGATAATT TGGGCTTAAT CGGCTGCAAA GAGCGCAGTG 

251 CGCAATTAGG TTTCGCCGGA CGCGTGTTGA TACCCGCAGT AGGTTTCTTG 

301 ATCTTGTGTG TGGCGATGGG TGCGGTCGGG ATGCTGCCCG GTATCCCGCC 

351 GTTTTTGGAA CACTTCAAAT CTTTGGGCTA G 

This corresponds to the amino acid sequence <SEQ ID 4; ORF 109>: 

ml 09 . pep 

1 MYYRRVMGLS DGLGDLAAGI ERSLGRRRIL TAFGSGHGND AQRQNHPIRR 

51 HRGVLFRLVN PVFGWALTML LDNLGLIGCK ERSAQLGFAG R VLIPAVGFL 

101 I LCVAMGAV G MLPGIPPFLE HFKSLG* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N, gonorrhoeae 

ORF 109 shows 92.9% identity over a 126 aa overlap with a predicted ORF (ORF 109.ng) 
from N. gonorrhoeae: 

ml09/gl09 

10 20 30 40 50 60 

ml09 . pep MYYRRVMGLSDGLGDLAAGI ERSLGRRRI LTAFGSGHGNDAQRQNHPI RRHRGVLFRLVN 

I I ! I I M ; I I II I I I IIIM : hi III I ! iill MINI ,1 I I MM M Ml I I 

g 1 0 9 MYYRRWGLS DGLGDLAAG IDR RRMLTAFGSGHGNDAQRQNHP I RRHRGVL FRLVN 

10 20 30 40 50 



70 80 90 100 110 120 

ml09 . pep PVFGWALTMLLDNLGLIGCKERSAQLGFAGRVLI PAVGFLILCVAMGAVGMLPGI PPFLE 

I I IIMMMM Ml MMM MMM MMMIMIMMM MUM MIM MMM 

gl09 PVFGWALTMLLDNLGLIGCKERSAQLGFVGRVLI PAVGFLILCVAMGAVGMLPGI PPFLE 

60 70 80 90 100 110 



ml09.pep HFKSLGX 

MINI 

gl09 QFKSLGX 
120 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 391>: 

al09 . seq 

1 ATGTATTATC GCCGGGTTGT GGGGCTATCC GATGGACTTG GCGATTTGGC 

51 AGCCGGTATT GAGCGTAGCC TTGGTCGTAG GCGTATACTT ACCGCTTTTG 

101 GAAGCGGGCA TGGAAATGAC GCGCAAAGGC AAAACCACCC AATCCGCCGC 

151 CACCGTGGTG TTCTCTTCCG CTTGGTCAAT CCGGTTTTCG GCTGGGCGTT 
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201 GACGATGCTG TTGGATAATT TGGGCTTAAT CGGCTGCAAA GAGCGCAGCG 

2 51 CGCAATTAGG TTTCACCGGA CGCGTATTGA TACCCGTAGT AGGTTTCTTG 
301 ATCTTGTGTG TGGCGATGGG TGCGGTCGGG ATGCTGCCCG GTATCCCGCC 

3 51 "GTTTTTGGAG CACTTCAAAT CTTTGGGCTA G 

This corresponds to the amino acid sequence <SEQ ID 392; ORF 109>: 
al09 .pep 

1 MYYRRVVGLS DGLGDLAAGI ERSLGRRRIL TAFGSGHGND AQRQNHPIRR 

51 HRGVLFRLVN PVFGWALTML LDNLGLIGCK ERSAQLGFTG R VLIPWGFL 

101 ILCVAMGAVG MLPGIPPFLE HFKSLG* 



ml09/a!09 97.6% identity in 126 aa overlap 

10 20 30 40 50 60 

ml09 . pep MYYRRVMGLSDGLGDLAAGIERSLGRRRILTAFGSGHGNDAQRQNHPIRRHRGVLFRLVN 
I I I I I I : I I I I I I I I I I I I ! I I I I I I I I I I I I I I I II M I I It I I I M I II I I I I i I I I I 
al09 MYYRRWGLSDGLGDLAAGIERSLGRRRILTAFGSGHGNDAQRQNHPIRRHRGVLFRLVN 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml09 . pep PVFGWALTMLLDNLGLIGCKERSAQLGFAGRVLIPAVGFLILCVAMGAVGMLPGIPPFLE 
I I I I I I I I I I I I I I I I I I I I I ! I I I I I I : I I I I I i : M I I I I I I I I M I M t I N I I I 1 I 
al0 9 PVFGWALTMLLDNLGLIGCKERSAQLG FTGRVLI PWGFLILCVAMGAVGMLPGI PPFLE 

70 80 90 100 110 120 

ml 0 9. pep HFKSLGX 
I I 1 I I I I 

al09 HFKSLGX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 393>: 

gill . seq 

1 ATGCCGTCTG AAACACGCCT GCCGAACCTT ATCCGCGCCT TGATATTTGC 

51 CCTGGGTTTC ATCTTCCTGA ACGCCTGTTC GGaacaaacC GCGCAaaccg 

101 TTACCCTGCA AGGCGAAACG ATGGGTACGA CCtATACCGT CAAATACCTT 

151 TCAAATAATC GGGACAAACT CCCCTCCCCT GCC AAAATAC AAAAGCGCAT 

2 01 TGATGATGCG CTTAAAGAAG TCAACCGGCA GATGtCCaCC TACCAGACCG 

2 51 ATTCCGAAAT CAGCCGGTTt atacagacan atgctggaga gctcttcgcg 
301 tntcatgcag nttctataac tgattccgcc gaagactgtc tgcctaatac 

3 51 gcctatctca tcggcgctct ga 

This corresponds to the amino acid sequence <SEQ ID 394; ORF 111 .ng>: 

gill .pep 

1 MPSETRLPNL IRALIFALGF I FLNA CSEQT AQTVTLQGET MGTTYTVKYL 

51 SNNRDKLPSP AKIQKRIDDA LKEVNRQMST YQTDSEISRF IQTAGELFAH 

101 AS ITDSAEDC LPNTPISSAL * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 395>: 

mill . seq 

1 ATGCCGTCTG AAACACGCCT GCCGAACTTT ATCCGCGTCT TGATATTTGC 

51 CCTGGGTTTC ATCTTCCTGA ACGCCTGTTC GGAACAAACC GCGCAAACCG 

101 TTACCCTGCA AGGCGAAACG ATGGGCACGA CCTATAyCGT CAAATACCTT 

151 TCAAATAATC GGGACAAACT CCCCTCACCT GCCGAAATAC AwAAACGCAT 

201 CGATGACGCG CTTAAAGAAk TCAAC CGGyA GATGTCCACC TATCAGCCCG 

251 ACTCCGAAAT CAGCCGGTTC AACCAACACA CAGCCGGCAA GCCCCTCCGC 

301 ATTTCAAGCG ACTTCGCACA CGTTACTGCC GAAGCCGTCC GCCTGAACCG 

3 51 CCTGACACAC GGCGCGCTGG ACGTAACCGT CGGCCCCTTG GTCAACCTTT 

4 01 GGGGATT CGG CCCCGACAAA TCCGTTACCC GTGAACCGTC GCCGGAACAA 
4 51 ATCAAACAGG CGGCATCTTA TACGGGCATA GACAAAATCA TTTTGAAACA 
501 AGGCAAAGAT TACGCTTCCT TGAGCAAAAC CCACCCCAAG GCCTATTTGG 
551 ATTTATCTTC GAT TGCC AAA GGCTTCGGCG TTGATAAAGT TGCGGGCGAA 
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601 CTGGAAAAAT ACGG C ATT C A AAATTAT CTG GTCGAAATCG GCGGCGAGTT 

651 GCACGGCAAA GGCAAAAACG CGCGCGGCGA ACCGTGGCGC ATCGGTATCG 

701 AGCAGCCCAA TATCGTCCAA GGCGGCAATA CG C AG ATT AT CGTCCCGCTG 

751 AACAACCGTT CGCTTGCCAC TTCCGGCGAT TACCGTATTT TCCACGTCGA 

8 01 TAAAAACGGC AAACGCCTCT CC CATAT CAT CAACCCGAAC AACAAACGAC 

8 51 CCATCAGCCA CAACCTCGCC TCCATCAGCG TGGTCGCAGA CAGTGCGATG 

901 ACGGCGGACG GCTTGTCCAC AGGATTATTC GTATTGGGCG AAAC CGAAGC 

951 CTTAAAGCTG GCAGAGCGCG AAAAACTCGC TGTTTTCCTG ATTGTCAGGG 

10 01 ATAAAGGCGG cTACCGCACC GCCATGTCTT CCGAATTTGA AAAACTGcTC 

10 51 CGCTAA 

This corresponds to the amino acid sequence <SEQ ID 396; ORF 1 1 1>: 

mill. pep 

1 MPSETRLPNF IRVLIFALGF IFLNA CSEQT AQTVTLQGET MGTTYXVKYL 

51 SNNRDKLPSP AEIXKRIDDA LKEXNRXMST YQPDSEI SRF NQHTAGKPLR 

101 I S SDFAHVTA EAVRLNRLTH GALDVTVGPL VNLWGFGPDK SVTREPSPEQ 

151 IKQAASYTGI DKI ILKQGKD YASLSKTHPK AYLDLSSIAK GFGVDKVAGE 

201 LEKYGIQNYL VEIGGELHGK GKNARGEPWR IGIEQPNIVQ GGNTQTIVPL 

251 NNRSLATSGD YRIFHVDKNG KRLSHIINPN NKRPI SHNLA SISWADSAM 

3 01 TADGLSTGLF VLGETEALKL AEREKLAVFL IVRDKGGYRT AMSSEFEKLL 
351 R* 

ORF 1 1 1 shows 88.7% identity over a 97 aa overlap with a predicted ORF (ORF 1 1 1 .ng) 
from N. gonorrhoeae: 

mill .pep/gill .pep 

10 20 30 40 50 60 

mill . pep MPSETRLPNFIRVLIFALGFIFUJACSEQTAQTVTLQGETMGTTYXVKYLSNNRDKLPSP 

1 1 I M I I M = I i = 1 1 1 1 1 1 i 1 1 1 1 i M M M i 1 1 1 1 1 1 1 ! I ! I I f = 1 1 1 1 It 1 1 1 1 I 1 1 1 

gill MPSETRLPNLI RALI FALGFI FLNACSEQTAQTVTLQGETMGTTYTVKYLSNNRDKLPS P 

10 20 30 40 50 60 

70 80 90 100 110 120 

mill . pep AE I XKR I DDALKEXNRXMSTYQPDSE I SRFNQHTAGKPLRI S S DFAHVTAEAVRLNRLTH 

hi IIIIIMM 11 Mill 1111(11 I :|h 
gill AKIQKRIDDALKEVNRQMSTYQTDSEISRFIQTXAGELFAXHAXSITDSAEDCLPNTPIS 

70 80 90 100 110 120 

130 140 150 160 170 180 

mill . pep GALDVTVGPLVNLWGFGPDKSVTREPSPEQIKQAASYTGIDKI ILKQGKD YASLSKTHPK 

gill SALX 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 397>: 

alll . seq 

1 ATGCCGTCTG AAACACGCCT GCCGAACTTT ATCCGCACCT TGATATTTGC 

51 CCTGAGTTTT ATCTTCCTGA ACGCCTGTTC GGAACAAACC GCGCAAACCG 

101 TTACCCTGCA AGGTGAAACG ATGGGCACGA CCTATACCGT CAAATACCTT 

151 TCAAATAATC GGGACAAACT CCCCTCACCT GCCGAAATAC AAAAGCGCAT 

201 CGATGACGCG CTTAAAGAAG TCAACCGGCA GATGTCCACC TATCAGCCCG 

2 51 ACTCCGAAAT CAGCCGGTTC AACCAACACA CAGCCGGCAA GCCCCTCCGC 

301 ATTTCAAGCG ACTTCGCACA CGTTACTGCC GAAGCCGTCC ACCTGAACCG 

351 CCTGACACAC GGCGCGCTGG ACGTAACCGT CGGCCCCTTG GTCAACCTTT 

4 01 GGGGATTCGG CCCCGACAAA TCCGTTACCC GTGAACCGTC GCCGGAACAA 
4 51 ATCAAACAAG CAGCATCTTA TACGGGCATA G AC AAAAT C A TTTTGAAACA 
501 AGGCAAAGAT TACGCTTCCT TGAGCAAAAC CCACCCCAAG GCCTATTTGG 
551 ATTTATCTTC GATTGCCAAA GGCTTCGGCG TTGATAAAGT TGCGGGCGAA 
601 CTGGAAAAAT ACGGCATTCA AAATTATCTG GTCGAAATCG GCGGCGAGTT 
651 GCACGGCAAA GGCAAAAACG CGCGCGGCGA ACCTTGGCGC ATCGGCATCG 
701 AACAGCCCAA CATCGTCCAA GGCGGCAATA C G C AG AT T AT CGTCCCGCTG 
751 AACAACCGTT CGCTTGCCAC TTCCGGCGAT TACCGTATTT TCCACGTCGA 
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801 TAAAAGCGGC AAACGCCTCT CCCATATCAT TAATCCGAAC AACAAACGAC 

8 51 CCATCAGCCA CAACCTCGCC TCCATCAGCG TGGTCGCAGA CAGTGCGATG 

901 ACGGCGGACG GCTTGTCCAC AGGATTATTC GTATTGGGCG AAACCGAAGC 

951 CTTAAAGCTG GCAGAGCGCG AAAAACTCGC TGTTTTCCTG ATTGTCAGGG 

1001 ATAAAGGCGG CTACCGCACC GCCATGTCTT CCGAATTTGA AAAACTGCTC 

1051 CGCTAA 

This corresponds to the amino acid sequence <SEQ ID 398; ORF 1 1 l.a>: 
alll .pep 

1 MPSETRLPNF IRTLIFALSF IFLNA CSEQT AQTVTLQGET MGTTYTVKYL 

51 SNNRDKLPSP AEIQKRIDDA LKEVNRQMST YQPDSEISRF NQHTAGKPLR 

101 ISSDFAHVTA EAVHLNRLTH GAL DVT VG PL VNLWGFGPDK SVTREPSPEQ 

151 IKQAASYTGI DKIILKQGKD YASLSKTHPK AYLDLSSIAK GFGVDKVAGE 

201 LEKYGIQNYL VEIGGELHGK GKNARGEPWR IGIEQPNIVQ GGNTQI IVPL 

251 NNRSLATSGD YRIFHVDKSG KRLSHI INPN NKRPISHNLA SISWADSAM 

301 TADGLSTGLF VLGETEALKL AEREKLAVFL IVRDKGGYRT AMSSEFEKLL 

351 R* 

mlll/alll 97.7% identity in 351 aa overlap 

10 20 30 40 50 60 

mill . pep MPSETRLPNFIRVLIFALGFIFLNACSEQTAQTVTLQGETMGTTYXVKYLSNNRDKLPSP 
I I I I I I I II II I : I t I I I : 1 I I I II I I 1 I I I I I I I I I I I I I I I I I : I I I i I I I I M M I I 
alll MPSETRLPNFIRTLIFALSFIFLNACSEQTAQTVTLQGETMGTTYTVKYLSNNRDKLPSP 

10 20 30 40 50 60 

70 80 90 100 110 120 

mill .pep AEIXKRIDDALKEXNRXMSTYQPDSEISRFNQHTAGKPLRISSDFAHVTAEAVRLNRLTH 
III I I I I I I I I I II I I M I I I I I I I I I I I I I If I I I f I I I I I II I I I I I I : I M I M 
alll AEIQKRIDDALKEVNRQMSTYQPDSEISRFNQHTAGKPLRISSDFAHVTAEAVHLNRLTH 

70 80 90 100 110 120 

130 140 150 160 170 180 

mill .pep GALDVTVGPLVNLWGFGPDKSVTREPSPEQIKQAASYTGIDKI ILKQGKDYASLSKTHPK 

I I I I I I I I I I I I I! II I I I I I I I I I I I I I I I I I I I I I M I I II I I I I I I I I I I II I I I II 
alll GALDVTVGPLVNLWGFGPDKSVTRE PS PEQIKQAASYTG I DKIILKQGKD YASLSKTHPK 

130 140 150 160 170 180 

190 200 210 220 230 240 

mill .pep AYLDL S S I AKGFGVDKVAGELEKYG I QN YLVE I GGE LHGKGKN ARGE PWR I G I EQPN I VQ 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II I I I I I M M I I I M I I I I I I I 
alll AYLDLSSIAKGFGVDKVAGELEKYGIQNYLVEIGGELHGKGKNARGEPWRIGIEQPNIVQ 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml 1 1 . pep GGNTQI I VPLNNRSLATSGDYRI FHVDKNGKRLSHI INPNNKRPI SHNLAS I S WAD SAM 

I M M I I I I I I I I I I I I I I II M I M I I : I M I I I I I II I I I I I I I I I I I I I I I I I I I ! I 
alll GGNTQI IVPLNNRSLATSGDYRIFHVDKSGKRLSHI INPNNKRPI SHNLASISVVADSAM 

250 260 270 280 290 300 

310 320 330 340 350 

mill .pep TADGLSTGLFVLGETEALKLAEREKLAVFLIVRDKGGYRTAMSSEFEKLLRX 
I I I I I I I I I I M I I I I II I I I I I I I I I I I I I I I I I I I I I II I I i I I I I I I I I 
alll TADGLSTGLFVLGETEALKLAEREKLAVFLIVRDKGGYRTAMSSEFEKLLRX 

310 320 330 340 350 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 399>: 

glll-l . aeq 

1 ATGCCGTCTG AAACACGCCT GCCGAACCTT ATCCGCGCCT TGATATTTGC 

51 CCTGGGTTTC ATCTTCCTGA ACGCCTGTTC GGAACAAACC GCGCAAacCG 

101 TTACCCTGCA AGGCGAAACG ATGGGTACGA CCTATACCGT CAAATACCTT 

151 TCAAATAATC GGGACAAACT CCCCTCCCCT GCCAAAATAC AAAAGCGCAT 

201 TGATGATGCG CTTAAAGAAG TCAACCGGCA GATGTCCACC TACCAGACCG 
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251 ATTCCGAAAT CAGCCGGTTC AACCAACACA CAGCCGGCAA GCCCCTCCGC 

301 ATTTCAAGCG ATTTCGCACA CGTTACCGCC GAAGCCGTCC GCCTGAACCG 

351 CCTGACTCAC GGCGCACTGG ACGTAACCGT CGGCCCTTTG GTCAACCTTT 

401 GGGGGTTCGG CCCCGACAAA TCCGTTACCC GTGAACCGTC GCCGGAACAA 

451 ATCAAACAGG CGGCATCTTA TACGGGCATA GACAAAATCA TTTTGCAACA 

501 AGGCAAAGAT TACGCTTCCT TGAGCAAAAC CCACCCCAAA GCCTATTTGG 

551 ATTTATCTTC GATTGCCAAA GGCTTCGGCG TTGATAAAG7 TGCGGGCGAA 

601 CTGGAAAAAT ACGGCATTCA AAATTATCTG GTCGAAATCG GCggcGAGTT 

651 GCACGGCAAA GGCAAAAATG CGCACGGCGA ACCGTGGCGC ATCGGTATAG 

701 AGCAACCCAA TATcatccaa ggcggcaata cgcAGattat cgtcccgctg 

751 aaCaaccgtt cgcttgccac ttccggcgAT taccgtaTTT tccacgtcgA 

801 TAAAAACGGC Aaacgccttt cccacATCAT CAATCCCAAC AACAAACGAC 

851 CCATCAGcCA CAAcctcgcc tcCATCAgCg TGGTCTCAGA CAGTGCAATG 

901 ACGGCGGACG GTTTATCCAC AGGATTATTT GTTTTAGGCG AAACCGAAGC 

951 CTTAAGGCTG GCAGAACAAG AAAAACTCGC TGTTTTCCTA ATTGTCCGGG 

1001 ATAAGGACGG CTACCGCACC GCCATGTCTT CCGAATTTGC CAAGCTGCTC 

1051 CGCTAA 



This corresponds to the amino acid sequence <SEQ ID 400; ORF 1 1 l-l.ng>: 

glll-l .pep 

1 MPSETRLPNL IRALIFALGF IFLNAC SEQT AQTVTLQGET MGTTYTVKYL 

51 SNNRDKLPSP AKIQKRIDDA LKEVNRQMST YQTDSEISRF NQHTAGKPLR 

101 ISSDFAHVTA EAVRLNRLTH GAL DVT VG PL VNLWGFGPDK SVTREPSPEQ 

151 IKQAASYTGI DKIILQQGKD YASLSKTHPK AYLDLSSIAK GFGVDKVAGE 

201 LEKYGIQNYL VEIGGELHGK GKNAHGEPWR IGIEQPNIIQ GGNTQIIVPL 

251 NNRSLATSGD YRIFHVDKNG KRLSHIINPN NKRPISHNLA SISVVSDSAM 

301 TADGLSTGLF VLGETEALRL AEQEKLAVFL I VRDKDGYRT AMSSEFAKLL 

351 R* 



The following partial DNA sequence was identified in TV. meningitidis <SEQ ED 40 1>: 

mlll-l . seq 

1 ATGCCGTCTG AAACACGCCT GCCGAACTTT ATCCGCGTCT TGATATTTGC 

51 CCTGGGTTTC ATCTTCCTGA ACGCCTGTTC GGAACAAACC GCGCAAACCG 

101 TTACCCTGCA AGGCGAAACG ATGGGCACGA CCTATACCGT CAAATACCTT 

151 TCAAATAATC GGGACAAACT CCCCTCACCT GCCGAAATAC AAAAACGCAT 

201 CGATGACGCG CTTAAAGAAG TCAACCGGCA GATGTCCACC TATCAGCCCG 

251 ACTCCGAAAT CAGCCGGTTC AACCAACACA CAGCCGGCAA GCCCCTCCGC 

301 ATTTCAAGCG ACTTCGCACA CGTTACTGCC GAAGCCGTCC GCCTGAACCG 

351 CCTGACACAC GGCGCGCTGG ACGTAACCGT CGGCCCCTTG GTCAACCTTT 

401 GGGGATTCGG CCCCGACAAA TCCGTTACCC GTGAACCGTC GCCGGAACAA 

4 51 ATCAAACAGG CGGCATCTTA TACGGGCATA GACAAAATCA TTTTGAAACA 

501 AGGCAAAGAT TACGCTTCCT TGAGCAAAAC CCACCCCAAG GCCTATTTGG 

551 ATTTATCTTC GATTGCCAAA GGCTTCGGCG TTGATAAAGT TGCGGGCGAA 

601 CTGGAAAAAT ACGGCATTCA AAATTATCTG GTCGAAATCG GCGGCGAGTT 

651 GCACGGCAAA GGCAAAAACG CGCGCGGCGA ACCGTGGCGC ATCGGTATCG 

701 AGCAGCCCAA TATCGTCCAA GGCGGCAATA CGCAGATTAT CGTCCCGCTG 

7 51 AACAACCGTT CGCTTGCCAC TTCCGGCGAT TACCGTATTT TCCACGTCGA 

801 TAAAAACGGC AAACGCCTCT CCCATATCAT CAACCCGAAC AACAAACGAC 

851 CCATCAGCCA CAACCTCGCC TCCATCAGCG TGGTCGCAGA CAGTGCGATG 

901 ACGGCGGACG GCTTGTCCAC AGGATTATTC GTATTGGGCG AAACCGAAGC 

951 CTTAAAGCTG GCAGAGCGCG AAAAACTCGC TGTTTTCCTG ATTGTCAGGG 

1001 ATAAAGGCGG CTACCGCACC GCCATGTCTT CCGAATTTGA AAAACTGCTC 

1051 CGCTAA 



This corresponds to the amino acid sequence <SEQ ID 402; ORF 1 1 1-1>: 

ml 1 1 - 1 .pep 

1 MPSETRLPNF IRVLIFALGF IFLNAC SEQT AQTVTLQGET MGTTYTVKYL 

51 SNNRDKLPSP AEIQKRIDDA LKEVNRQMST YQPDSEISRF NQHTAGKPLR 

101 ISSDFAHVTA EAVRLNRLTH GALDVTVGPL VNLWGFGPDK SVTREPSPEQ 

151 IKQAASYTGI DKIILKQGKD YASLSKTHPK AYLDLSSIAK GFGVDKVAGE 

201 LEKYGIQNYL VEIGGELHGK GKNARGEPWR IGIEQPNIVQ GGNTQIIVPL 

251 NNRSLATSGD YRIFHVDKNG KRLSHIINPN NKRPISHNLA SISVVADSAM 

301 TADGLSTGLF VLGETEALKL AEREKLAVFL IVRDKGGYRT AMSSEFEKLL 

351 R* 



mlll-l/glll-1 96.6% identity in 351 aa overlap 



10 20 30 40 50 60 

mlll-l .pep MPSETRLPNFIRVLIFALGFIFLNACSEQTAQTVTLQGETMGTTYTVKYLSNNRDKLPSP 
I I I I I M I I : I I : I I I I I I I I M I I I t I I I M I I i I I I I I I I I I I I I I I I I I I I I I I I I I 
glll-l MPSETRLPNLI RALI FALGFIFLNACSEQTAQTVTLQGETMGTTYTVKYLSNNRDKLPS P 
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10 20 30 40 50 60 

70 80 90 100 110 120 

mlll-l.pep AEIQKRIDDALKEVNRQMSTYQPDSEISRFNQHTAGKPLRISSDFAHVTAEAVRLNRLTH 
I : I I I I I I M I I I I I I I I I I I I II I I I i I I I i I t I I I I I I I t I I I I 1 I i I I ! I I I I I I I 
glll-1 AKIQKRI DDALKEVNRQMSTYQTDSEI SRFNQHTAGKPLRISSDFAHVTAEAVRLNRLTH 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 11-1 -pep GALDVTVGPLVNLWGFGPDKSVTREPSPEQIKQAASYTGIDKIILKQGKDYASLSKTHPK 
I I I I I I I I I 1 I I I I M I I I I I I I i t I I I I I I I I I I I I I I I I M I I : I I I I I I I I I I I I I I 
glll-1 GALDVTVGPLVNLWGFG PDKSVTREPSPEQIKQAASYTGI DKI I LQQGKDYASLSKTHPK 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml 11-1 -pep AYLDLSSIAKGFGVDKVAGELEKYGIQNYLVEIGGELHGKGKNARGEPWRIGIEQPNIVQ 
I I I I I I j I I I I I I I I I I I I M 1 I I I II I I I I I I I I I 1 I 11 I I i I : I I I I I I I I M I i I : I 
glll-1 AYLDLSSIAKGFGVDKVAGELEKYGIQNYLVEIGGELHGKGKNAHGEPWRIGIEQPNIIQ 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml 11-1 .pep GGNTQI I V PLNNRSLATSGDYRI FH VDKNGKRLSH UN PNNKRP I SHNLAS I S WADS AM 
I I I I I I I I I I I I I I II I I I M I I I I If I I I I I I I I I I I I I I I I I I I I I I I I I I II : II I I 
glll-1 GGNTQI I VPLNNRS LATSGDYRI FHVDKNGKRLSHIINPNNKRPI SHNLASISWSDSAM 

250 260 270 280 290 300 

310 320 330 340 350 

ml 11-1 .pep TADGLSTGLFVLGETEALKLAEREKLAVFLI VRDKGGYRTAMSSEFEKLLRX 
I I I I I I It I I I I I I I I I I : I II : I I I I I i I I I i I ! I I I I I I I I I I I I I I I 
glll-1 TADGLSTGLFVLGETEALRLAEQEKLAVFLIVRDKDGYRTAMSSEFAKLLRX 

310 320 330 340 350 

glll-l/p44550 

sp| P4 4 550 | YOJL_HAEIN HYPOTHETICAL LIPOPROTEIN HI0172 PRECURSOR >gi | 1074292 I pir | | C6414 4 
hypothetical protein HI0172 - Haemophilus influenzae {strain Rd KW20) >gi 11573128 (U32702) 
lipoprotein, putative [Haemophilus influenzae Rd] Length = 346 
Score = 349 bits (885), Expect = 2e-95 

Identities « 177/328 (53%), Positives = 240/328 (72%), Gaps = 4/328 (1%) 

Query: 23 LNACSEQTAQTVTLQGETMGTTYXVKYLSNNRDKLPSPAEIXKRIDDALKEXNRXMSTYQ 82 

L AC ++T + ++L G+TMGTTY VKYL + S + + 1+ LK+ N MSTY+ 

Sbjct: 17 LAACQKET-KVISLSGKTMGTTYHVKYLDDGSITATS-EKTHEEIEAILKDVNAKMSTYK 7 4 

Query: 83 PDSEISRFNQHT-AGKPLRISSDFAHVTAEAVRLNRLTHGALDVTVGPLVNLWGFGPDKS 141 

DSE+SRFNQ+T P+ IS+DFA V AEA+RLN++T GALDVTVGP+VNLWGFGP+K 

Sbjct: 75 KDSELSRFNQNTQVNTPIEISADFAKVLAEAIRLNKVTEGALDVTVGPWNLWGFGPEKR 134 

Query: 142 VTREPSPEQIKQAASYTGIDKIILKQGKDYASLSKTHPKAYLDLSSI AKGFGVDKVAGEL 201 

++P+PEQ+ + + + GIDKI L K+ A+LSK P+ Y+DLSSIAKGFGVD+VA +L 
Sbjct: 135 PEKQPTPEQLAERQAWVGIDKITLDTNKEKATLSKALPQVYVDLSSIAKGFGVDQVAEKL 194 

Query: 202 EKYGIQNYLVEIGGELHGKGKNARGEPWRIGIEQPNIVQGGNTQIIVPLNNRSLATSGDY 2 61 

E+ QNY+VEIGGE+ KGKN G+PW+I IE+P + ++ LNN +A+SGDY 

Sbjct: 195 EQLNAQN YMVE I GGE I RAKGKN I EGKPWQI AI EKPTTTGERAVEAVI GLNNMGMAS SGDY 254 

Query: 262 RI FHVDKNGKRLSHI I N PNNKRPI SHNLAS I S WADS AMTADGLSTGLFVLGETEALKLA 321 

RI+ ++NGKR +H I+P PI H+LASI+V+A ++MTADGLSTGLFVLGE +AL++A 
Sbjct: 255 RIY-FEENGKRFAHEIDPKTGYPIQHHLASITVLAPTSMTADGLSTGLFVLGEDKALEVA 313 

Query: 322 EREKLAVFLIVRDKGGYRTAMSSEFEKL 34 9 

E+ LAV+LI+R G+ T SS F+KL 
Sbjct: 314 EKNNLAVYLI I RTDNGFVTKSSSAFKKL 341 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 403>: 

alll-l . seq 

1 ATGCCGTCTG AAACACGCCT GCCGAACTTT ATCCGCACCT TGATATTTGC 

51 CCTGAGTTTT ATCTTCCTGA ACGCCTGTTC GGAACAAACC GCGCAAACCG 

101 TTACCCTGCA AGGTGAAACG ATGGGCACGA CCTATACCGT CAAATACCTT 

151 TCAAATAATC GGGACAAACT CCCCTCACCT GCCGAAATAC AAAAGCGCAT 

201 CGATGACGCG CTTAAAGAAG TCAACCGGCA GATGTCCACC TATCAGCCCG 

251 ACTCCGAAAT CAGCCGGTTC AACCAACACA CAGCCGGCAA GCCCCTCCGC 

301 ATTTCAAGCG ACTTCGCACA CGTTACTGCC GAAGCCGTCC ACCTGAACCG 
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351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 



CCTGACACAC 
GGGGATTCGG 
ATCAAACAAG 
AGGCAAAGAT 
ATTTATCTTC 
CTGGAAAAAT 
GCACGGCAAA 
AACAGCCCAA 
AACAACCGTT 
TAAAAGCGGC 
CCATCAGCCA 
ACGGCGGACG 
CTTAAAGCTG 
ATAAAGGCGG 
CGCTAA 



GGCGCGCTGG 
CCCCGACAAA 
CAGCATCTTA 
TACGCTTCCT 
GATTGCCAAA 
ACGGCATTCA 
GGCAAAAACG 
CATCGTCCAA 
CGCTTGCCAC 
AAACGCCTCT 
CAACCTCGCC 
GCTTGTCCAC 
GCAGAGCGCG 
CTACCGCACC 



ACGTAACCGT 
TCCGTTACCC 
TACGGGCATA 
TGAGCAAAAC 
GGCTTCGGCG 
AAATTATCTG. 
CGCGCGGCGA 
GGCGGCAATA 
TTCCGGCGAT 
CCCATATCAT 
TCCATCAGCG 
AGGATTATTC 
AAAAACTCGC 
GCCATGTCTT 



CGGCCCCTTG 
GTGAACCGTC 
GACAAAATCA 
CCACCCCAAG 
TTGATAAAGT 
GTCGAAATCG 
ACCTTGGCGC 
CGCAGATTAT 
TACCGTATTT 
TAATCCGAAC 
TGGTCGCAGA 
GTATTGGGCG 
TGTTTTCCTG 
CCGAATTTGA 



GTCAACCTTT 
GCCGGAACAA 
TTTTGAAACA 
GCCTATTTGG 
TGCGGGCGAA 
GCGGCGAGTT 
ATCGGCATCG 
CGTCCCGCTG 
TCCACGTCGA 
AACAAACGAC 
CAGTGCGATG 
AAACC GAAGC 
ATTGTCAGGG 
AAAACTGCTC 



This corresponds to the amino acid sequence <SEQ ID 404; ORF 1 1 1-1 .a>: 



alll-l .pep 

1 MPSETRLPNF 



IRTL IFALSF I FLNACS EQT 



51 SNNRDKLPSP 

101 ISSDFAHVTA 

151 IKQAASYTGI 

201 LEKYGIQNYL 

251 NNRSLATSGD 

301 TADGLSTGLF 

351 R* 



AEIQKRI DDA 
EAVHLNRLTH 
DKIILKQGKD 
VEIGGELHGK 
YRIFHVDKSG 
VLGETEALKL 



LKEVNRQMST 
GALDVTVGPL 
YASLSKTHPK 
GKNARGEPWR 
KRLSHIINPN 
AEREKLAVFL 



AQTVTLQGET 
YQPDSEISRF 
VNLWGFGPDK 
AYLDLSSIAK 
IGIEQPNIVQ 
NKRPISHNLA 
IVRDKGGYRT 



MGTTYTVKYL 
NQHTAGKPLR 
SVTREPSPEQ 
GFGVDKVAGE 
GGNTQI I VPL 
SISVVADSAM 
AMSSEFEKLL 



alll-l/mlll-1 



98.9% identity in 351 aa overlap 



10 20 30 40 50 60 

alll-l. pep MPSETRLPNFIRTLI FALS FI FLNACSEQTAQTVTLQGETMGTTYTVKYLSNNRDKLPS P 
I j i I I I I I I I I I : I I I M : I I I I I I I I I E I I I I I I I I I I II I I I I I I I M H I i I I I I I I 
ml 11-1 MPSETRLPNFIRVLI FALGFI FLNACS EQTAQTVTLQGETMGTTYTVKYLSNNRDKLPSP 

10 20 30 40 50 60 

70 80 90 100 110 120 

alll-l .pep AEIQKRI DDALKEVNRQMSTYQPDSEISRFNQHTAGKPLRI SSDFAHVTAEAVHLNRLTH 
I I I 1 I I I I I I I I I I I ! t I I I I I M I I I I I I I I I I I I I II I I I I I I I I I I II I I : t I I I I 1 
ml 11-1 AEIQKRI DDALKEVNRQMSTYQPDSEISRFNQHTAGKPLRI SSDFAHVTAEAVRLNRLTH 

70 80 90 100 110 120 

130 140 150 160 170 180 

alll-l .pep GALDVTVGPLVNLWGFGPDKSVTREPSPEQIKQAASYTGIDKIILKQGKDYASLSKTHPK 
I I I I I I I I I I I I I I I I I I I If I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 
mil 1-1 GALDVTVGPLVNLWGFGPDKSVTREPSPEQIKQAASYTGIDKIILKQGKDYASLSKTHPK 

130 140 150 160 170 180 

190 200 210 220 230 240 

alll-l .pep AYLDLSSIAKGFGVDKVAGELEKYGIQNYLVEIGGELHGKGKNARGEPWRIGIEQPNIVQ 
I | | | | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
ml 11-1 AYLDLSSIAKGFGVDKVAGELEKYGIQNYLVEIGGELHGKGKNARGEPWRIGIEQPNIVQ 

190 200 210 220 230 240 

250 260 270 280 290 300 

alll-l. pep GGNTQI I VPLNNRSLATSGDYRI FHVDKSGKRLSHI INPNNKRP I SHNLAS I S VVADSAM 
I E M I I I I I I M I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I 1 I I 1 I I I I I I I I I I I I 
ml 11-1 GGNTQI I VPLNNRSLATSGDYRI FHVDKNGKRLSH I INPNNKRPI SHNLAS IS VVADSAM 

250 260 270 280 290 300 



310 320 330 340 350 

alll-l .pep TADGLSTGLFVLGETEALKLAEREKLAVFLI VRDKGGYRTAMSSEFEKLLRX 
I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I t I I I I I I I I I 
ml 11-1 TADGLSTGLFVLGETEALKLAEREKLAVFLI VRDKGGYRTAMSSEFEKLLRX 

310 320 330 340 350 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 405>: 

gll4 . seq 

1 ATGGCTTCCA TCACTTCGCC GCTGCACGGG GCGCAGCAGG AATGCAGCAA 
51 GACTTTTTTA TGTCCGCCGG GCGGGACGAG TATGGGGCGG TCAATGTCGG 
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101 TAACGGTAGG TTTGTTTTGT GTTTCCATTA ACTTAACAAT ATCTGTCGAA 

151 T ACGGT C AAA GCGGCTATTT TACCAGAGCC GCCGAATGTA AAACAGGGTG 

2 01 T CAGGG CAT C AGCCCGAGCT GCCTGAACGA ACGGACGGTT TGCGAGGTAA 

2 51 CGATAAAATG GTCGAGCAGC GAAACATCAA CCAGCGACAT GGCCTGTGCC 

3 01 AGCCGCCTTG TGAACATGAT GTCTTCCTGC GAAGGTTCAG GCGAGCCGCC 

3 51 CGGATGGTTG TGCGCGATAA TCAGGCTGTC GGCATATTCG TCCAATGCCA 

4 01 GTTTGACGAT TTCGCGGATG TAA 

This corresponds to the amino acid sequence <SEQ ID 406; ORF 1 14.ng>: 

gll4 . pep 

1 MAS I TS PLHG AQQECSKTFL CPPGGTSMGR SM SVTVGLFC VSINLTISV E 

51 YGQSGYFTRA AECKTGCQGI SPSCLNERTV CEVTIKWSSS ETSTSDMACA 

101 SRLVNMMSSC EGSGEPPGWL CAIIRLSAYS SNASLTISRM * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 407>: 

m!14 . seq 

1 ATGGCTTCCA TCACTTCGCC GCTGCACGGG GCGCACAGAG AATGCAGCAA 

51 GACTTTTTTA TGTCCACCGG GCGGGACGAG TATAGGGCGG TCAATGTCGG 

101 TAACGGTAGG TTTGTTTTGT GTTTCCATTA ACTTAACAAT ATCTGTTGAA 

151 TACGGTTGAA GCGGCTATTT TAT CAGAGC C GCCGCATGTA AAACAGAGTG 

2 01 TCAGGGCATC AACCCGAGCT GTCTGAACGA ACAGACGCTT TGCGAkGTAA 
251 CGATAAAATG GTCGAGCAGC GACACATCGA CCAGCGACAT TGCCTGTGCC 

3 01 AGCCGCCTTG TGAACATGAT GTCTTCCTGC GAArGTTCsG GCGAGCCGcC 

3 51 CGgATGGTTG TGCGCAATAA TCAGGCTGTC GGCATATTCG TCCAATGCCA 

4 01 GTTTGACGAT TTCGCGGATG TAA 

This corresponds to the amino acid sequence <SEQ ID 408; ORF 1 14>: 

mll4 . pep 

1 MAS ITS PLHG AHRECSKTFL CPPGGTSIGR SM SVTVGLFC VSINLTISV E 

51 YGXSGYFIRA AACKTECQGI NPSCLNEQTL CXVTIKWSSS DTSTSDIACA 

101 SRLVNMMSSC EXSGEPPGWL CAIIRLSAYS SNASLTISRM * 



mll4/gl!4 90.0% identity over a 140 aa overlap 

10 20 30 40 50 60 

mll4 . pep MAS I TS PLHGAHRECSKTFLCP PGGTS I GRSMS VTVGLFCVS INLT I S VE YGXSGYF I RA 

||MIIIIII|::|||MIIIIIMIhllllMIIIIIMIIIMIII!ll I 1 1 1 II 

gll4 MAS I TS PLHGAQQECSKTFLCPPGGTSMGRSMS VTVGLFCVS INLT I SVE YGQSGYFTRA 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml!4 . pep AACKTECQGINPSCLNEQTLCXVTIKWSSSDTSTSDIACASRLVNMMSSCEXSGEPPGWL 

I III lllhillllhhl I I II Nihil I I hill II I I Mill I I III II I II 

gll4 AECKTGCQG I S PSCLNE RTVCEVTI KWS S SETS TSDMACASRLVNMMS S CEGSGE P PGWL 

70 80 90 100 110 120 



130 140 
ml 14 .pep CAI IRLSAYSSNASLTI SRMX 

I II I Mi III III hi ! MM 

gll4 CAI IRLSAYSSNASLTI SRMX 

130 140 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 409>: 

all4 . seq 

1 ATGCCGGAGG CAAGCATCGC CTCCATCACT TCGCCGCTGC ACGGGGCGCA 

51 ACAGGAATGC AGCAAGACTT TTTTATGTCC GCCGGGCGGG ACGAGTATGG 

101 GGCGGTCAAT GTCGGTAACG GTAGGTTTGT TTTGTGTTTC CATTAACTTA 

151 ACGATATCTG TCGAATACGG TTGAAGCGGC TATTTTATCA GAGCCGCCGC 

201 ATGTAAAACA GGGTGTCAGG GCATCAGCCC GAGCTGCCTG AACGAACGGA 

2 51 CGGTTTGCGC CGTTACGATA AAATGGTCGA GCAGCGACAC ATCGACCAGC 

301 GACATTGCCT GTGCCAGCCG CCTTGTGAAC ATGATGTCTT CCTGCGAAGG 

351 TTCGGGCGAG CCGCCCGGAT GGTTGTGCGC GATAATCAGG CTGTCGGCAT 

4 01 ATTCGTCCAA TGCCAGTTTG ACAATTTCAC GGATGTAA 
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This corresponds to the amino acid sequence <SEQ ID 410; ORF 1 14.a>: 
all 4 . pep 

1 MPEASIASIT SPLHGAQQEC SKTFLCPPGG TSMGRSM SVT VGLFCVSINL 
51 TISV EYG*SG YFIRAAACKT GCQGISPSCL NERTVCAVTI KWSSSDTSTS 
101 DIACASRLVN MMSSCEGSGE PPGWLCAIIR LSAYSSNASL TISRM* 

mll4/all4 92.9% identity in 140 aa overlap 

10 20 30 40 50 

ml 14 .pep MASITSPLHGAHRECSKTFLCPPGGTSIGRSMSVTVGLFCVSINLTISVEYGXSG 
: I I I I I I I II 1 : : I I I I I I I I I M M I : ! M I M I 1 I i I I! I ! I ! I I I I I I I E M 
all 4 MPEASIASITSPLHGAQQECSKTFLCPPGGTSMGRSMSVTVGLFCVSINLTISVEYGXSG 

10 20 30 40 50 60 

60 70 80 90 100 110 

mil 4 .pep YFIRAAACKTECQGINPSCLNEQTLCXVTIKWSSSDTSTSDIACASRLVNMMSSCEXSGE 
I I II E II I II 1111:111111:1:1 I I I I I M I I 11 I i I I I I I I I I I I i I I I M ill 
all4 YFIRAAACKTGCQG I SPSCLNERTVCAVTI KWSSSDTSTS DIACASRLVN MMSSCEGSGE 

70 80 90 100 110 120 

120 130 140 

mll4 .pep PPGWLCAIIRLSAYSSNASLTISRMX 
I I I I II I I I I I I I I I I I I I I I I I I I I 
all 4 PPGWLCAIIRLSAYSSNASLTISRMX 

130 140 



The following partial DNA sequence was 


gll7 . seq 






1 


atggtcgaog 


aactcgacCT 


51 


TGCCGACATC 


GGACGCTACG 


101 


GCTGCAACAG 


CACCGTCGCC 


151 


AAACTTACCC 


ACTTCGCCCG 


201 


CGCACAGCAA 


G CGGAAACC A 


251 


Acatccgcgt 


cgtaTTAATC 


301 


ttTTtaaGCA 


ACGCCCCCGA 


351 


aacccTCGAC 


ATCTTCGCCC 


401 


TCAAATGGCA 


GCTCGAAGAT 


451 


T AC CGCGAAA 


TCGCCCTGCT 


501 


ATACATCGAA 


AACTTCCTCG 


551 


ATATCCACTT 


TGAAGTCGCC 


601 


AAAAAAATGG 


TGAAGAAAAA 


651 


CGCCGTGCGG 


ATTCTGGTCG 


701 


gcaTCGTCCA 


CAGCCTCTGG 


751 


ATCGCCAACC 


CCAAAGgcaA 


801 


cggcccGGAa 


gacaaaggtg 


851 


accAATTCaa 


CgaatTcggT 


901 


ggcaaaggcg 


attccGCCtA 


951 


CTTGGACTGG 


CGCGAAAATA 


1001 


CCGCCTTCAA 


AACCGAGCTT 


1051 


CACGGCAAAG 


TCCTCTCTCT 


1101 


CTACGCCCTG 


CACAGCAGCA 


1151 


AaggGCAGAT 


TGTGCCGCTG 


1201 


GAAATcatta 


cCGCcaaAGA 


1251 


AGGctgGGtc 


aAATCCGGCA 


1301 


GCCAGcaaAa 


cgCcgaCACC 


1351 


AAGCAGCTTG 


CCAAACTCAC 


1401 


aaATCTCGGC 


tacaaAAAGC 


1451 


gcgaaatttc 


caaccgcgcc 


1501 


ccgccccCCG 


TGCCCGTCAG 



identified in N. gonorrhoeae <SEQ ID 41 1>: 



GCTGCCCGAT 
TCCCCGATTG 
GAGCTGGTCA 
GGTGGACAGC 
TG CGGAAAAT 
AAACTGGCGA 
CAGCCCTGAA 
CGCTCGCCAA 
TTGGGCTTCC 
TTTGGACGAA 
ATATCCTGCG 
GGCCGTCCGA 
ACTCAGCTTC 
ATACCGTCCC 
CAGCCCATTC 
CGgttATAAA 
tggaaGtgCA 
gtcgccgCCC 
cgaacaaAAA 
TGGCGGAAAG 
TTCAACGACA 
GCCAACGGGC 
TcggCGACCG 
TCCACCCCGC 
AGGGCATCCT 
AGGCCATCGG 
GTGCGCGAAG 
GCCCAAACCC 
cagaagacct 
atCcaaaaag 
CGCAACCACC 



GCCGTCGCCG 
GAACCTATTG 
AAGGTGtgga 
CTCGCCACGC 
GCTGCTGGCg 
TGCGTacgcg 
AAACgcgccG 
CCGCTTGGGC 
GCCATCAAGA 
AAACGCACCG 
TACGGAACTC 
AACACATCTA 
GACGgccTGT 
CGaGTGTTAC 
CCGGCGagtt 
AG t TTGC ACA 
AATCCGCACC 
ACTGGCGtta 
ATcgccTggt 
CGGCAAGGAA 
CGATTTATGT 
GCAACCCCCA 
CTGCCGGGGC 
TCGAAAACGG 
TCCGTCAACT 
caaAATCCGC 
AAGGCCGTGT 
AACCTGCAAG 
ctacacCGCc 
cctgcggcac 
AT CGTCAAAC 



CCACCCTGCT 
GTTTCCGAGC 
CGAAGTGCAG 
CGGAAGAACG 
atggttaccg 
caccCTGcta 
TCgccaaAga 
GTGTGGCAGC 
ACCCGAAAAA 
AACGCCTCGA 
AAAAAATACA 
CTCCATTTAC 
TCGACATCCG 
ACCACGCTGG 
CGAcgactAC 
CCGTCATCGT 
TTCGAtatGC 
caaagaaggc 
TGCgccaACT 
GACCTCGCCG 
TTTGACCCCG 
TCGACTTCGC 
GCGAAAGTCG 
ACAGCGCGTC 
GGCTTTACGA 
GCCTAcatCC 
CCAACTCGAC 
AGCTTgccga 
gtcggacaag 
GCTgaacgaa 
AGTCCAAAAT 
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15 51 CAAAAAAGGT GGCAAAACCG GCGTGCTCAT CGACGGCGAA GACGGCTTGA 

16 01 TGACCACGCT TGCCAAATGC TGCAAACCCG CGCCGCCCGA CGATATTGCC 
16 51 GGCTTCGTTA CCCGCGAGCG CGGCATTTCC GTCCACCGCA AAACCTGCCC 
1701 CTCTTTCCGA CACCTTGCCG AACACGCGCC CGAAAAAGTA CTGGACGCAA 
1751 GTTGGGCGGC GTTGCAGGAA GGGCAAGTGT TCGCCGTCGA TATCGAAATC 
18 01 CGCGCCCAAG ACCGCTCCGG GCTTTTGCGC GACGTATCCG ACGCGCTCGC 
1851 CCGCCACAAA CTCAACGTTA CCGCCGTGCA AACCCAGTCC CGCGACTTGG 
1901 AAG C C AG CAT GAGGTTCACG CTCGAAGTCA AACAAGtCAA CGacCTCCCG 
1951 CGCGTCCTCG CCGGCCTCGG CG ATGT C AAA GGCGTATTGA GCGTTACCCG 
2001 GCTTTAA 

This corresponds to the amino acid sequence <SEQ ID 412; ORF 1 17.ng>: 

gll7 . pep 

1 MVDELDLLPD AVAATLLADI GRYVPDWNLL VSERCNSTVA ELVKGVDEVQ 

51 KLTH FARVD S LATPEERAQQ AETMRKMLLA MVTDIRWLI KLAM RTRTLL 

101 FLSNAPDSPE KRAVAKETLD IFAPLANRLG VWQLKWQLED LGFRHQEPEK 

151 YREIALLLDE KRTERLEYIE NFLDILRTEL KKYNIHFEVA GRPKHI YSIY 

2 01 KKMVKKKLSF DGLFDIRAVR ILVDTVPECY TTLGIVHSLW QPIPGEFDDY 
251 IANPKGNGYK SLHTVIVGPE DKGVEVQIRT FDMHQFNEFG VAAHWRYKEG 

3 01 GKGDSAYEQK IAWLRQLLDW RENMAESGKE DLAAAFKTEL FNDTIYVLTP 

3 51 HGKVLSLPTG ATPIDFAYAL HSSIGDRCRG AKVEGQIVPL STPLENGQRV 

4 01 E I I TAKEGHP SVNWLYEGWV KSGKAIGKIR AYIRQQNADT VREEGRVQLD 
4 51 KQLAKLTPKP NLQELAENLG YKKPEDLYTA VGQGEISNRA IQKACGTLNE 
501 PPPVPVSATT IVKQSKIKKG GKTGVL I DGE DGLMTTLAKC CKPAPPDDIA 
551 GFVTRERGIS VHRKTCPSFR HLAEHAPEKV LDASWAALQE GQVFAVDIEI 
6 01 RAQDRSGLLR DVSDALARHK LNVTAVQTQS RD LEAS MR FT LEVKQVNDLP 
651 RVLAGLGDVK GVLSVTRL* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ED 41 3>: 

mll7.seq (partial) 

1 . . GTGAAACTCA AGAAATACAA TGTCCATTTC GAAGTCGCCG GCCGCCCGAA 

51 ACACATCTAC TCCATTTACA AAAAAATGGT GAAGAAAAAA CTCAGCTTCG 

101 ACGGC CTCTT TGACATCCGC GCCGTGCGAA TTCTGGTTGA TACCGTCCCC 

151 GAGTGTTACA CCACGCTGGG TATCGTCCAC AGCCTCTGGC AGCCCATTCC 

2 01 CGGCGAGTTC GACGACTACA TCGCCAATCC CAAAGGCAAC GGCTATAAAA 

251 GTTTGCACAC CGTCATCGTC GGCCCGGAAG ACAAAGGCGT GGAAGTACAA 

301 ATCCGCACCT TCGATATGCA CCAATTCAAC GAATTCGGTG TCGCCGCCCA 

351 CTGg CGTT AC AAAGAGGGCG GCAAGGGCGA TTCCGCCTAC GAACAGAAAA 

4 01 TCGCCTGGTT GCGCCAACTC TTGGACTGGC GCGAAAACAT GGCGGAAAGC 

4 51 GGCAAGGAAG ACCTCGCCGC CGCCTTCAAA ACCGAGCTTT T CAACG ACAC 

501 GATTTATGTT TTGACCCCGC ACGGCAAAGT CCTCTCCCTG CCCACGGGCG 

551 CGACCCCCAT CGACTTCGCC TACGCCCTGC ACAGCAGCAT CGGCGACCGT 

601 TGCCGCGGTG CGAAAGTCGA AGGGCAGATT GTGCCGCTGT CCACCCCGCT 

651 CGAAAACGGA CAGCGCGTCG AAATCATTAC CGCCAAAGAA GGGCATCCTT 

701 CCGTCAACTG GCTTTACGAA GGCTGGGTCA AATCCAACAA GGCAATCGGC 

751 AAAATCCGCG CCTACATCCG CCAGCAAAAC GCCGACACCG TGCGCGAAGA 

801 AGGCCGCGTC CAACTCGACA AACAGCTTGC CAAACTCACG CCCAAACCCA 

851 ACCTGCAAGA GCTTGCCGAA AATCTCGGCT ACAAAAAGCC AGAAGACCTC 

901 TACACCGCCG TCGGACAAGG CG AAATTTC C AACCGCGCCA TCCAAAAAGC 

951 CTGCGGCACg CTGAACGAAC CGCCGCCCGT ACCCGTCAGC GAAACCACCA 

1001 TCGTCAAACA GTCCAAAATC AAAAAAGGCG GCAAAAACGG CGTGC T CATC 

1051 GACGGCGAAG ACGGTCTGAT GACCACGCTT GCCAAATGCT GCAAACCCGC 

1101 GCCGCCCGAC GATATTATCG GCTTCGTTAC CCGCGAGCGC GgCATTTCAG 

1151 TGCACCGCAA AwyyTkCyCG TCTTTCCAAC ACCTCGCCGA ACACGCGCCC 

12 01 GAwAAAGTGC TGGACG CAAG CTGGGCGGCA TTGCAGGAAG GACAAGTATT 

12 51 CGCCGTCGAT ATCGAAATCC GCGCCCAAGA CCGCTCCGGG CTTTTGCGCG 

13 01 ACGTATCCGA CGCGCTCGCC CGCCACAAAC TCAACGTTAC CGCCGTGCAA 

13 51 ACCCAGTCCC GCGACTTGGA AGCCAGCATG AGGTTCACGC TCGAAGTCAA 

14 01 ACAAGTCAAC GACCTCCCGC GCGTCCTCGC CAGCCTCGGC GACGTCAAAG 
14 51 GCGTATTGAG CGTTACCCGG CTTTAA 

This corresponds to the amino acid sequence <SEQ ID 414; ORF 1 17>: 

mll7.pep (partial) 
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1. . . . VKLKKYNVHF EVAGRPKHIY SIYKKMVKKK LSFDGLFDIR AVRILVDTVP 

51 ECYTTLGIVH SLWQPIPGEF DDYIANPKGN GYKSLHTVIV GPEDKGVEVQ 

101 IRTFDMHQFN EFGVAAHWRY KEGGKGDSAY EQKIAWLRQL LDWRENMAES 

151 GKEDLAAAFK TELFNDTIYV LTPHGKVLSL PTGATPIDFA YALHSSIGDR 

201 CRGAKVEGQI VPLSTPLENG QRVEI ITAKE GHPSVNWLYE GWVKSNKAIG 

251 KIRAYIRQQN ADTVREEGRV QLDKQLAKLT PKPNLQELAE NLGYKKPEDL 

3 01 YTAVGQGEIS NRAIQKACGT LNEPPPVPVS ETTIVKQSKI KKGG KNGVL I 
351 DGEDGLMTTL AKCCKPAPPD DIIGFVTRER GISVHRKXXX SFQHLAEHAP 

4 01 XKVLDASWAA LQEGQVFAVD IEIRAQDRSG LLRDVSDALA RHKLNVTAVQ 
451 TQSRDLEASM RFTLEVKQVN DLPRVLASLG DVKGVLSVTR L* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 1 17 shows 97.6% identity over a 490 aa overlap with a predicted ORF (ORF 1 17.ng) 
from N. gonorrhoeae: 

mll7/gll7 



10 20 30 

ml 17 . pep VKLKKYNVHFEVAGRPKHI YS I YKKMVKKKL 

M IMMM III M MM I I MM I I M M 
gll7 EKYREIALLLDEKRTERLEYIENFLDILRTELKKYNIHFEVAGRPKHIYS I YKKMVKKKL. 

150 160 170 180 190 200 



40 50 60 70 80 90 

mll7 . pep SFDGLFDIRAVRILVDTVPECYTTLGIVHSLWQPIPGEFDDYIANPKGNGYKSLHTVIVG 

1 1 1 M 1 1 M 1 1 1 1 1 1 1 M I 11 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 ! 1 1 1 i I 1 1 

gll7 SFDGLFDIRAVRILVDTVPECYTTLGIVHSLWQPIPGEFDDYIANPKGNGYKSLHTVIVG 
210 220 230 240 250 260 



100 110 120 130 140 150 

mil 7 . pep PEDKGVEVQIRTFDMHQFNEFGVAAHWRYKEGGKGDSAYEQKIAWLRQLLDWRENMAESG 

1 1 1 1 1 M I M M I 1 1 i I I 1 1 1 E M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M I ! 1 1 1 1 II 1 1 M I M I i 1 1 

gll7 PEDKGVEVQIRTFDMHQFNEFGVAAHWRYKEGGKGDSAYEQKIAWLRQLLDWRENMAESG 
270 280 290 300 310 320 



160 170 180 190 200 210 

mll7 . pep KEDLAAAFKTELFNDTIYVLTPHGKVLSLPTGATPIDFAYALHSSIGDRCRGAKVEGQIV 

M II M M MM II III I I IMM M I M I I I MM M I IM II Mill I I I I M IM II 

gll7 KEDLAAAFKTELFNDTIYVLTPHGKVLSLPTGATPIDFAYALHSSIGDRCRGAKVEGQIV 
330 340 350 360 370 380 



220 230 240 250 260 270 

mll7 . pep PLSTPLENGQRVEIITAKEGHPSVNWLYEGWVKSNKAIGKIRAYIRQQNADTVREEGRVQ 

IMIMI I I MM I MM I I I MM I II I M 1 1 h M 1 1 M I M I M M I I 1 1 1 M I I II 

g 1 1 7 PLSTPLENGQRVE I ITAKEGHPSVNWLYEGWVKSGKAIGKI RAY I RQQNADTVRE EGRVQ 

390 400 410 420 430 440 



280 290 300 310 320 330 

mll7 . pep LDKQLAKLTPKPNLQELAENLGYKKPEDLYTAVGQGEISNRAIQKACGTLNEPPPVPVSE 

MIMMMMIIIMM I MMMM IMIIMIIM I llllllllll MIMIM I I 

g 1 1 7 LDKQLAKLTPKPNLQELAENIjGYKKPEDLYTAVGQGEI SNRAIQKACGTLNEPPPVPVSA 

450 460 470 480 490 500 



340 350 360 370 380 390 

ml 17 .pep TTIVKQSKIKKGGKNGVLIIX5EDGLMTTLAKCCKPAPPDDIIGFVTRERGISVHRKXXXS 

II Mill I I MM I MM II I I III I II MMMM Ml I I I II II M I I I III M I 

g 1 1 7 TTI VKQSKI KKGGKTGVLi I DGEDGLMTTLAKCCKP AP PDDI AGFVTRERG I S VHRKT CPS 

510 520 530 540 550 560 



ml 17 .pep 



400 410 420 430 440 450 

FQHLAEHAPX KVLDAS WAALQEGQV F AVD I E I RAQDRS GLLRDVS DALARH KLNVTAVQT 
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hlllllll I I 1 I I r t I I i E I I 1 1 I i I E I 1 ) I 1 I I I I ! 1 I ! ! I I I I I i I It ! I i ! t I I I 

g 1 1 7 FRHLAEHAPEKVLDASWAALQEGQVFAVDI E I RAQDRSGLLRDVSDALARHKLNVTAVQT 

570 580 590 600 610 620 

460 470 480 490 

ml 1 7 . pep QSRDLEASMRFTLEVKQVNDLPRVLASLGDVKGVLSVTRLX 

! : I II III MMMI II I I ii I I I I :m! M I i I i I I I I I I 

g 1 1 7 QSRDLEASMRFTLEVKQVNDLPRVLAGLGDVKGVLSVTRLX 
630 640 650 660 



The following partial DNA sequence was identified in N. meningitidis <SEQ ED 41 5>: 

all7 . seq 

1 ATGGTTCATG AACTCGACCT GCTCCCCGAT GCCGTCGCCG CCACCCTGCT 

51 TGCCGACATC GGACGCTACG TCCCCGACTG GAACCTATTG GTTTCCGAAC 

101 GCTGCAACAG TACCGTCGCC GAGCTGGTCA AAGGTGTGGA CGAAGTGCAG 

151 AAACTCACCC ACTTCGCCCG GGTGGACAGC CTCGCCACGC CGGAAGAACG 

201 CGCCCAGCAG GCAGAAACTA TGCGGAAAAT GCTGCTQGCG ATGGTTACCG 

251 ACATCCGCGT CGTGTTAATC AAACTGGCGA TGCGTACGCG CACCCTGCAA 

301 TTTTTAAGCA ACGCCCCCGA CAGCCCCGAA AAACGCGCCG TCGCCAAAGA 

351 AACCCTCGAC ATCTTCGCCC CGCTCGCCAA CCGTTTGGGC GTGTGGCAGC 

4 01 TCAAATGGCA GCTCGAAGAT TTGGGCTTCC GCCATCAAGA AC CCG AAAAA 

4 51 TACCGCGAAA TCGCCCTGCT TTTGGACGAA AAACGCACCG AACGCCTCGA 

501 ATACATCGAA AACTTCCTTA ATATCCTGCG TACGGAACTC AAAAAATACA 

551 ATATCCACTT TGAAGTCGCC GGCCGTCCGA AACACATCTA CTCCATTTAC 

601 AAAAAAATGG TGAAG AAAAA ACTCAGCTTC GACGGGTTGT TCGACATCCG 

651 CGCCGTGCGG ATTCTGGTTG ATACCGTCCC CGAGTGTTAC ACCACACTGG 

7 01 GCATTGTCCA CAGCCTCTGG CAGCCCATTC CCGGCGAGTT CGACGACTAC 

7 51 ATCGCCAACC CGAAAGGCAA CGGCTATAAA AGTTTGCACA CCGTCATCGT 
801 CGGCCCGGAA GACAAAGGCG TGGAAGTGCA AATCCGCACC TTCGATATGC 

8 51 ACCAATTCAA CGAATTCGGT GTCGCCGCGC ACTGGCGTTA CAAAGAGGGC 
901 GGCAAAGGCG ATTCCGCCTA CGAACAAAAA ATCGCCTGGT TACGCCAACT 
951 TTTGGACTGG CGCGAAAACA TGGCGGAAAG CGGCAAGGAA GACCTCGCCG 

1001 CCGCCTTCAA AACCGAGCTT TTCAACGACA CGATTTATGT TTTGACCCCG 

1051 CACGGCAAAG TCCTCTCCCT GCCCACAGGC GCGACCCCCA TCGACTTCGC 

1101 CTACGCCCTG CACAGCAGCA TCGGCGACCG TTGCCGCGGT GCGAAAGTCG 

1151 AAGGGCAGAT TGTGCCGCTG TCCACCCCGC TCGAAAACGG ACAGCGTGTC 

1201 GAAATCATTA CCGCCAAAGA AGGGCATCCT TCCGTCAACT GGCTTTACGA 

1251 AGGCTGGGTC AAATCCAACA AGGCAATCGG CAAAATCCGC GCCTACATCC 

1301 GCCAGCAAAA CGCCGACACC GTGCGCGAAG AAGGCCGCGT CCAACTCGAC 

1351 AAACAGCTTG CCAAACTCAC GCCCAAACCC AACCTGCAAG AGCTTGCCGA 

14 01 AAATCTCGGC TACAAAAAGC CAGAAGACCT CTACACCGCC GTCGGACAAG 

14 51 GCGAAATTTC CAACCGCGCC AT C C AAAAAG CCTGCGGCAC GCTGAACGAA 

1501 CCGCCGCCCG TACCCGTCAG CGAAACCACC AT CGT C AAAC AGTCCAAAAT 

1551 CAAAAAAGGC GGCAAAAACG GCGTGCTCAT CGACGGCGAA GACGGTCTGA 

1601 TGACCACGCT TGCCAAATGC TGCAAACCCG CGCCGCCCGA CGACATTGTC 

1651 GGCTTCGTTA CCCGCGATCG CGGCATTTCG GTACACCGCA AAACCTGCCC 

17 01 CTCTTTCCGA CACCTCGCCG AACACGCGCC CGAAAAAGTA CTGGACGCAA 

17 51 GTTGGGCGGC GTTGCAGGAA GGACAAGTGT TCGCCGTCGA TATCGAAATC 

1801 CGCGCCCAAG ACCGCTCCGG GCTTTTGCGC GACGTATCCG ACGCGCTCGC 

1851 CCGCCACAAA CTCAACGTTA CCGCCGTGCA AACCCAGTCC CGCGACTTGG 

1901 AAGCCAGCAT GAGGTTCACG CTCGAAGTCA AACAAGTTAC CGACCTCCCA 

1951 CGCGTCCTCG CCAGCCTCGG CGACGTCAAA GGCGTATTGA GCGTTACCCG 

2001 GCTTTAA 

This corresponds to the amino acid sequence <SEQ ID 416; ORF 1 17.a>: 
all7 .pep 

1 MVHELDLLPD AVAATLLADI GRYVPDWNLL VSERCNSTVA ELVKGVDEVQ 

51 KLTHFARVDS LATPEERAQQ AETMRK MLLA MVTDIRWLI KLA MRTRTLQ 

101 FLSNAPDSPE KRAVAKETLD IFAPLANRLG VWQLKWQLED LGFRHQEPEK 

151 YREIALLLDE KRTERLEYIE NFLNILRTEL KKYNIHFEVA GRPKHIYSIY 

201 KKMVKKKLSF DGLFDIRAVR ILVDTVPECY TTLGIVHSLW QPIPGEFDDY 

2 51 IANPKGNGYK SLHTVIVGPE DKGVEVQIRT FDMHQFNEFG VAAHWRYKEG 

301 GKGDSAYEQK IAWLRQLLDW RENMAESGKE DLAAAFKTEL FNDTIYVLTP 

351 HGKVLSLPTG ATPIDFAYAL HSSIGDRCRG AKVEGQIVPL STPLENGQRV 



WO 99/057280 



333 



PCT/US99/09346 



401 EIITAKEGHP 

4 51 KQLAKLTPKP 

501 PPPVPVSETT 

551 GFVTRDRGIS 

601 RAQDRSGLLR 

651 RVLASLGDVK 



SVNWLYEGWV 
NLQELAENLG 
IVKQSKIKKG 
VHRKTCPSFR 
DVSDALARHK 
GVLSVTRL* 



KSNKAIGKIR 
YKKPEDLYTA 
GKNGVLIDGE 
HLAEHAPEKV 
LNVTAVQTQS 



AYIRQQNADT 
VGQGEISNRA 
DGLMTTLAKC 
LDASWAALQE 
RDLEASMRFT 



VREEGRVQLD 
IQKACGTLNE 
CKPAPPDDIV 
GQV FAVDIE I 
LEVKQVTDLP 



mll7/all7 98.0% identity in 490 aa overlap 

10 20 30 

mll7 . pep VKLKKYNVH FEVAGRPKHI YS I YKKMVKKKL 

: I I I I I : I I I If I I I I M I II I I I ! I I I I I 
all7 EKYRE I ALLLDEKRTERLE YI EN FLN I LRTELKKYN I H FE VAGR PKH I Y S I YKKMVKKKL 

150 160 170 180 190 200 

40 50 60 70 80 90 

mll7 . pep SFDGLFDIRAVRILVDTVPECYTTLGIVHSLWQPIPGEFDDYIANPKGNGYKSLHTVIVG 
I I I I I I I I I I M I I I I M I II I I I I I I I I I I I M I I I I I I I I I I I I I I I II I I I I I I I M 
all7 SFDGLFDIRAVRILVDTVPECYTTLGIVHSLWQPIPGEFDDYIANPKGNGYKSLHTVIVG 
210 220 230 240 250 260 

100 110 120 130 140 150 

mll7 . pep PEDKGVEVQIRTFDMHQFNEFGVAAHWRYKEGGKGDSAYEQKIAWLRQLLDWRENMAESG 
I I I I I M M I I I I I I I I M I I I I I I I I I I I I I I ! I M I I I I 1 I I I II M I I I I I M I I I I 
a 117 PEDKGVEVQIRT FDMHQFNEFGVAAHWRYKEGGKGDSAYEQKI AWLRQLLDWRENMAESG 

270 280 290 300 310 320 

160 170 180 190 200 210 

mll7 .pep • KEDLAAAFKTELFNDTIYVLT PHGKVLSLPTGATPIDFAYALHSSIGDRCRGAKVEGQIV 

I I I I I M 1 I I I I I I I M I I I I 1 I I I i I I I I 1 I I I I I I I I M I I I I II I I t I M I I I I i I I 
all7 KEDLAAAFKTELFNDTIYVLTPHGKVLSLPTGATPIDFAYALHSSIGDRCRGAKVEGQIV 
330 340 350 360 370 380 

220 230 240 250 260 270 

mll7 -pep PLSTPLENGQRVEIITAKEGHPSVNWLYEGWVKSNKAIGKIRAYIRQQNADTVREEGRVQ 
I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I 
all 7 PLSTPLENGQRVEIITAKEGHPSVNWLYEGWVKSNKAIGKIRAYIRQQNADTVREEGRVQ 
390 400 410 420 430 440 

280 290 300 310 320 330 

mll7 . pep LDKQLAKLT PKPNLQELAENLGYKKPEDLYTAVGQGE I SNRAI QKACGTLNE PPPVPVSE 

I t I I I I I I I I 1 I I I I I M I I I I I I I I I I I I I t I I I I I I I II I I I I M I I I I I I I I I I I I I 
all7 LDKQLAKLT PKPNLQELAENLGYKKPEDLYTAVGQGE I SNRAI QKACGTLNE PPPVPVSE 

450 460 470 480 490 500 

340 350 360 370 380 390 

mll7 .pep TTIVKQSKIKKGGKNGVLIDGEDGLMTTLAKCCKPAPPDDIIGFVTRERGISVHRKXXXS 
I I I I I I I If I I I I I I I I I I 1 I 1 I I I I I I I I I I I I I I I I I I I : I I I I I : I I I I I I I I : I 
all7 TTIVKQSKIKKGGKNGVLIDGEDGLMTTLAKCCKPAPPDDIVGFVTRDRGISVHRKTCPS 
510 520 530 540 550 560 

400 410 420 430 440 450 

mll7 . pep FQHLAEHAPXKVLDASWAALQEGQVFAVDIEIRAQDRSGLLRDVSDALARHKLNVTAVQT 
1:1111111 I I I I M M M I I II I M II II I I I I I I I I I I I I I I I M M I I I I I I I I II 
all7 FRHLAEHAPEKVLDASWAALQEGQVFAVDIEIRAQDRSGLLRDVSDALARHKLNVTAVQT 
570 580 590 600 610 620 

460 470 480 490 

ml 17 .pep QSRDLEASMRFTLEVKQVNDLPRVLASLGDVKGVLSVTRLX 
I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I j I I I I I 
all7 QSRDLEASMRFTLEVKQVTDLPRVLASLGDVKGVLSVTRLX 
630 640 650 660 
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The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 417>: 



gll7-l . seq 

1 ATGACCGCCA 



51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 
1151 
1201 
1251 
1301 
1351 
1401 
1451 
1501 
1551 
1601 
1651 
1701 
1751 
1801 
1851 
1901 
1951 
2001 
2051 
2101 
2151 
2201 



ATTGCGCGAA 
AAAACCTCAT 
GATGCCGCCA 
GGCGCAAATG 
CCCTGCTTGC 
TCCGAGCGCT 
AGTGCAGAAA 
AAGAACGCGC 
GTTACCGACA 
CCTGCAATTT 
CC AAAGAAAC 
TGGCAGCTCA 
C GAAAAAT AC 
GCCTCGAATA 
AAATACAATA 
CATTTACAAA 
ACATCCGCGC 
ACGCTGGGCA 
cgactACATC 
TCATCGTcgg 
GATATGCacc 
AGAAGGCGGC 
GCCAACTCTT 
CTCGCCGCCG 
GACCCCGCAC 
ACTTCGCCTA 
AAAGTCGAAG 
GCGCGTCGAA 
TTTACGAAGG 
TACATCCGCC 
ACTCGACAAG 
TTGCCGAAAA 
GGACAAGGCG 
GAACGAACCG 
CCAAAATCAA 
GGCTTGATGA 
TATTGCCGGC 
CCTGCCCCTC 
GACGCAAGTT 
CGAAATCCGC 
CGCTCGCCCG 
GACTTGGAAG 
CCTCCCGCGC 
TTACCCGGCT 



TCAGCCCGAT 
TGGTTCGACA 
CGGTACCGCA 
CGCCGTATGG 
GTCGACGAAC 
CGACATCGGA 
GCAACAGCAC 
CTTACCCACT 
ACAGCAAGCG 
TCCGCGTCGT 
TTAAGCAACG 
CCTCGACATC 
AATGGCAGCT 
CGCGAAATCG 
CATCGAAAAC 
TCCACTTTGA 
AAAATGGTGA 
CGTGCGGATT 
TCGTCCACAG 
GCCAACCCCA 
cccGGAagaa 
AATTCaaCga 
AAAGGCGATT 
GGACTGGCGC 
CCTTCAAAAC 
GGCAAAGTCC 
CGCCCTGCAC 
GGCAGATTGT 
ATCATTACCG 
CTGGGTCAAA 
AGCAAAACGC 
CAGCTTGCCA 
TCTCGGCTAC 
AAATTTCCAA 
CCGCCCGTGC 
AAAAGGTGGC 
CCACGCTTGC 
TTCGTTACCC 
TTTCCGACAC 
GGGCGGCGTT 
GCCCAAGACC 
CCACAAACTC 
CCAGCATGAG 
GTCCTCGCCG 
TTAA 



TCAAGACACG 
GCTACTGCGC 
TGGTCGCTGG 
CGAGCCGCTG 
TCGACCTGCT 
CGCTACGTCC 
CGTCGCCGAG 
TCGCCCGGGT 
GAAACCATGC 
ATTAATCAAA 
CCCCCGACAG 
TTCGCCCCGC 
CGAAGATTTG 
CCCTGCTTTT 
TTCCTCGATA 
AGTCGCCGGC 
AGAAAAAACT 
CTGGTCGATA 
CCTCTGGCAG 
AAGgc aACGg 
aaaggtgtgg 
ATTCGGTGTC 
CCGCCTACGA 
GAAAATATGG 
CGAGCTTTTC 
TCTCTCTGCC 
AGCAGCATCG 
GCCGCTGTCC 
CCAAAGAAGG 
TCCGGCAAGG 
CGACACCGTG 
AACTCACGCC 
AAAAAGCCAG 
CCGCGCCATC 
CCGTCAGCGC 
AAAACCGGCG 
CAAATGCTGC 
GCGAGCGCGG 
CTTGCCGAAC 
GCAGGAAGGG 
GCTCCGGGCT 
AACGTTACCG 
GTTCACGCTC 
GCCTCGGCGA 



CAAAGCGCGA 
CGCTCTGCCG 
CGCAGGAACA 
CCCGACCACT 
GCCCGATGCC 
CCGATTGGAA 
CTGGTCAAAG 
GGACAGCCTC 
GGAAAATGCT 
CTGGCGATGC 
CCCTGAAAAA 
TCGCCAACCG 
GGCTTCCGCC 
GGACGAAAAA 
TCCTGCGTAC 
CGTCCGAAAC 
CAGCTTCGAC 
CCGTCCCCGA 
CCCATTCCCG 
ttATAAAAGt 
aagtgcAAAT 
GCCGCCCACT 
ACAAAAAATC 
CGGAAAGCGG 
AACGACACGA 
AACGGGCGCA 
GCGACCGCTG 
ACCCCGCTCG 
GCATCCTTCC 
CCATCGGCAA 
CGCGAAGAAG 
CAAACCCAAC 
AAGACCTCTA 
CAAAAAGCCT 
AACCACCATC 
TGCTCATCGA 
AAACCCGCGC 
CATTTCCGTC 
ACGCGCCCGA 
CAAGTGTTCG 
TTTGCGCGAC 
CCGTGCAAAC 
GAAGTCAAAC 
TGTCAAAGGC 



CCCTGCAAGA 
GACAACGATA 
TTATCCTGCC 
TCCTCGGCGC 
GTCGCCGCCA 
CCTATTGGTT 
GTGTGGACGA 
GCCACGCCGG 
GCTGGCGATG 
GTACGCGCAC 
CGCGCCGTCG 
CTTGGGCGTG 
ATCAAGAACC 
CGCACCGAAC 
GGAACTCAAA 
ACATCTACTC 
GGCCTGTTCG 
GTGTTACACC 
GCGagttCGA 
TTGCACACCG 
CCGCACCTTC 
GGCGTTACAA 
GCCTGGTTGC 
CAAGGAAGAC 
TTTATGTTTT 
ACCCCCATCG 
CCGGGGCGCG 
AAAACGGACA 
GTCAACTGGC 
AATCCGCGCC 
GCCGTGTCCA 
CTGCAAGAGC 
CACCGCCGTC 
GCGGCACGCT 
GTCAAACAGT 
CGGCGAAGAC 
CGCCCGACGA 
CACCGCAAAA 
AAAAGTACTG 
CCGTCGATAT 
GTATCCGACG 
CCAGTCCCGC 
AAGTCAACGA 
GTATTGAGCG 



This corresponds to the amino acid sequence <SEQ ID 418; ORF 117-l.ng>: 



gl 17-1 .pep 

1 MTAISPIQDT 



51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 



DAATPYGEPL 
SERCNSTVAE 
VTDIRVVLIK 
WQLKWQLEDL 
KYNIHFEVAG 
TLGIVHSLWQ 
DMHQFNEFGV 
LAAAFKTELF 
KVEGQIVPLS 
YIRQQNADTV 
GQGEISNRAI 
GLMTTLAKCC 
DASWAALQEG 
DLEASMRFTL 



QSATLQELRE 
PDHFLGAAQM 
LVKGVDEVQK 
LAMRTRTLQF 
GFRHQEPEKY 
RPKHIYSIYK 
PIPGEFDDYI 
AAHWRYKEGG 
NDTI YVLTPH 
TPLENGQRVE 
REEGRVQLDK 
QKACGTLNEP 
KPAPPDDIAG 
QVFAVDIEIR 
EVKQVNDLPR 



WFDSYCAALP 
VDELDLLPDA 
LTHFARVDSL 
LSNAPDSPEK 
REIALLLDEK 
KMVKKKLSFD 
ANPKGNGYKS 
KGDSAYEQKI 
GKVLSLPTGA 
IITAKEGHPS 
QLAKLTPKPN 
PPVPVSATTI 
FVTRERGISV 
AQDRSGLLRD 
VLAGLGDVKG 



DNDKNLIGTA 
VAATLLADIG 
ATPEERAQQA 
RAVAKETLDI 
RTERLEYIEN 
GLFDIRAVRI 
LHTVIVGPEE 
AWLRQLLDWR 
TPIDFAYALH 
VNWLYEGWVK 
LQELAENLGY 
VKQSKIKKGG 
HRKTCPSFRH 
VSDALARHKL 
VLSVTRL* 



WSLAQEHYPA 
RYVPDWNLLV 
ETMRK MLLAM 
FAPLANRLGV 
FLDILRTELK 
LVDTVPECYT 
KGVEVQIRTF 
ENMAESGKED 
SSIGDRCRGA 
SGKAIGKIRA 
KKPEDLYTAV 
KTGVLIDGED 
LAEHAPEKVL 
NVTAVQTQSR 



The following partial DNA sequence was identified in N. meningitidis <SEQ ED 419>: 

mll7-l . seq 

1 ATGACCGCCA TCAGCCCGAT TCAAGACACG CAAAGCGCGA CTCTGCAAGA 
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51 ATTGCGCGAA TGGTTCGACA GCTACTGCGC CGCTCTGCCG GACAACGATA 

101 AAAACCTCAT CGGTACCGCA TGGTTGCTGG CGCAGGAACA TTACCCCGCC 

151 GATGCCGCCA CGCCGTATGG CGAGCCGCTG CCCGACCACT TCCTCGGCGC 

201 GGCGCAAATG GTTCATGAAC TCGACCTGCT CCCCGATGCC GTCGCCGCCA 

251 CCCTGCTTGC CGACATCGGA CGCTACGTCC CCGACTGGAA CCTATTGGTT 

301 TCCGAACGCT GCAACAGTAC CGTCGCCGAG CTGGTCAAAG GTGTGGACGA 

351 AGTGCAGAAA CTCACCCACT TCGCCCGGGT GGACAGCCTC GCCACGCCGG 

401 AAGAACGCGC CCAGCAGGCA GAAACTATGC GGAAAATGCT GCTGGCGATG 

451 GTTACCGACA TCCGCGTCGT GTTAATCAAA CTGGCGATGC GTACGCGCAC 

501 CCTGCAATTT TTAAGCAACG CCCCCGACAG CCCCGAAAAA CGCGCCGTCG 

551 CCAAAGAAAC CCTCGACATC TTCGCCCCGC TCGCCAACCG TTTGGGCGTG 

601 TGGCAGCTCA AATGGCAGCT CGAAGATTTG GGCTTCCGCC ATCAAAAGCC 

651 CGAAAAATAC CGCGAAATCG CGCTGCTTTT GGACGAAAAA CGCACCGAAC 

701 GCCTCGAATA CATCGAAAAC TTCCTCAACA TCCTGCGCGG TGAACTCAAG 

751 AAATACAATG TCCATTTCGA AGTCGCCGGC CGCCCGAAAC ACATCTACTC 

801 CATTTACAAA AAAATGGTGA AGAAAAAACT CAGCTTCGAC GGCCTCTTTG 

851 ACATCCGCGC CGTGCGAATT CTGGTTGATA CCGTCCCCGA GTGTTACACC 

901 ACGCTGGGTA TCGTCCACAG CCTCTGGCAG CCC ATTCCCG GCGAGTTCGA 

951 CGACTACATC GCCAATCCCA AAGGCAACGG CTATAAAAGT TTGCACACCG 

1001 TCATCGTCGG CCCGGAAGAC AAAGGCGTGG AAGTACAAAT CCGCACCTTC 

1051 GATATGCACC AATTCAACGA ATTCGGTGTC GCCGCCCACT GGCGTTACAA 

1101 AGAGGGCGGC AAGGGCGATT CCGCCTACGA ACAGAAAATC GCCTGGTTGC 

1151 GCCAACTCTT GGACTGGCGC GAAAACATGG CGGAAAGCGG CAAGGAAGAC 

1201 CTCGCCGCCG CCTTCAAAAC CGAGCTTTTC AACGACACGA TTTATGTTTT 

1251 GACCCCGCAC GGCAAAGTCC TCTCCCTGCC CACGGGCGCG ACCCCCATCG 

1301 ACTTCGCCTA CGCCCTGCAC AGCAGCATCG GCGACCGTTG CCGCGGTGCG 

1351 AAAGTCGAAG GGCAGATTGT GCCGCTGTCC ACCCCGCTCG AAAACGGACA 

1401 GCGCGTCGAA ATCATTACCG CCAAAGAAGG GCATCCTTCC GTCAACTGGC 

1451 TTTACGAAGG CTGGGTCAAA TCCAACAAGG CAATCGGCAA AATCCGCGCC 

1501 TACATCCGCC AGCAAAACGC CGACACCGTG CGCGAAGAAG GCCGCGTCCA 

1551 ACTCGACAAA CAGCTTGCCA AACTCACGCC CAAACCCAAC CTGCAAGAGC 

1601 TTGCCGAAAA TCTCGGCTAC AAAAAGCCAG AAGACCTCTA CACCGCCGTC 

1651 GGACAAGGCG AAATTTCCAA CCGCGCCATC CAAAAAGCCT GCGGCACGCT 

1701 GAACGAACCG CCGCCCGTAC CCGTCAGCGA AACCACCATC GTCAAACAGT 

17 51 CCAAAATCAA AAAAGGCGGC AAAAACGGCG TGCTCATCGA CGGCGAAGAC 

1801 GGTCTGATGA CCACGCTTGC CAAATGCTGC AAACCCGCGC CGCCCGACGA 

1851 TATTATCGGC TTCGTTACCC GCGAGCGCGG CATTTCAGTG CACCGCAAAA 

1901 CCTGCCCGTC TTTCCAACAC CTCGCCGAAC ACGCGCCCGA AAAAGTGCTG 

1951 GACGCAAGCT GGGCGGCATT GCAGGAAGGA CAAGTATTCG CCGTCGATAT 

2001 CGAAATCCGC GCCCAAGACC GCTCCGGGCT TTTGCGCGAC GTATCCGACG 

2051 CGCTCGCCCG CCACAAACTC AACGTTACCG CCGTGCAAAC CCAGTCCCGC 

2101 GACTTGGAAG CCAGCATGAG GTTCACGCTC GAAGTCAAAC AAGTCAACGA 

2151 CCTCCCGCGC GTCCTCGCCA GCCTCGGCGA CGTCAAAGGC GTATTGAGCG 

2201 TTACCCGGCT TTAA 

This corresponds to the amino acid sequence <SEQ ID 420; ORF 117-1>: 

mll7-l .pep 

1 MTAISPIQDT QSATLQELRE WFDSYCAALP DNDKNLIGTA WLLAQEHYPA 

51 DAATPYGEPL PDHFLGAAQM VHELDLLPDA VAATLLADIG RYVPDWNLLV 

101 SERCNSTVAE LVKGVDEVQK LTHFARVDSL ATPEERAQQA ETMRKMLLAM 

151 VTDIRVVLIK LA MRTRTLQF LSNAPDSPEK RAVAKETLDI FAPLANRLGV 

201 WQLKWQLEDL GFRHQKPEKY REIALLLDEK RTERLEYIEN FLNILRGELK 

251 KYNVHFEVAG RPKHIYSIYK KMVKKKLSFD GL'FDIRAVRI LVDTVPECYT 

301 TLGIVHSLWQ PI PGEFDDYI ANPKGNGYKS LHTVI VGPED KGVEVQIRTF 

351 DMHQFNEFGV AAHWRYKEGG KGDSAYEQKI AWLRQLLDWR ENMAESGKED 

401 LAAAFKTELF NDTI YVLTPH GKVLSLPTGA TPI DFAYALH SSIGDRCRGA 

4 51 KVEGQIVPLS TPLENGQRVE IITAKEGHPS VNWLYEGWVK SNKAIGKIRA 

501 YIRQQNADTV REEGRVQLDK QLAKLTPKPN LQELAENLGY KKPEDLYTAV 

551 GQGEISNRAI QKACGTLNEP PPVPVSETTI VKQSKIKKGG KNGVLIDGED 

601 GLMTTLAKCC KPAPPDDIIG FVTRERGISV HRKTCPSFQH LAEHAPEKVL 

651 DASWAALQEG QVFAVDIEIR AQDRSGLLRD VSDALARHKL NVTAVQTQSR 

701 DLEASMRFTL EVKQVNDLPR VLASLGDVKG VLSVTRL* 

mll7-l/gll7-l 98.2% identity in 737 aa overlap 

10 20 30 40 50 60 

mll7-l .pep MTAISPIQDTQSATLQELREWFDSYCAALPDNDKNLIGTAWLLAQEHYPADAATPYGEPL 
I I M I I I ! t I I I I I 1 I I I I I I 1 I 1 I I I I I I I I I I I I I I t I I II I I I I I I I I I I I I I I I I 
gll7-l MTAISPIQDTQSATLQELREWFDSYCAALPDNDKNLIGTAWSLAQEHYPADAATPYGEPL 
10 20 30 40 50 60 

70 80 90 100 110 120 
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ml 17-1 .pep PDHFLGAAQMVHELDLLPDAVAATLLADIGRYVPDWNLLVSERCNSTVAELVKGVDEVQK 

I I I I I I I I I I I I I I I I 11 I I I I I I I I I I I I t I I I I I ! I I 1 I I I I I I I I I I I I I 1 I I I I I 
gll7-l PDHFLGAAQMVDELDLLPDAVAATLLADIGRYVPDWNLLVSERCNSTVAELVKGVDEVQK 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 17-1 .pep LTHFARVDSLATPEERAQQAETMRKMLLAMVTDIRVVLIKLAMRTRTLQFLSNAPDSPEK 

II I I t I I I I I I I I I I I I I I f I I I I I I I I I I I I I I I f I I I I I I! I I I I I I I I I I I I I I I I I 
gil7-l LTHFARVDSLATPEERAQQAETMRKMLLAMVTDIRVVLI KLAMRTRTLQFLSNAPDSPEK 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml 1 7 - 1 . pep RAVAKETLDI FAPLANRLGVWQLKWQLEDLGFRHQKPEKYREI ALLLDEKRTERLEYIEN 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I : I I I I I I I I I I I I I I I i I I I I I I I I 
gll7-l RAVAKETLDI FAPLANRLGVWQLKWQLEDLGFRHQEPEKYRE I ALLLDEKRTERLEYIEN 

190 200 210 220 230 240 

250 260 270 280 290 300 

mll7-l .pep FLNILRGELKKYNVHFEVAGRPKHI YSI YKKMVKKKLSFDGLFDIRAVRILVDTVPECYT 
11:111 I I I I I I : I I I I I t I I I II I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I II I I 
gll7-l FLDILRTELKKYNIHFEVAGRPKHI YSI YKKMVKKKLSFDGLFDIRAVRILVDTVPECYT 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml 17-1 . pep TLGIVHSLWQPI PGEFDDYIANPKGNGYKSLHTVI VGPEDKGVEVQI RTFDMHQFNEFGV 
I I M I I I I I I I I I I I I I I I I I I I I I 11 I I I I I I I I t I I I : I I I II I I I I I I I I I I I I I I I 
gll7-l TLG I VHSLWQP I PGEFDDY I AN PKGNGYKSLHTVIVGPEEKGVEVQI RTFDMHQFNEFGV 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml 17-1 .pep AAHWRYKEGGKGDSAYEQKIAWLRQLLDWRENMAESGKEDLAAAFKTELFNDTIYVLTPH 
I I I I I I I I I I I I I I I I I I I II I I I I M I i I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I 
gll7-l AAHWRYKEGGKGDSAYEQKIAWLRQLLDWRENMAESGKEDLAAAFKTELFNDTI YVLTPH 

370 380 390 400 410 420 

430 440 450 460 470 480 

ml 17-1 .pep GKVLSLPTGATPIDFAYALHSS IGDRCRGAKVEGQIVPLSTPLENGQRVEIITAKEGHPS 
I I I I I I II II I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
gll7-l GKVLSLPTGATPIDFAYALHSSIGDRCRGAKVEGQI VPLSTPLENGQRVEIITAKEGHPS 

430 440 450 460 470 480 

490 500 510 520 530 540 

ml 1 7 - 1 . pep VNWLYEGWVKSNKAI GKI RAY I RQQNADTVREEGRVQLDKQLAKLTPKPNLQELAENLGY 
I I I I I I I I I I I : I I I II I I M I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 
g 1 1 7 - 1 VNWLYEGWVKSGKAI GKI RAYI RQQNADT VREEGRVQLDKQLAKLTPKPNLQELAENLG Y 

490 500 510 520 530 540 

550 560 570 580 590 600 

ml 17-1 .pep KKPEDLYTAVGQGEI SNRAIQKACGTLNEPPPVPVSETTIVKQSKIKKGGKNGVLI DGED 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : t I I I ! I I I 
gll7-l KKPEDLYTAVGQGEISNRAIQKACGTLNEPPPVPVSATTIVKQSKIKKGGKTGVLI DGED 

550 560 570 580 590 600 

610 620 630 640 650 660 

ml 17 - 1 . pep GLMTTLAKCCKPAPPDDI IGFVTRERGI SVHRKTCPSFQHLAEHAPEKVLDASWAALQEG 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I : I I I I I I I I I I I I I I I I I I I I I 
g 117-1 GLMTTLAKCCKPAPPDDI AGFVTRERGISVHRKTC PS FRHLAEHAPEKVLDASWAALQEG 

610 620 630 640 650 660 

670 680 690 700 710 720 

ml 17-1 .pep QVFAVDIEIRAQDRSGLLRDVSDALARHKLNVTAVQTQSRDLEASMRFTLEVKQVNDLPR 
I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
g 1 1 7 - 1 QVFAVD I EI RAQDRSGLLRDVSDALARHKLNVTAVQTQSRDLEASMRFTLEVKQVNDLPR 

670 680 690 700 710 720 

730 

mll7-l .pep VLASLGDVKGVLSVTRLX 
I I I : I I I I I II II I I II I 
gll7-l VLAGLGDVKGVLSVTRLX 
730 



mll7-l/RelA 
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sp I P55133 |RELA_VIBSS GTP PYROPHOSPHOKINASE (ATP : GTP 3 ' -PYRO PHOSPHOTRANSFERASE) (PPGPP 
SYNTHETASE I) >gi I 537617 (U13769) ppGpp synthetase I [Vibrio sp.] Length = 744 
Score = 536 bits (1366), Expect = e-151 

Identities = 288/685 (42%), Positives = 432/685. (63%), Gaps = 31/685 (4%). 

Query: 7 4 LDLLPDAVAATLLADI GRYVPDWNLLVSERCNSTVAELVKGVDEVQKLTHFARVDSL 130 

L+ D+ALL + GY D +E +T+ LV+GV+++ ++ ++ S 
Sbjct: 68 LSMDADTLIAALLYPLVEGGCYSTD ALKEEYSGTI LHLVQGVEQMCAI S QLKST 121 

Query: 131 ATPEERAQQAETMRKMLLAMVTDIRVVLIKLAMRTRTLQFLSNAPDSPEKRAVAKETLDI 190 

A +A Q + +R+MLL+MV D R V+IKLA R L+ + + PD +RA A+E +1 

Sbjct: 122 AEETAQAAQVDN I RRMLLSMVDDFRC VV I KLAERI CNLREVKDQPDEV- RRAAAQECAN I 180 

Query: 191 FAPLANRLGVWQLKWQLEDLGFRHQKPEKYREIALLLDEKRTERLEYIENFLNILRGELK 250 

+APLANRLG+ QLKW++ED FR+Q P+ Y++IA L E+R +R +YI + F++ L +K 
Sbjct: 181 YAPLANRLGIGQLKWEIEDYAFRYQHPDTYKQIAKQLSERRIDREDYITHFVDDLSDAMK 240 

Query: 251 KYNVHFEVAGRPKHI YSI YKKMVKKKLS FDGLFDIRAVRILVDTVPECYTTLGI VHSLWQ 310 

N+ EV GRPKHIYSI++KM KK L FD LFD+RAVRI+ + + +CY LG+VH+ ++ 
Sbjct: 241 ASNI RAEVQGRPKHI YSIWRKMQKKSLEFDELFDVRAVRIVAEELQDCYAALGVVHTKYR 300 

Query: 311 PIPGEFDDYIANPKGNGYKSLHTVI VGPEDKGVEVQIRTFDMHQFNEFGVAAHWRYKEG- 369 

+P EFDDY+ANPK NGY+S+HTV++GPE K +E+QIRT MH+ +E GVAAHW+YKEG 
Sbjct: 301 HLPKEFDDYVANPKPNGYQSIHTVVLGPEGKTIEIQIRTKQMHEESELGVAAHWKYKEGT 360 

Query: 370 — GKGDSAYEQKIAWLRQLLDWRENMAESGKEDLAAAFKTELFNDTI YVLTPHGKVLSLP 427 

G SAY++KI WLR+LL W+E M++SG ++ ++++F+D +Y TP G V+ LP 

Sbjct: 361 ASGGAQSAYDEKINWLRKLLAWQEEMSDSG — EMLDELRSQVFDDRVYAFTPKGDWDLP 418 

Query: 428 TGATPIDFAYALHSSIGDRCRGAKVEGQIVPLSTPLENGQRVEIITAKEGHPSVNWLYE- 486 

+ ATP+DFAY +HS +G RC GAKVEG+IVP + L+ G +VEIIT KE +PS +WL 
Sbjct: 419 SNATPLDFAYHIHSEVGHRCIGAKVEGRIVPFTYHLQMGDQVEIITQKEPNPSRDWLNPN 478 

Query: 487 -GWVKSNKAIGKIRAYIRQQNADTVREEGRVQLDKQLAKL — TPKPNLQELAENLGYKKP 54 3 

G+V S++A K+ A+ R+Q+ D G+ L+ +L K+ T K A+ K P 

Sbjct: 479 LGFVTSSRARAKVHAWFRKQDRDKNIIAGKEILEAELVKIHATLKDAQYYAAKRFNVKSP 538 

Query: 544 EDLYTAVGQGEIS-NRAIQKACGTLNEPPPVPVSETTIVKQSKI KKGGKNGV 594 

E+LY +G G++ N+ I +N+P + + K S+ KK ++ V 

Sbjct: 539 EELYAGIGSGDLRINQVINHINALVNKPTAEEEDQQLLEKLSEASNKQATSHKKPQRDAV 598 

Query: 595 LI DGEDGLMTTLAKCCKPAPPDDI IGFVTRERGI SVHRKTCPSFQHLAEHAPEKVLDASW 654 

+++G D LMT LA+CC+P P DDI GFVT+ RGISVHR C + L HAPE+++D W 
Sbjct: 599 WEGVDNLMTHLARCCQPI PGDDIQGFVTQGRGI S VHRMDCEQLEELRHHAPERI I DTVW 658 

Query: 655 AALQEGQVFAVDIEIRAQDRSGLLRDVSDALARHKLNVTAVQTQ — SRDLEASMRFTLEV 712 

G + + + + A +R+GLL+++++ L K+ V ++++ + + M F LE+ 
Sbjct: 659 GGGFVGN-YTITVRVTASERNGLLKELTNTLMNEKVKVAGMKSRVDYKKQMSIMDFELEL 717 

Query: 713 KQVNDLPRVLASLGDVKGVLSVTRL 737 

+ L RVL + VK V RL 
Sbjct: 718 TDLEVLGRVLKRI EQVKDVAEAKRL 7 42 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 42 1>: 

all7-l . seq 

1 ATGACCGCCA TCAGCCCGAT TCAAGACACG CAAAGCGCGA CTCTGCAAGA 

51 ATTGCGCGAA TGGTTCGACA GCTACTGCAC CGCGCTGCCG AACAACGATA 

101 AAAAACTTGT CTTAGCCGCC CGTTCGCTGG CGGAAGCACA TTACCCCGCC 

151 GATGCCGCCA CGCCGTATGG CGAACCGCTG CCCGACCACT TCCTCGGCGC 

201 GGCGCAAATG GTTCATGAAC TCGACCTGCT CCCCGATGCC GTCGCCGCCA 

251 CCCTGCTTGC CGACATCGGA CGCTACGTCC CCGACTGGAA CCTATTGGTT 

301 TCCGAACGCT GCAACAGTAC CGTCGCCGAG CTGGTCAAAG GTGTGGACGA 

351 AGTGCAGAAA CTCACCCACT TCGCCCGGGT GGACAGCCTC GCCACGCCGG 

4 01 AAGAACGCGC CCAGCAGGCA GAAACTATGC GGAAAATGCT GCTGGCGATG 

4 51 GTTACCGACA TCCGCGTCGT GTTAATCAAA CTGGCGATGC GTACGCGCAC 

501 CCTGCAATTT TTAAGCAACG CCCCCGACAG CCCCGAAAAA CGCGCCGTCG 

551 CCAAAGAAAC CCTCGACATC TTCGCCCCGC TCGCCAACCG TTTGGGCGTG 

601 TGGCAGCTCA AATGGCAGCT CGAAGATTTG GGCTTCCGCC ATCAAGAACC 

651 CGAAAAATAC CGCGAAATCG CCCTGCTTTT GGACGAAAAA CGCACCGAAC 

701 GCCTCGAATA CATCGAAAAC TTCCTTAATA TCCTGCGTAC GG AAC T C AAA 

751 AAATACAATA TCCACTTTGA AGTCGCCGGC CGTCCGAAAC ACATCTACTC 

801 CATTTACAAA AAAATGGTGA AGAAAAAACT CAGCTTCGAC GGGTTGTTCG 
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8 51 ACATCCGCGC CGTGCGGATT CTGGTTGATA CCGTCCCCGA GTGTTACACC 

901 ACACTGGGCA TTGTCCACAG CCTCTGGCAG CCCATTCCCG GCGAGTTCGA 

951 ■ CGACTACATC GCCAACCCGA AAGGCAACGG CTATAAAAGT TTGCACACCG 

1001 TCATCGTCGG CCCGGAAGAC AAAGGCGTGG AAGTGCAAAT CCGCACCTTC 

1051 GATATGCACC AATTCAACGA ATTCGGTGTC GCCGCGCACT GGCGTTACAA 

1101 AGAGGGCGGC AAAGGCGATT CCGCCTACGA ACAAAAAATC GCCTGGTTAC 

1151 GCCAACTTTT GGACTGGCGC GAAAACATGG CGGAAAGCGG CAAGGAAGAC 

1201 CTCGCCGCCG CCTTCAAAAC CGAGCTTTTC AACGACACGA TTTATGTTTT 

1251 GACCCCGCAC GGCAAAGTCC TCTCCCTGCC CACAGGCGCG ACCCCCATCG 

1301 ACTTCGCCTA CGCCCTGCAC AGCAGCATCG GCGACCGTTG CCGCGGTGCG 

13 51 AAAGTCGAAG GGCAGATTGT GCCGCTGTCC ACCCCGCTCG AAAACGGACA 

14 01 GCGTGTCGAA ATCATTACCG CCAAAGAAGG GCATCCTTCC GTCAACTGGC 

14 51 TTTACGAAGG CTGGGTCAAA TCCAACAAGG CAATCGGCAA AATCCGCGCC 
1501 TACATCCGCC AGCAAAACGC CGACACCGTG CGCGAAGAAG GCCGCGTCCA 

15 51 ACTCGACAAA CAGCTTGCCA AACTCACGCC CAAACCCAAC CTGCAAGAGC 
1601 TTGCCGAAAA TCTCGGCTAC AAAAAGCCAG AAGACCTCTA CACCGCCGTC 
1651 GGACAAGGCG AAATTTCCAA CCGCGCCATC C AAAAAGC C T GCGGCACGCT 
1701 GAACGAACCG CCGCCCGTAC CCGTCAGCGA AACCACCATC GTCAAACAGT 
17 51 CCAAAATCAA AAAAGGCGGC AAAAACGGCG TGCTCATCGA CGGCGAAGAC 
1801 GGTCTGATGA CCACGCTTGC CAAATGCTGC AAACCCGCGC CGCCCGACGA 
1851 CATTGTCGGC TTCGTTACCC GCGATCGCGG CATTTCGGTA CACCGCAAAA 
1901 CCTGCCCCTC TTTCCGACAC CTCGCCGAAC ACGCGCCCGA AAAAGT AC T G 
1951 GACGCAAGTT GGGCGGCGTT GCAGGAAGGA CAAGTGTTCG CCGTCGATAT 
2001 CGAAATCCGC GCCCAAGACC GCTCCGGGCT TTTGCGCGAC GTATCCGACG 
2051 CGCTCGCCCG CCACAAACTC AACGTTACCG CCGTGCAAAC CCAGTCCCGC 
2101 GACTTGGAAG CCAGCATGAG GTTCACGCTC GAAGTCAAAC AAGTTACCGA 
2151 CCTCCCACGC GTCCTCGCCA GCCTCGGCGA CGTCAAAGGC GTATTGAGCG 
2201 TTACCCGGCT TTAA 



This corresponds to the amino acid sequence <SEQ ID 422; ORF 1 17-l.a>: 

all7-l .pep 

1 MTAISPIQDT QSATLQELRE WFDSYCTALP NNDKKLVLAA RSLAEAHYPA 

51 DAATPYGEPL PDHFLGAAQM VHELDLLPDA VAATLLADIG RYVPDWNLLV 

101 SERCNSTVAE LVKGVDEVQK LTHFARVDSL ATPEERAQQA ETMRKMLLAM 

151 VTDIRVVLIK LAMRTRTLQF LSNAPDSPEK RAVAKETLDI FAPLANRLGV 

201 WQLKWQLEDL GFRHQEPEKY REIALLLDEK RTERLEYIEN FLNILRTELK 

251 KYNIHFEVAG RPKHIYSIYK KMVKKKLSFD GLFDIRAVRI LVDTVPECYT 

301 TLGIVHSLWQ PI PGEFDDYI ANPKGNGYKS LHTVIVGPED KGVEVQIRTF 

351 DMHQFNEFGV AAHWRYKEGG KGDSAYEQKI AWLRQLLDWR ENMAESGKED 

401 LAAAFKTELF NDTI YVLTPH GKVLSLPTGA TPI DFAYALH SSIGDRC RGA 

4 51 KVEGQIVPLS TPLENGQRVE I ITAKEGHPS VNWLYEGWVK SNKAIGKIRA 

501 YIRQQNADTV REEGRVQLDK QLAKLTPKPN LQELAENLGY KKPEDLYTAV 

551 GQGEISNRAI QKACGTLNEP PPVPVSETTI VKQSKIKKGG KNGVLIDGED 

601 GLMTTLAKCC KPAPPDDIVG FVTRDRGISV HRKTCPSFRH LAEHAPEKVL 

651 DASWAALQEG QVFAVDIEIR AQDRSGLLRD VSDALARHKL NVTAVQTQSR 

701 DLEASMRFTL EVKQVTDLPR VLASLGDVKG VLSVTRL* 

all7-l/mll7-l 97.7% identity in 737 aa overlap 



10 20 30 40 50 60 

ml 17-1 .pep MTAISPIQDTQSATLQELREWFDSYCAALPDNDKNLIGTAWLLAQEHYPADAATPYGEPL 
i I I I I I I I I I I I I I I M I I! I M I I I : I M : I I I : I : : I I I : I I I I I I I I I I I I I I 
all7-l MTAISPIQDTQSATLQELREWFDSYCTALPNNDKKLVLAARSLAEAHYPADAATPYGEPL 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml 17-1 .pep PDHFLGAAQMVHELDLLPDAVAATLLADIGRYVPDWNLLVSERCNSTVAELVKGVDEVQK 
I I I I I I I I i I I I I I I M I I i I I 1 I I I It I I I I I I I I I I I I I I M I I I I I I I I I I I It I i I 
all7-l PDHFLGAAQMVHELDLLPDAVAATLLADIGRYVPDWNLLVSERCNSTVAELVKGVDEVQK 
70 80 90 100 110 120 



130 140 150 160 170 180 

ml 17-1 .pep LTHFARVDSLATPEERAQQAETMRKMLLAMVTDIRVVLIKLAMRTRTLQFLSNAPDSPEK 
I I I I I I I I I I II It I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
a 1 1 7 - 1 LTHFARVDSLATPEERAQQAETMRKMLLAMVTDIRVVLI KLAMRTRTLQFLSNAPDS PEK 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml 17-1 .pep RAVAKETLDI FAPLANRLGVWQLKWQLEDLGFRHQKPEKYREIALLLDEKRTERLEYIEN 
I I I I 1 I I I I I I I I I I I I M M I I I I I I I I I I I I I I : I I I I I I I I I i I I I i I I I It I I 11 I 
all7-l RAVAKETLDI FAPLANRLGVWQLKWQLEDLGFRHQEPEKYREIALLLDEKRTERLEYIEN 

190 200 210 220 230 240 
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250 260 270 280 290 300 

mll7-l .pep FLNILRGELKKYNVHFEVAGRPKHI YSI YKKMVKKKLSFDGLFDIRAVRILVDTVPECYT 
| | | | I I I I I I I I : I! I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I ! I I M I I I I I 1 i 
all7-l FLNILRTELKKYNIHFEVAGRPKHIYSI YKKMVKKKLSFDGLFDIRAVRILVDTVPECYT 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml 17-1 .pep T LG I VHSLWQP I PGEFDDYI AN PKGNGYKSLHT VI VGPEDKGVEVQIRTFDMHQFNEFGV 
I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I t ! I I I I I 
al!7-l TLG I VHSLWQP I PGEFDDY I AN PKGNGYKSLHTV I VGPEDKGVEVQIRTFDMHQFNEFGV 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml 17-1 .pep AAHWRYKEGGKGDSAYEQKI AWLRQLLDWRENMAESGKEDLAAAFKTELFNDTI YVLTPH 
I I I I I I I I I I I I I I I I I I I ! I I I I I I I II I I I I I I I 1 I I I I I ! I I I I I I I I I I I I I I I I I 
all7-l AAHWRYKEGGKGDSAYEQKI AWLRQLLDWRENMAESGKEDLAAAFKTELFNDTI YVLTPH 

370 380 390 400 410 420 

430 440 450 460 470 480 

mll7-l .pep GKVLSLPTGATPIDFAYALHSSIGDRCRGAKVEGQIVPLSTPLENGQRVEIITAKEGHPS 
I I I I I I I I I II I I I II I I I I I I I I I I II I I I II I I I I I I I I I I I I I I I I I I I I I I I I I i I 
all7-l GKVLSLPTGATPIDFAYALHSSIGDRCRGAKVEGQIVPLSTPLENGQRVEIITAKEGHPS 

430 440 450 460 470 480 

490 500 510 520 530 540 

ml 17-1 .pep VNWLYEGWVKSNKAIGKIRAYIRQQNADTVREEGRVQLDKQLAKLTPKPNLQELAENLGY 
I I I I I I I I I I I I I I I I t I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I j I I I I I I I I I I I I I t 
a 1 1 7 - 1 VNWLYEGWVKSNKAI GKI RAY I RQQNADT VREEGRVQLDKQLAKLTPKPNLQELAENLGY 

490 500 510 520 530 540 

550 560 570 580 590 600 

mll7-l .pep KKPEDLYTAVGQGEISNRAIQKACGTLNEPPPVPVSETTIVKQSKIKKGGKNGVLIDGED 
! I I I I I I I I I I I I I I I I I I t I It I I I I I I I I I I I I I I I I M I I I I M I I I I I I M I I I I I 
all7-l KKPEDLYTAVGQGEISNRAIQKACGTLNEPPPVPVSETTIVKQSKIKKGGKNGVLIDGED 

550 560 570 580 590 600 

610 620 630 640 650 660 

mll7-l .pep GLMTTLAKCCKPAPPDDIIGFVTRERGISVHRKTCPSFQHLAEHAPEKVLDASWAALQEG 
I I I I I I I I I I I I I I I I I I : II I I I : I I I I I I I I I I I I ! : II I I I I I It II I 1 I I I I I i I I 
all7-l GLMTTLAKCCKPAPPDDIVGFVTRDRGISVHRKTCPSFRHLAEHAPEKVLDASWAALQEG 

610 620 630 640 650 660 

670 680 690 700 710 720 

ml 17-1 . pep QVFAVDIEIRAQDRSGLLRDVSDALARHKLNVTAVQTQSRDLEASMRFTLEVKQVNDLPR 
I I I I I I I I I I I I It II I I I I I I I I I I I t I I I I I 1 I I I I I I I I I I I I I I I I I I I I I : I I I I 
a 117-1 QVFAVDIEIRAQDRSGLLRDVSDALARHKLNVTAVQTQSRDLEASMRFTLEVKQVTDLPR 
670 680 690 700 710 720 

730 

ml 17-1. pep VLASLGDVKGVLSVTRLX 
I 1 I I I I I I I I I 1 I I t I I I 
all 7-1 VLASLGDVKGVLSVTRLX 
730 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 423>: 

gll8 . seq 

1 ATGTGCGAGT TCAAGGATTT TAGAAGAAAC ATCCCTTGTT TTGAAGAGTA 

51 TGACGAAAAT TCATTTATTG GCAAATGGTA TGATGACGGG GTGTGGGATG 

101 ATGAAGAATA TTGGAAGCTG GAGAATGATT TAATcgaGGT TAGGAGAAAA 

151 TATCCTTATC CGATGGATAT ACCAAGGGAT ATTGTGATTG GAATCGGTAC 

201 CATTATTGAT TTTTTAATGG TTCCAAATTG GGAGCTTTTT GAAATTAAAG 

2 51 CTTCCCCTTG GTTGCCTGAT AGCGTGGGAA TTCATGAACG TTATGAAAGA 

301 TTCACAACGA TGCTCCGTTA TATTTTTACC GAGAAAGACA TAGTCAACGT 

351 GCGATTTGAT TATTACAaCA AAAAATAG 

This corresponds to the amino acid sequence <SEQ ID 424; ORF 1 18.ng>: 

gll8 . pep 

1 MCEFKDFRRN IPCFEEYDEN SFIGKWYDDG VWDDEEYWKL ENDLIEVRRK 

51 YPYPMDI PRD IVIGIGTIID FLMVPNW ELF EIKASPWLPD SVGIHERYER 

101 FTTMLRYIFT EKDIVNVRFD YYNKK* 
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The following partial DNA sequence was identified in N. meningitidis <SEQ ID 425>: 

ml 18 . seq 

1 , ATGTGTGAGT TCAAGGATAT TATAAGAAAC GTTCCTTATT TTGAGGGGTA 

51 TGACGAAAAT TCATTTATTG GCAAATGGTA TGATGACGGG GTGTGGGATG 

101 ATGAAGAATA TTGGAAGTTG GAGAATGATT TAATCGAGGT TAGAAAAAAA 

151 TATCCTTATC CGATGGACAT ACCAAGATAT GTTGTCATTG GAATCGGTAC 

201 CATTATTGAT TTCTTAATGG TTCCAAATTG GAAACTTTTT GAAATTAAAG 

251 CTTCCCCTTG GTTGCCTGAT AGTGTGGGAA TTCATGAACG TTATGAAAGA 

301 TTCACAACGA TGCTCCGTTA TATTTTTACC GAGAAAGACA TAGTCAACGT 

351 GCGATTTGAT TATTACAACA AAAAATAG 

This corresponds to the amino acid sequence <SEQ ID 426; ORF 1 1 8>: 

ml 18. pep 

1 MCEFKDIIRN VPYFEGYDEN SFIGKWYDDG VWDDEEYWKL ENDLIEVRKK 

51 YPYPMDIPRY VVIGIGTIID FLMVPNW KLF EIKASPWLPD SVGIHERYER 

101 FTTMLRYIFT EKDIVNVRFD YYNKK* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 118 shows 92.8% identity over a 125 aa overlap with a predicted ORF (ORF 1 18.ng) 
from N. gonorrhoeae: 

mll8/gll8 



10 20 30 40 50 60 

ml 18 . pep MCEFKDIIRNVPYFEGYDENSFIGKWYDDGVWDDEEYWKLENDLIEVRKKYPYPMDI PRY 

MINI: I I : I II I I M I I I I I I I I I I I M II I II I I I I I 1 I I M : I 1 I M I I I I I 
gll8 MCEFKDFRRNIPCFEEYDENSFIGKWYDDGVWDDEEYWKLENDLIEVRRKYPYPMDIPRD 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 18 .pep VVIGIGTIIDFLMVPNWKLFEIKASPWLPDSVGIHERYERFTTMLRYIFTEKDIVNVRFD 
: I I I I I I I I I I I I I I I I : I II I I I I I I M I I I II i I I I I I I I I I I I I I M I I I I I I I I I I 
gll8 IVIGIGTI IDFLMVPNWELFEIKASPWLPDSVGIHERYERFTTMLRYIFTEKDIVNVRFD 

70 80 90 100 110 120 

ml 18. pep YYNKKX 
I I I I I I 

gll8 YYNKKX 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 427>: 
all8 . seq 

1 ATGTGTGAGT TCAAGGATTT TAGAAGAAAC ATCCCTTGTT TTGAAGAGTA 

51 TGACGAAAAT TCATTTATTG GCAAATGGTA TGATGACGGG GTGTGGGATG 

101 ATGAAGAATA TTGGAAATTG GAGAATGATT TAATCGAGGT TAGAAAAAAA 

151 TATCCTTATC CG AT G GAT AT ACCAAGGGAT ATTGTGATTG GAATCGGTAC 

201 CATTATTGAT TTTTTAATGG TTCCAAATTG GGAGCTTTTT GAAATTAAAG 

251 CTTCCCCTTG GTTGCCTGAT AGTGTGGGAA TTCATGAACG TTATGAAAGA 

301 TTCACAACGA TGCTCCGTTA TATTTTTACC GAGAAAGACA TAGTCAACGT 

351 GCGATTTGAT TATTACAACA AAAAATAG 

This corresponds to the amino acid sequence <SEQ ID 428; ORF 1 1 8.a>: 
all8.pep 

1 MCEFKDFRRN IPCFEEYDEN SFIGKWYDDG VWDDEEYWKL ENDLIEVRKK 

51 YPYPMDIPRD IVIGIGTIID FLMVPNW ELF EIKASPWLPD SVGIHERYER 

101 FTTMLRYIFT EKDIVNVRFD YYNKK* 



mll8/all8 93.6% identity in 125 aa overlap 

10 20 30 40 50 60 

ml 18 .pep MCEFKDIIRNVPYFEGYDENSFIGKWYDDGVWDDEEYWKLENDLIEVRKKYPYPMDI PRY 

MINI: I I " I II II I I I I I I I I M i I I I I I I I I I I I I I I I II I M I II I I I M I I 
all 8 MCEFKDFRRNI PCFEEYDENSFIGKWYDDGVWDDEEYWKLENDLIEVRKKYPYPMDI PRD 

10 20 30 40 50 60 
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70 80 90 100 110 120 

ml 18 .pep WIGIGTIIDFLMVPNWKLFEIKASPWLPDSVGIHERYERFTTMLRYIFTEKDIVNVRFD 
: I I I I I I I M I 1 I II I I : I M I I I I M I I I M M I I M I I I! 1 j I I I I I I I I I M I ! I I I 
a 118 IVIGIGTIIDFLMVPNWELFEIKAS PWLPDSVGIHERYERFTTMLRYI FTEKDIVNVRFD 

"?0 80 90 100 110 120 



ml 18. pep YYNKKX 
I I I I I I 

all8 YYNKKX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 429>: 

gl20 . seq 

1 ATGATGAAGA CTTTTAAAAA TATATTTTCC GCCGCCATTT TGTCCGCCGC 

51 CCTGCCGTGC GCGTATGCGG CAAGGCTACC CCAATCCGCC GTGCTG CACT 

101 ATTCCGGCAG CTACGGCATT CCCGCCACGA TGACATTTGA ACGCAGCGGC 

151 AATGCTTACA AAATCGTTTC GACGATTAAA GTGCCGCTAT ACAATATCCG 

201 TTTCGAATCC GGCGGTACGG TTGTCGGCAA TACCCTGCAC CCTGCCTACT 

251 ATAAAGACAT ACGCAGGGGC AAACTGTATG CGGAAGCCAA ATTCGCCGAC 

3 01 GGCAGCGTAA CCTACGGCAA AGCGGGCGAG AGCAAAACCG AGCAAAGCCC 
351 CAAGGCTATG GATTTGTTCA CGCTTGCCTG GCAGTTGGCG GCAAATGACG 
401 CGAAACTCCC CCCGGGTCTG AAAATCACCA ACGGCAAAAA ACTTTATTCC 

4 51 GTCGGCGGCC TGAATAAGGC GGGTACGGGA AAAT AC AG CA Taggcggcgt 
501 gGAAACCGAA GTCGTCAAAT ATCGGGTGCG GCGCGGCGAC GATACGGTAA 
551 CGTATTTCTT CGCACCGTCC CTGAACAATA TTCCGGCACA AATCGGCTAT 
601 ACCGAcgaCG G C AAAACCT A TACGCTGAAG CTCAAATCGG TGCAGATCAA 
651 CGGACAGGCC GCCAAACCGT AA 

This corresponds to the amino acid sequence <SEQ ID 430; ORF 120.ng>: 

gl20 .pep 

1 MMKTFKNIFS AAILSAALPC AYA ARLPQSA VLHYSGSYGI PATMTFERSG 

51 NAY KI VST IK VPLYNIRFES GGTWGNT LH PAYYKDI RRG KLYAEAKFAD 

101 GSVTYGKAGE SKTEQSPKAM DLFTLAWQLA ANDAKLP PGL KITNGKKLYS 

151 VGGLNKAGTG KYSIGGVETE WKYRVRRGD DTVTYFFAPS LNNIPAQIGY 

201 TDDGKTYTLK LKSVQINGQA AKP* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 43 1>: 

ml20 . seq 

1 ATGATGAAGA CTTTTAAAAA TATATTTTCC GCCGCCATTT TGTCCGCCGC 

51 CCTGCCGTGC GCGTATGCGG CAGGGCTGCC CCAATCCGCC GTGCTGmACT 

101 ATTCCGGCAG CTACGGCATT CCCGCCACGA TGACATTTGA ACGCAGCGGC 

151 AATGCTTACA AAATCGTTTC GACGATTAAA GTGCCGCTAT ACAATATCCG 

201 TTTCGAGTCC GGCGGTACGG TTGTCGGCAA TACCCTGCAC CCTACCTACT 

251 ATAGAGACAT ACGCAGGGGC AAACTGTATG CGGAAGCcAA ATTCGCCGAC 

301 GGCAGCGTAA CTTACGGCAA AGCGGGCGAG AGCAAAACCG AGCAAAGCCC 

351 CAAGGCTATG GATTTGTTCA CGCTTGCCTG GCAGTTGGCG GCAAATGACG 

401 CGAAACTCCC CCCGGGGCTG AAAATCACCA ACGGCAAAAA ACTTTATTCC 

451 GTCGGCGGTT TGAATAAGGC GGGTACAGGA AAATACAGCA TAGGCGGCGT 

501 GGAAACCGAA GTCGTCAAAT ATCGGGTGCG GCGCGGCGAC GATGCGGTAA 

551 TGTATTTCTT CGCACCGTCC CTGAACAATA TTCCGGCACA AATCGGCTAT 

601 ACCGACGACG GCAAAACCTA TACGCTGAAA CTCAAATCGG TGCAGATCAA 

651 CGGCCAGGCA GCCAAACCG 

This corresponds to the amino acid sequence <SEQ ID 432; ORF 120>: 

ml20 .pep 

1 MMKTFKNIFS AAILSAALPC AYA AGLPQSA VLXYSGSYGI PATMTFERSG 

51 NAYKIVSTIK VPLYNIRFES GGTWGNT LH PTYYRDIRRG KLYAEAKFAD 

101 GSVTYGKAGE SKTEQSPKAM DLFTLAWQLA AND AKL P PGL KITNGKKLYS 

151 VGGLNKAGTG KYSIGGVETE WKYRVRRGD DAVMYF FAPS LNNIPAQIGY 

201 TDDGKTYTLK LKSVQINGQA AKP 
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Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 120 shows 97.3% identity over a 223 aa overlap with a predicted ORF (ORF 120.ng) 
from TV. gonorrhoeae: 

ml20/gl20 

10 20 30 40 50 60 

ml 2 0 . pep MMKTFKNI FSAAI LSAALPCAYAAGLPQSAVLXYSGS YGI PATMTFERSGNAYKI VSTI K 

1 1 1 ! I I 1 1 I 1 1 1 1 1 ] I I I I I J I 1 1 lllllll IIMIIIIIMIIMIIIIIIIMIII 

9 1 2 0 MMKTFKNI FSAAILS AALPCAYAARLPQSAVLHYSGSYGI PATMTFERSGNAYKI VSTI K 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml20 . pep VPLYNIRFESGGTWGNTLHPTYYRDIRRGKLYAEAKFADGSVTYGKAGESKTEQSPKAM 

IIIIIIIIIIIINIIIIMI:||:|IMIIMIIIIIIIIMMIIIIIIIIIIIIIII 

gl2 0 V P L YN I R F E S GGTWGNT LH PAY Y KD I RRG KL Y AE AKF ADG S VT Y G KAGE S KT EQ S P KAM 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml2 0 . pep DLFTLAWQLAANDAKLPPGLKI TNGKKLYSVGGLNKAGTGKYS I GGVETEWKYRVRRGD 

MIIIIMIIIIMIIIMIIIIIIIIIMIIIIIIIMIMMIMIIIMIIIIIIM 

gl2 0 DLFTLAWQLAANDAKLPPGLKI TNGKKLYSVGGLNKAGTGKYS I GGVETEWKYRVRRGD 

130 140 150 160 170 180 

190 200 210 220 

ml20 - pep DAVMYFFAPSLNNIPAQIGYTDDGKTYTLKLKSVQINGQAAKP 

hi I Ml M I Ml I MM I I I M Ml I III I Ml II Mil M 

g 1 2 0 DTVTYFFAPSLNNI PAQIGYTDDGKTYTLKLKSVQINGQAAKPX 

190 200 210 220 

The following partial DNA sequence was identified in N. meningitidis <SEQ ED 43 3>: 
al20 . seq 

1 ATGATGAAGA CTTTTAAAAA TATATTTTCC GCCGCCATTT TGTCCGCCGC 

51 CCTGCCGTGC GCGTATGCGG CAGGGCTGCC CCAATCCGCC GTGCTGCACT 

101 ATTCCGGCAG CTACGGCATT CCCGCCACGA TGACATTTGA ACGCAGCGGC 

151 AATGCTTACA AAATCGTTTC G AC G ATT AAA GTGCCGCTAT ACAATATCCG 

201 TTTCGAGTCC GGCGGTACGG TTGTCGGCAA TACCCTGCAC CCTACCTACT 

251 ATAGAGACAT ACGCAGGGGC AAACTGTATG CGGAAGCCAA ATTCGCCGAC 

301 GGCAGCGTAA CCTACGGCAA AGCGGGCGAG AGCAAAACCG AGCAAAGCCC 

351 CAAGGCTATG GATTTGTTCA CGCTTGCCTG GCAGTTGGCG GCAAATGACG 

4 01 CGAAACTCCC CCCGGGGCTG AAAATCACCA ACGGCAAAAA ACTTTATTCC 

4 51 GTCGGCGGTT TGAATAAGGC GGGTACAGGA AAATACAGCA TAGGCGGCGT 

501 GGAAACCGAA GTCGTCAAAT ATCGGGTGCG GCGCGGCGAC GATGCGGTAA 

551 TGTATTTCTT CGCACCGTCC CTGAACAATA TTCCGGCACA AATCGGCTAT 

601 ACCGACGACG GCAAAACCTA TACGCTGAAA CTCAAATCGG TGCAGATCAA 

651 CGGCCAGGCA GCCAAACCGT AA 

This corresponds to the amino acid sequence <SEQ ED 434; ORF 120.a>: 

al20 .pep 

1 MMKTFKNI FS AAILSAALPC AYA AGLPQSA VLHYSGSYGI PATMTFERSG 

51 NAYKIVSTIK VPLYNIRFES GGTWGNTLH PTYYRDIRRG KLYAEAKFAD 

101 GSVTYGKAGE SKTEQSPKAM DLFTLAWQLA ANDAKLPPGL KITNGKKLYS 

151 VGGLNKAGTG KYSIGGVETE WKYRVRRGD DAVMYFFAPS LNNIPAQIGY 

201 TDDGKTYTLK LKSVQINGQA AKP* 

ml20/al20 99.6% identity in 223 aa overlap 

10 20 30 40 50 60 

ml20.pep MMKTFKNI FS AAILSAALPC AY AAGLPQSAVLXYSGSYG I PATMTFERSGNAYKIVSTIK 

I M I I I I I I f I i I I I I I I I | | | | I | | M | | ! | | | | | | | | | | M I I I I I I I I I I I I 1 I M 
a!2 0 MMKT FKNIFSAAILSAALPCAYAAGLPQSAVLHYSGSYGIPATMT FERSGNAYKIVSTIK 
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10 20 30 40 50 60 

70 80 90 100 110 120 

ml20 . pep " VPLYNIRFESGGTWGNTLHPTYYRDIRRGKLYAEAK FADGSVTYGKAGESKTEQSPKAM 
I I I I I M I I I I M I I I I M M I ! j I [ I I I I I I I I I 1 I I I M I M i j I j I I M 1 I I I I I II 
al20 VPLYNIRFESGGTWGNTLHPTYYRDIRRGKLYAEAKFADGSVTYGKAGESKTEQSPKAM 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml20 .pep DLFTLAWQLAANDAKLPPGLKITNGKKLYSVGGLNKAGTGKYSIGGVETEWKYRVRRGD 
I I M I I M I II I I M I I I I I I I I I I | | | M | 1 f | M I I I I I I I I M I I I I M I t I I I I M 
a!2 0 DLFTLAWQLAANDAKLPPGLKITNGKKLYSVGGLNKAGTGKYSIGGVETEWKYRVRRGD 

130 140 150 160 170 180 

190 200 210 220 

ml 20 .pep DAVMYFFAPSLNNIPAQIGYTDDGKTYTLKLKSVQINGQAAKPX 
I M M I I I I I I I I I I I I II I I I I I I M I I I I I | | | | M I I I I M 
a 120 DAVMYFFAPSLNNI PAQIGYTDDGKTYTLKLKSVQINGQAAKPX 

190 200 210 220 



The following 

gl21 . seq 

1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 



partial DNA sequence was identified in TV. gonorrhoeae <SEQ ID 43 5>: 



ATGGAAACAC 
GGCGGATGCC 
AAGGGCACGC 
GATTTGCAGG 
GCAAGAACTC 
GTCAAAACCT 
ACCGTCCGAC 
GCCGCTGCTG 
GCCGCGACCT 
CACGAAGCCC 
CGGCGGGATT 
GCTTCGACAC 
cacTGGcagc 
catatTGCcg 
AACCCcaccc 
gaaacctAcc 
ttcccgattc 
CAGATGCCCG 
TTAATGGCGG 
CACCGCCGAA 
cgtggttggC 
GCGACCGGCG 
A 



AGCTTTACAT 
GTGCTGGTAC 
CTTTACCCCC 
ACACAGGCAC 
AGCCGCCTGT 
CGCTCCGTGC 
ACGCGCCGGA 
GCGGAACTGa 
TGCTGCCGGC 
TGTTCCGCGA 
GCCAACATCA 
AGGGCCGGGC 
TGCCTTACGA 
CAACTGCTCG 
aaAAAGCACG 
ttgacggcgg 
accgcgcaaA 
TCAAATGTAC 
ATTTGGCAGA 
CTGAACCTCG 
GGCGTGTTGG 
CATCCAAACC 



CGGCATTATG 
GGATGGACGG 
TACCCTGACC 
AGACGAACTG 
ACGCGCAAAC 
GACATTACCG 
ACACGGTtac 
cgcggatttT 
GGacaAGGTG 
TGACAGGGAA 
GCGTACTCCC 
AATATGCTGA 
CAAAAacggt 
gcaggctGCT 
GGgcGCGaac 
cgaaaaccga 
ccgTttggga 
ATTTGCGGCG 
ATGTTTCGGC 
ATCCTCAATG 
ATTAACCGCA 
GTGTATTCTG 



TCGGGAACCA 
CGGCAAATGG 
GGTTGCGCCG 
CACCGCAGCA 
CGCCGCCGAA 
CCCTCGGCTG 
AG CAT AC AG C 
TACCGTCggc 
CGCCGCTCGT 
ACACGCGTGG 
CCCCGGCGCA 
TGGAcgcgtg 
gcAAAGgcgg 
CGCCcaccCG 
TgtttgcccT 
tacgacgtat 
cgccgtctca 
GCGGCATCCG 
ACACGCGTTT 
GGTGGAGGCG 
TTCCCGGTAG 
GGCGCGGGAT 



GTATGGACGG 
CTGGGCGCGG 
CAAATTGCTG 
GGATGTTGTC 
CTGCTGTGCA 
CCACGGGCAA 
TTGCCGATTT 
gacttcCGCA 
CCCCGCCTTT 
TACTGAACAT 
CCCGCCTTCG 
gacgcaggca 
cacAAGGCAA 
TATTTCTCAC 
AAattggctc 
tgcggacgct 
CACGCAGCGG 
CAATCCTGTT 
CCCTGCACAG 
gccgCATTtg 
TCCGCACAAA 
ATTATTATTG 



This corresponds to the amino acid sequence <SEQ ID 436; ORF 121.ng>: 

g!21 .pep 

1 METQLYIGIM SGTSMDGADA VLVRMDGGKW LGAEGHAFTP YPDRLRRKLL 

51 DLQDTGTDEL HRSRMLSQEL SRLYAQTAAE LLCSQNLAPC DITALGCHGQ 

101 TVRHAPEHGY SIQLADLPLL AELTRIFTVG DFRSRDLAAG GQGAPLVPAF 

151 HEALFRDDRE TRWLNIGGI ANISVLPPGA PAFGFDTGPG NMLMDAWTQA 

201 HWQLPYDKNG AKAAQGNILP QLLGRLLAHP YFSQPHPKST GRELFALNWL 

251 ETYLDGGENR YDVLRTLSRF TAQTVWDAVS HAAADARQMY ICGGGIRNPV 

301 LMADLAECFG TRVSLHSTAE LNLDPQWVEA AAFAWLAACW INRIPGSPHK 

351 ATGASKPCIL GAGYYY+ 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 437>: 

ml 21 . seq 

1 ATGGAAACAC AGCTTTACAT CGGCATCATG TCGGGAACCA GCATGGACGG 

51 GGCGGATGCC GTACTGATAC GGATGGACGG CGGCAAATGG CTGGGCGCGG 

101 AAGGGCACGC CTTTACCCCC TACCCCGGCA GGTTACGCCG CCAATTGCTG 
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151 GATTTGCAGG 

2 01 GCAAGAACTC 

2 51 GTCAAAACCT 
301 ACCGTCCGAC 

3 51 GCCGCTGCTG 

4 01 xxxxxxxxxx 
4 51 xxxxxxxxxx 
501 xxxxxxxxxx 
551 xxxxxxxxxx 
601 xxxxxxCAGC 
651 CATATTGCCG 
701 AACGCCACCC 

7 51 GAAACCTACC 

8 01 TTCCCGTTTT 
8 51 CAGATGCCCG 
901 TTAATGGCGG 
951 CACCGCCGAC 

1001 CGTGGTTGGC 

1051 GCAACCGGCG 

1101 A 

This corresponds to the amino acid sequence <SEQ ID 438; ORF 121>: 

ml 2 1 . pep 

1 METQLYIGIM SGTSMDGADA VLIRMDGGKW LGAEGHAFTP YPGRLRRQLL 

51 DLQDTGADEL HRSRILSQEL SRLYAQTAAE LLCSQNLAPS DITALGCHGQ 

101 TVRHAPEHGY SIQLADLPLL Axxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

151 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

201 xxQLPYDKNG AKSAQGNILP QLLDRLLAHP YFAQRHPKST GRELFAINWL 

251 ETYLDGGENR YDVLRTLSRF TAQTVCDAVS HAAADARQMY ICDGGIRNPV 

301 LMADLAECFG TRVSLHSTAD LNLDPQWVEA AXFAWLAACW INRIPGSPHK 

351 ATGASKPCIL XAGYYY* 

Computer analysis of the amino acid sequences gave the following results: 

Homology with a predicted ORF from N. meningitidis menA with menB 

ORF 121 shows 73.5% identity over a 366 aa overlap with a predicted ORF (ORF12Lng) 

from N. gonorrhoeae: 

ml21/gl21 

10 20 30 40 50 60 

ml 21 . pep METQLYIGIMSGT SMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRQLLDLQDTGADEL 

I I It I I I I I I I I I I I I I M 1 I I : 1 I I I I t I I It I I I It I I I I I I I I : I i I II I I I : I I I 
g 1 2 1 METQLY I GIMSGT SMDGADAVLVRMDGGKWLGAEGHAFT P Y PDRLRRKLLDLQDTGT DE L 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml21 . pep HRSRILSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHGYSIQLADLPLL 
I I I I : I I I I It I I I I I I I I I M I I I I I I I I I I I I I I I I I I I M M 1 I I I I I I I I I I I I I 
gl21 HRSRMLSQELSRLYAQTAAELLCSQNLAPCDITALGCHGQTVRHAPEHGYSIQLADLPLL 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 2 1 . pep AXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 

I : : : 
gl21 AELTRI FTVGDFRSRDLAAGGQGAPLVPAFHEALFRDDRETRWLNIGGIANISVLPPGA 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml 21 . pep XXXXXXXXXXXXXXXXXXXXXXQLPYDKNGAKSAQGN I LPQLLDRLLAHP YFAQRHPKST 

: : I I I I I I M I I : M I I I I I I M I I I I I I I I : \ I I I I I 

gl21 PAFGFDTGPGNMLMDAWTQAHWQLPYDKNGAKAAQGNILPQLLGRLLAHPYFSQPHPKST 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml21 . pep GRELFAINWLETYLDGGENRYDVLRTLSRFTAQTVCDAVSHAAADARQMYICDGGIRNPV 

I M I I I : I I I I I I I I I I I I I I I I I I I I I | | I I | | M I I I M I I I I I I I I I I I I 

gl21 GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVWDAVSHAAADARQMYICGGGIRNPV 

250 260 270 280 290 300 



ACACAGGCGC 
AGCCGCCTAT 
CGCACCGTCC 
ACGCGCCGGA 
GCGxxxxxxx 
xxxxxxxxxx 
xxxxxxxxxx 
xxxxxxxxxx 
xxxxxxxxxx 
TTCCTTACGA 
CAACTGCTCG 
TAAAAGCACG 
TTGACGGCGG 
ACCGCGCAAA 
TCAAATGTAC 
ATTTGGCAGA 
CTGAACCTCG 
GGCGTGTTGG 
CATCCAAACC 



AGACGAACTG 
ATGCGCAAAC 
GACATTACCG 
ACACGGTTAC 
xxxxxxxxxx 
xxxxxxxxxx 
xxxxxxxxxx 
xxxxxxxxxx 
xxxxxxxxxx 
CAAAAACGGT 
ACAGGCTGCT 
GGGCGCGAAC 
CGAAAACCGA 
CCGTTTGCGA 
ATTTGCGACG 
ATGTTTCGGC 
ATCCGCAATG 
ATTAATCGCA 
GTGTATTCTG 



CACCGCAGCA 
CGCCGCCGAA 
CCCTCGGCTG 
AGCATACAGC 
xxxxxxxxxx 
xxxxxxxxxx 
xxxxxxxxxx 
xxxxxxxxxx 
xxxxxxxxxx 
GCAAAGTCGG 
CGCCCACCCG 
TGTTTGCCAT 
TACGACGTAT 
CGCCGTCTCA 
GCGGCATCCG 
ACACGCGTTT 
GGTGGAAGCC 
TTCCCGGTAG 
AnCGCGGGAT 



GGATTTTGTC 
CTGCTGTGCA 
CCACGGGCAA 
TTGCCGATTT 
xxxxxxxxxx 
xxxxxxxxxx 
xxxxxxxxxx 
xxxxxxxxxx 
xxxxxxxxxx 
CACAAGGCAA 
TATTTCGCAC 
AAATTGGCTC 
TGCGGACGCT 
CACGCAGCGG 
CAATCCTGTT 
CCCTGCACAG 
GCCGnATTTG 
TCCGCACAAA 
ATTATTATTG 
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310 320 330 340 350 360 

ml21 .pep LHADLAECFGTRVSLHSTADLNLDPQWVEAAXFAWLAACWINRI PGSPHKATGASKFCIL 

I | I I I I M I I I I I I M I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I t I I I I 
g 121 LMADLAE C FGTRV S LHSTAELNLDPQWVEAAAFAWLAACW I NR I PGSPHKATGASKPCIL 

310 320 330 340 350 360 

ml 21. pep XAGYYYX 
I I I I I I 

gl21 GAGYYYX 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 439>: 



al21 . seq 




1 


AT GG AAAC AC 


51 


GGCGGATGCC 


101 


AAGGGCACGC 


151 


GATTTGCAGG 


201 


GCAAGAACTC 


251 


GTCAAAACCT 


301 


ACCGTCAGAC 


351 


GCCGCTGCTG 


401 


GCCGCGACCT 


451 


CACGAAGCCC 


501 


CGGCGGGATT 


551 


GCTTCGACAC 


601 


CACTGGCAGC 


651 


CATATTGCCG 


701 


AACCCCACCC 


751 


GAAACCTACC 


801 


TTCCCGATTC 


851 


CAGATGCCCG 


901 


TTAATGGCGG 


951 


CACCGCCGAA 


1001 


CATGGATGGC 


1051 


GCAACCGGCG 


1101 


A 



AGCTTTACAT CGGCATCATG 
GTACTGATAC GGATGGACGG 
CTTTACCCCC TACCCCGGCA 
ACACAGGCGC GGACGAACTG 
AGCCGCCTGT ACGCGCAAAC 
CGCGCCGTCC GACATTACCG 
ACGCGCCGGA ACACAGTTAC 
GCGGAACGGA CTCAGATTTT 
TGCGGCCGGC GGACAAGGCG 
TGTTCCGCGA CGACAGGGAA 
GCCAACATCA GCGTACTCCC 
AGGACCGGGC AATATGCTGA 
TTCCTTACGA CAAAAACGGT 
CAACTGCTCG ACAGGCTGCT 
TAAAAGCACG GGGCGCGAAC 
TTGACGGCGG CGAAAACCGA 
ACCGCGCAAA CCGTTTTCGA 
TCAAATGTAC ATTTGCGGCG 
ATTTGGCAGA ATGTTTCGGC 
CTGAACCTCG ATCCGCAATG 
GGCGTGTTGG GTCAACCGCA 
CATCCAAACC GTGTATTCTG 



TCGGGAACCA GCATGGACGG 
CGGCAAATGG CTGGGCGCGG 
GGTTACGCCG CAAATTGCTG 
CACCGCAGCA GGATGTTGTC 
CGCCGCCGAA CTGCTGTGCA 
CCCTCGGCTG CCACGGGCAA 
AGCGTACAGC TTGCCGATTT 
TACCGTCGGC GACTTCCGCA 
CGCCGCTCGT CCCCGCCTTT 
ACACGCGCGG TACTGAACAT 
CCCCGACGCA CCCGCCTTCG 
TGGACGCGTG GATGCAGGCA 
GCAAAGGCGG CACAAGGCAA 
CGCCCACCCG TATTTCGCAC 
TGTTTGCCCT AAATTGGCTC 
TACGACGTAT TGCGGACGCT 
CGCCGTCTCA CACGCAGCGG 
GCGGCATCCG CAATCCTGTT 
ACACGCGTTT CCCTGCACAG 
GGTAGAAGCC GCCGCGTTCG 
TTCCCGGTAG TCCGCACAAA 
GGCGCGGGAT ATTATTATTG 



This corresponds to the amino acid sequence <SEQ ID 440; ORF 121. a>: 

al21.pep 

1 METQLYIGIM SGTSMDGADA VLIRMDGGKW LGAEGHAFTP YPGRLRRKLL 

51 DLQDTGADEL HRSRMLSQEL SRLYAQTAAE LLCSQNLAPS DITALGCHGQ 

101 TVRHAPEHSY SVQLADLPLL AERTQIFTVG DFRSRDLAAG GQGAPLVPAF 

151 HEALFRDDRE TRAVLNIGGI ANISVLPPDA PAFGFDTGPG NMLMDAWMQA 

201 HWQLPYDKNG AKAAQGNILP QLLDRLLAHP YFAQPHPKST GRELFALNWL 

2 51 ETYLDGGENR YDVLRTLSRF TAQTVFDAVS HAAADARQMY ICGGGIRNPV 

301 LMADLAECFG TRVSLHSTAE LNLDPQWVEA AAFAWMAACW VNRIPGSPHK 

351 ATGASKPCIL GAGYYY* 



ml21/a!21 



74.0% identity in 366 aa overlap 



10 20 30 40 50 60 

ml21 . pep METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRQLLDLQDTGADEL 
I I I II I I I M I I I I I I ! I I I I I I I I I M I II ! 1 I ! i It I I M I I I M : I I I I I II I I ! I I 
al21 METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRKLLDLQDTGADEL 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml21 .pep HRSRILSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHGYSIQLADLPLL 
I I I I : ! I I I I I I I I I I I I I M I I I I I II I I M I II M I I I I I II II I i : I I : I I I I I I I I 
a l21 HRSRMLSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHSYSVQLADLPLL 

70 80 90 100 110 120 
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346 



130 140 150 160 170 180 

.121. pep AXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 

! : : : 
121 AERTQIFTVGDFRSRDLAAGGQGAPLV PAFHEAL FRDDRETRAVLNIGGIANI SVLPPDA 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml 21 . pep XXXXXXXXXXXXXXXXXXXXXXQLPYDKNGAKSAQGNILPQLLDRLLAHPYFAQRHPKST 
: M I I I II I 11 : i M 1 I i I I i i I I I I i I 1 I I M Mill 

a 121 PAFGFDTGPGNMLMDAWMQAHWQLPYDKNGAKAAQGNILPQLLDRLLAHPYFAQPHPKST 

190 200 210 220 230 240 



250 260 270 280 290 300 

ml 21 -pep GRELFAINWLETYLDGGENRYDVLRTLSRFTAQTVCDAVSHAAADARQMYICDGGIRNPV 
I I I I M : I I I M I I I M I II I I I I I I t I I I I I I I I M 11 I II I I I I I I I I I I I I M I I 
al21 GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVFDAVSHAAADARQMYICGGGIRNPV 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml21 .pep LMADLAECFGTRVSLHSTADLNLDPQWVEAAXFAWLAACWINRIPGSPHKATGASKPCIL 
I I M I I I I I M M I I I I I I : I M I I I I I I I I II I : I I 1 I : I I I I I I I I I I I I t I I M II 
a l21 LMADLAECFGTRVSLHSTAELNLDPQWVEAAAFAWMAACWVNRIPGSPHKATGASKPCIL 

310 320 330 340 350 360 

ml21.pep XAGYYYX 
I I I I I I 

a!21 GAGYYYX 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 441 >: 

ml21-l . seq 

1 ATGGAAACAC AGCTTTACAT CGGCATCATG TCGGGAACCA GCATGGACGG 

51 GGCGGATGCC GTACTGATAC GGATGGACGG CGGCAAATGG CTGGGCGCGG 

101 AAGGGCACGC CTTTACCCCC TACCCCGGCA GGTTACGCCG CCAATTGCTG 

151 GATTTGCAGG ACACAGGCGC AGACGAACTG CACCGCAGCA GGATTTTGTC 

201 GCAAGAACTC AGCCGCCTAT ATGCGCAAAC CGCCGCCGAA CTGCTGTGCA 

251 GTCAAAACCT CGCACCGTCC GACATTACCG CCCTCGGCTG CCACGGGCAA 

301 ACCGTCCGAC ACGCGCCGGA ACACGGTTAC AGCATACAGC TTGCCGATTT 

351 GCCGCTGCTG GCGGAACGGA CGCGGATTTT TACCGTCGGC GACTTCCGCA 

401 GCCGCGACCT TGCGGCCGGC GGACAAGGCG CGCCACTCGT CCCCGCCTTT 

451 CACGAAGCCC TGTTCCGCGA CAACAGGGAA ACACGCGCGG TACTGAACAT 

501 CGGCGGGATT GCCAACATCA GCGTACTCCC CCCCGACGCA CCCGCCTTCG 

551 GCTTCGACAC AGGGCCGGGC AATATGCTGA TGGACGCGTG GACGCAGGCA 

601 CACTGGCAGC TTCCTTACGA CAAAAACGGT GCAAAGGCGG CACAAGGCAA 

651 CATATTGCCG CAACTGCTCG ACAGGCTGCT CGCCCACCCG TATTTCGCAC 

701 AACCCCACCC TAAAAGCACG GGGCGCGAAC TGTTTGCCCT AAATTGGCTC 

751 GAAACCTACC TTGACGGCGG CGAAAACCGA TACGACGTAT TGCGGACGCT 

801 TTCCCGTTTT ACCGCGCAAA CCGTTTGCGA CGCCGTCTCA CACGCAGCGG 

851 CAGATGCCCG TCAAATGTAC ATTTGCGGCG GCGGCATCCG CAATCCTGTT 

901 TTAATGGCGG ATTTGGCAGA ATGTTTCGGC ACACGCGTTT CCCTGCACAG 

951 CACCGCCGAC CTGAACCTCG ATCCGCAATG GGTGGAAGCC GCCGNATTTG 

1001 CGTGGTTGGC GGCGTGTTGG ATTAATCGCA TTCCCGGTAG TCCGCACAAA 

1051 GCAACCGGCG CATCCAAACC GTGTATTCTG ANCGCGGGAT ATTATTATTG 

1101 A 



This corresponds to the amino acid sequence <SEQ ED 442; ORF 121-1>: 

ml21-l .pep 

1 METQLYIGIM SGTSMDGADA VLIRMDGGKW LGAEGHAFTP YPGRLRRQLL 

51 DLQDTGADEL HRSRILSQEL SRLYAQTAAE LLCSQNLAPS DITALGCHGQ 

101 TVRHAPEHGY SIQLADLPLL AERTRI FT VG DFRSRDLAAG GQGAPLVPAF 

151 HEALFRDNRE TRAVLNIGGI AMI SVLPPDA PAFGFDTGPG NMLMDAWTQA 

201 HWQLPYDKNG AKAAQGNILP QLLDRLLAHP YFAQPHPKST GRELFALNWL 

251 ETYLDGGENR YDVLRTLSRF TAQTVCDAVS HAAADARQMY ICGGGIRNPV 

301 LMADLAECFG TRVSLHSTAD LNLDPQWVEA AXFAWLAACW INRIPGSPHK 

351 ATGASKPCIL XAGYYY* 

ml21-l/gl21 95.6% identity in 366 aa overlap 



10 20 30 40 50 60 
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ml 2 1 - 1 pep METQLYI GIMSGTSMDGADAVLI RMDGGKWLGAEGHAFT PYPGRLRRQLLDLQDTGADEL 

| || | | | | | | | | | I M II I I II I : I II i M I I I M I I I I t I I : I I M I I I I : I I t 

gl2 l METQLYI GIMSGTSMDGADAVLVRMDGGKWLGAEGHA FT PYPDRLRRKLLDLQDTGTDEL 

10 20 30 40 50 60 

70 60 90 100 110 120 

ml21-l pep HRSRILSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHGYSIQLADLPLL 
| | | | : | | | | | | | | | | | I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I ! M I I I 
g 121 HRSRMLSQELSRLYAQTAAELLCSQNLAPCDITALGCHGQTVRHAPEHGYSIQLADLPLL 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml21-l pep AERTRIFTVGDFRSRDLAAGGQGAPLVPAFHEALFRDNRETRAVLNIGGIANISVLPPDA 
I | | | | | | | | I | | I ! M I I I I I I I I I ( I I I I t I I I I ! : I I M : I I I I I II I I If I I I I I 
gl2 l AELTRI FTVGDFRSRDLAAGGQGAPLVPAFHEALFRDDRETRVVLNIGGI ANISVLPPGA 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml21-l pep PAFGFDTGPGNMLMDAWTQAHWQLPYDKNGAKAAQGNILPQLLDRLLAHPYFAQPHPKST 
M I I I I I I IE I E I I I I I I I I I I I I I I II I I I I ! I I I I I M I I I I I II I I I I : I II I I I 1 
g l21 pAFGFDTGPGNMLMDAWTQAHWQLPYDKNGAKAAQGNILPQLLGRLLAHPYFSQPHPKST 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml21-l pep GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVCDAVSHAAADARQMYICGGGIRNPV 
| | | | | I I I I I I I I I I I I I 1 I I I I I I I I M I I I I 1 I I I I I I M I I t I I I I I I I I I I I I I I 
g l2l GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVWDAVSHAAADARQMYICGGGIRNPV 
250 260 270 280 290 300 

310 320 330 340 350 360 

ml21-l.pep LMADLAECFGTRVSLHSTADLNLDPQWVEAAXFAWLAACWINRIPGSPHKATGASKPCIL 
| | j | | | | | I I I I I I I I I I I : I I I II I I I I I I I I I I I I I I t I I I I I I I I I I I I I I I M 1 I 
gl2 l LMADLAECFGTRVSLHSTAELNLDPQWVEAAAFAWLAACWINRIPGSPHKATGASKPCIL 

310 320 330 340 350 360 

ml21-l -pep XAGYYYX 
I I I I I I 

gl21 GAGYYYX 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 443>: 

al21-l . seq 

1 ATGGAAACAC AGCTTTACAT CGGCATCATG TCGGGAACCA GCATGGACGG 

51 GGCGGATGCC GTACTGATAC GGATGGACGG CGGCAAATGG CTGGGCGCGG 

101 AAGGGC AC GC CTTTACCCCC TACCCCGGCA GGTTACGCCG CAAATTGCTG 

151 GATTTGCAGG ACACAGGCGC GGACGAACTG CACCGCAGCA GGATGTTGTC 

201 GCAAGAACTC AGCCGCCTGT ACGCGCAAAC CGCCGCCGAA CTGCTGTGCA 

251 GTCAAAACCT CGCGCCGTCC GACATTACCG CCCTCGGCTG CCACGGGCAA 

301 ACCGTCAGAC ACGCGCCGGA ACACAGTTAC AGCGTACAGC TTGCCGATTT 

351 GCCGCTGCTG GCGGAACGGA CTCAGATTTT TACCGTCGGC GACTTCCGCA 

401 GCCGCGACCT TGCGGCCGGC GGACAAGGCG CGCCGCTCGT CCCCGCCTTT 

4 51 CACGAAGCCC TGTTCCGCGA CGACAGGGAA ACACGCGCGG TACTGAACAT 

501 CGGCGGGATT GCCAACATCA GCGTACTCCC CCCCGACGCA CCCGCCTTCG 

551 GCTTCGACAC AGGACCGGGC AATATGCTGA TGGACGCGTG GATGCAGGCA 

601 CACTGGCAGC TTCCTTACGA CAAAAACGGT GCAAAGGCGG CACAAGGCAA 

651 CATATTGCCG CAACTGCTCG ACAGGCTGCT CGCCCACCCG TATTTCGCAC 

701 AACCCCACCC TAAAAGCACG GGGCGCGAAC TGTTTGCCCT AAATTGGCTC 

7 51 GAAACCTACC TTGACGGCGG CGAAAACCGA TACGACGTAT TGCGGACGCT 
801 TTCCCGATTC ACCGCGCAAA CCGTTTTCGA CGCCGTCTCA CACGCAGCGG 

8 51 CAGATGCCCG TCAAATGTAC ATTTGCGGCG GCGGCATCCG CAATCCTGTT 
901 TTAATGGCGG ATTTGGCAGA ATGTTTCGGC ACACGCGTTT CCCTGCACAG 
951 CACCGCCGAA CTGAACCTCG ATCCGCAATG GGTAGAAGCC GCCGCGTTCG 

1001 CATGGATGGC GGCGTGTTGG GTCAACCGCA TTCCCGGTAG TCCGCACAAA 

1051 GCAACCGGCG CATCCAAACC GTGTATTCTG GGCGCGGGAT ATTATTATTG 

1101 A 

This corresponds to the amino acid sequence <SEQ ED 444; ORF 121-1. a>: 

al21-l.pep 

1 METQLYIGIM SGTSMDGADA VLIRMDGGKW LGAEGHAFTP YPGRLRRKLL 
51 DLQDTGADEL HRSRMLSQEL SRLYAQTAAE LLCSQNLAPS DITALGCHGQ 
101 TVRHAPEHSY SVQLADLPLL AERTQIFTVG DFRSRDLAAG GQGAPLVPAF 
151 HEALFRDDRE TRAVLNIGGI ANISVLPPDA PAFGFDTGPG NMLMDAWMQA 
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201 HWQLPYDKNG AKAAQGNILP QLLDRLLAHP YFAQPHPKST GRELFALNWL 

251 ETYLDGGENR YDVLRTLSRF TAQTVFDAVS HAAADARQMY ICGGGIRNPV 

301 LMADLAECFG TRVSLHSTAE LNLDPQWVEA AAFAWMAACW VNRIPGSPHK 

351 ATGASKPCIL GAGYYY* 

ml21-l/a!21-l 96.4% identity in 366 aa overlap 

10 20 30 40 50 60 

ml2 1 - 1 pep METQLYIGIMSGTSMDGADAVLI RMDGGKWLGAEGHAFTPYPGRLRRQLLDLQDTGADEL 

I | | ( I I I I I I I I I I I I I I 1 t I I II 1 I M I I I I I I I I II E I : I I I I I I I I I I I I 

al21-l METQLYIGIMSGTSMDGADAVLI RMDGGKWLGAEGHAFTPYPGRLRRKLLDLQDTGADEL 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml21-l .pep HRSRILSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHGYSIQLADLPLL 
I 1 I I : I I I I I I I I I I I I ! I ! I I I 1 I I I I I I I I I I I I I I I I I I I I i I I I : I i : I 1 I I I I I 1 
a 12 1-1 HRSRMLSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHSYSVQLADLPLL 
70 80 90 100 110 120 

130 140 150 160 170 180 

ml21-l pep AERTRIFTVGDFRSRDLAAGGQGAPLVPAFHEALFRDNRETRAVLNIGGIANISVLPPDA 
| | I I : I 1 I I 1 j I I 1 I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I 
a 121-l AERTQIFTVGDFRSRDLAAGGQGAPLVPAFHEALFRDDRETRAVLNIGGIANISVLPPDA 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml21-l pep PAFGFDTGPGNMLMDAWTQAHWQLPYDKNGAKAAQGN I LPQLLDRLL AH P YFAQPHPKST 
M I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 1 I I I I I I I I I 
a 12 1-1 PAFGFDTGPGNMLMDAWMQAHWQLPYDKNGAKAAQGNILPQLLDRLLAHPYFAQPHPKST 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml21-l pep GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVCDAVSHAAADARQMY ICGGGIRNPV 
| | [ | | ! | | | M I I I I I M I I M I I I I I 1 I I I I It I I I I I I I I I I I I II I I I I I I M M I 
al21-l GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVFDAVSHAAADARQMY ICGGGIRNPV 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml21-l pep LMADLAECFGTRVSLHSTADLNLDPQWVEAAXFAWLAACWINRI PGSPHKATGASKPCIL 
| | | 1 | | I I I t I I II I I I I I : I I I ! I I I I I I I I I I : I I I I : I I I 1 I I I I I I I I I I I I I I I 
a 121 LMADLAECFGTRVSLHSTAELNLDPQWVEAAAFAWMAACWVNRI PGSPHKATGASKPCIL 

310 320 330 340 350 360 



ml21-l.pep XAGYYYX 
I I I I I I 

al21 GAGYYYX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 445>: 

gl22 . seq 

1 ATGGCTT T AC TGAGCATCCG CAAGCTGCAC AAACAATACG GCAGCGTAAC 

51 CGCCATCCAA TCCTTAGACT TGGACTTGGA AAAAGGCGAA GtcatCGTAC 

101 TGCTGGGCCC gTccggctgc ggCAAATCCA CCCTcctgcg ctgcgtcaaC 

151 GGTTTGG AG C CGCACCAagg cgGCAGCATC GTGATGGACG GTgtcgGCGA 

2 01 ATTCggcAAA GACGTTTCCT GGCAAACCGC CCGGCAAAAa gtcggtatgg 
251 tctttcaaag taacgAactg Tttgcccaca tgaccgtcat cgAaaacatc 
301 ttcttAggcC CGGT AAagg a aCAAAAcCgc gaccgtgccg aagcaGAGGC 

3 51 gCAAGCCGGC AAactGttgg aacgcgTCGG actgctAGAC CGCAAAAACG 

4 01 CCTATCCGCG CGAACTTTCC GGCGGTCAGA AACAGCGCAT CGCCATTGTC 

4 51 CGCGCCCTGT GCCTGAATCC GGAAGTCATC CTGCTGGACG AAATCACCGC 
501 CGCACTTGAC CCCGAAATGG TGCGCGAAGT CTTGGAAGTG GTTTTGGAAC 

5 51 TCGCCCGCGA AGGGATGAGT ATGCTCATCG T AAC CCACGA AATGGGGTTC 
601 GCACGCAAAG TTGCCGACCG CAT CGTCTTT ATGGACAAAG GCGGCATCGT 
651 CGAATCGTCC GACCCCGAAA CCTTTTTTTC CGCACCAAAA AGCGAACGCG 
701 CCCGCCAATT TCTGGCAGGT ATGGACTACT GA 

This corresponds to the amino acid sequence <SEQ ID 446; ORF 122.ng>: 
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gl2 2 . pep 

1 MALLS IRKLH KQYGSVTAIQ SLDLDLEKGE VIVLLGPSGC GKSTLLRCVN 

51 GLEPHQGGSI VMDGVGEFGK DVSWQTARQK VGMVFQSNEL FAHMTVIENI 

101 FLGPVKEQNR DRAEAEAQAG KLLERVGLLD RKNAYPRELS GGQKQRIAIV 

151 RALCLNPEVI LLDEITAALD PEMVREVLEV VLELAREGMS MLIVTHEMGF 

201 ARKVADRIVF MDKGGIVESS DPETFFSAPK SERARQFLAG MDY* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 447>: 

ml22 . seq 

1 GTTGTCATGA TTAAAATCCG 

51 TATTTTGCGC GG CAT CG ATT 

101 TCCTCGGGCC TTCCGGCTCA 

151 GCGTTGGAAA TGCCCGAAGA 

201 GCTGAAAATC GATTTTTCTA 

251 TGCGCCGCAA ATCAkGCATG 

3 01 AAAACCGCCT TGGAAAACGT 

3 51 GCCTGCCGCC CAAGCGCGCG 

4 01 GCTTGGGCGA CAAAGTGGAT 
4 51 CAGCAGCGCG TCGGCATTGC 
501 GCTGTTTGAC GAACCGACTT 
551 TTTTGGATmC CAT GAAGG AA 
601 GTTACGCATG AAATCAAGTT 
651 GATGGACrGC GGCGTTATTG 
701 ACCACCCCAA ACACGAACGG 
751 ACCAAGATTT GA 

This corresponds to the amino acid sequence <SEQ ID 448; ORF 122>: 

ml22 . pep 

1 WMIKIRNIH KTFGENTILR GIDLDVCKGQ VWILGPSGS GKTTFLRCLN 
51 ALEMPEDGQI EFDNERPLKI DFSKKPSKHD ILALRRKSXM VFQQYNLFPH 
101 KTALENVMEG PVAVQGKPAA QAREEALKLL EKVGLGDKVD LYPYQLSGGQ 
151 QQRVGIARAL AIQPELMLFD EPTSALDPEL VQDVLDXMKE LAQEGWTMW 
201 VTHEIKFALE VATTXWMDX GVIVEQGSPQ DLFDHPKHER TRRFLSQIQS 
251 TKI* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from AT. gonorrhoeae 

ORF 122 shows 47.2% identity over a 246 aa overlap with a predicted ORF (ORF 122.ng) 
from N. gonorrhoeae: 

ml22/gl22 



CAATATCCAT 
TGGATGTGTG 
GG C AAAACGA 
CGGACAAATC 
AAAAACCAAG 
GTGTTTCAAC 
AATGGAAGGA 
AAGAGGCTCT 
TTGT AT CCCT 
CCGCG CATTG 
CCGCGCTCGA 
TTGGCGCAAG 
CGCCTTAGAA 
TCGAACAAGG 
ACGCGGAGAT 



AAGACCTTTG 
CAAAGGGCAG 
CGTTTCTGCG 
GAGTTCGACA 
CAAACACGAT 
AATACAAyCT 
CCGGTTGCCG 
GAAACTGCTG 
ACCAGCTTTC 
GCGATTCAGC 
TCCTGAATTG 
AAGGCTGGAC 
GTGGCAACCA 
CAGCCCGCAA 
TTTTAAGCCA 



GCGAAAACAC 
GTGGTCGTCA 
ATGCCTAAAC 
ACGAGCGACC 
ATTTTGGCAC 
CTTTCCGCAC 
TACAgGGCAA 
G AAAAAGT CG 
CGGCGGTCAG 
CTGAACTGAT 
GTG C AAGATG 
CATGGTTGTC 
CCGwCGTCGT 
GATTTGTTCG 
AATCCAATCT 



10 20 30 40 50 60 

ml22 .pep WMIKIRNIHKTFGENTILRGIDLDVCKGQVWILGPSGSGKTTFLRCLNALEMPEDGQI 

:: = = :||s:|| :| I = = = = 111= 11 = 1 = 1 = 11111 I I : I = I I I = I = I I = hi 
gl22 MALLSIRKLHKQYGSVTAIQSLDLDLEKGEVIVLLGPSGCGKSTLLRCVNGLEPHQGGSI 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml22 . pep EFDNERPLKI DFSKKPSKHD I LALRRKSXMVFQQYNLFPHKTALENVMEGPVAVQGKPAA 

= 1= : ||:: hi I I I I = : I I I h = I I = = I I I h= I 

gl22 VMDGVGEFGKDVSWQTA RQKVGMVFQSNELFAHMTVI ENI FLGPVKEQNRDRA 

70 80 90 100 110 



130 140 150 160 170 180 

ml22 . pep QAREEALKLLEKVGLGDKVDLYPYQLSGGQQQRVGIARALAIQPELMLFDEPTSALDPEL 

= 1= =1 lllhlll 1= = II :||llhlh:hlll ::|h = hll 1 = 11111 = 
gl22 EAEAQAGKLLERVGLLDRKNAY PRELSGGQKQRI AI VRALCLNPEVI LLDE I TAALDPEM 

120 130 140 150 160 170 



190 200 210 220 230 240 

ml22 . pep VQDVLDXMKE LAQEGWTMVWTHE I KFALEVATTXWMDXGV I VEQGS PQDLFDH PKHER 

h = lh = llhll :| = :||lh II =11 I II I 111^ = 1= = 1 = II II 

gl22 VREVLEWLELAREGMSMLIVTHEMGFARKVADRIVFMDKGGIVESSDPETFFSAPKSER 
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180 190 200 210 220 230 

250 

ml22.pep TRRFLSQIQSTKIX 

= 1 = 11 = 

gl22 ARQFLAGMDYX 
240 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 449>: 

al22 . seq 

1 GTTGTCATGA TTAAAATCCG C AAT AT C CAT AAGACCTTCG GCAAAAATAC 

51 CATTTTGCGC GG CATC AAT T TGGATGTGTG CAAAGGGCAG GTGGTCGTCA 

101 TCCTCGGGCC TTCCGGCTCA GGCAAAACGA CGTTTCTGCG ATGCCTAAAC 

151 GCGTTGGAAA TGCCCGAAGA CGGACAAATC GAGTTCGACA ACGAGCGACC 

2 01 GCTGAAAATC GATTTTTCTA AAAAACCAAG CAAACACGAT ATTTTGGCAC 

2 51 TGCGCCGCAA ATCAGGCATG GTGTTTCAAC AATACAACCT CTTTCCGCAC 

3 01 AAAACCGCCT TGGAAAACGT GATGGAAGGA CCGGTTGCCG TACAGGGCAA 

3 51 GCCTGCCGCC CAAGCGCGCG AAGAGGCTCT GAAACTGCTG GAAAAAGTCG 

4 01 GCTTGGGCGA CAAAGTGGAT TTGTATCCCT ACCAGCTTTC CGGCGGTCAG 
4 51 CAGCAGCGCG TCGGCATTGC CCGAGCATTG GCGATTCAGC CCGAGCTGAT 
501 GTTGTTTGAC GAACCCACTT CCGCGCTTGA CCCCGAGTTG GTGCAAGACG 
551 TGTTGAACGC CATGAAGGAA TTGGCGCGGG AAGGTTGGAC GATGGTCGTC 
601 GTTACCCACG AAATCAAGTT CGCGCTGGAA GTTGCCACGA CCGTTGTCGT 
651 GATGGACGGC GGCGTTATCG TAGAGCAGGG CAGCCCGAAA GAGTTGTTCG 
7 01 ACCACCCCAA ACACGAACGG ACGCGGAGAT TTTTAAGCCA AATCCAATCT 
7 51 ACCAAGATTT GA 

This corresponds to the amino acid sequence <SEQ ID 450; ORF 122.a>: 

al22.pep 

1 WMIKIRNIH KTFGKNTILR GINLDVCKGQ VVVILGPSGS GKTTFLRCLN 

51 ALEMPEDGQI EFDNERPLKI DFSKKPSKHD ILALRRKSGM VFQQYNLFPH 

101 KTALENVMEG PVAVQGKPAA QAREEALKLL EKVGLGDKVD LYPYQLSGGQ 

151 QQRVGIARAL AIQPELMLFD EPTSALDPEL VQDVLNAMKE LAREGWTMVV 

201 VTHEIKF ALE VATTVWMDG GVIV EQGSPK ELFDHPKHER TRRFLSQIQS 
251 TKI* 

m!22/al22 96.0% identity in 253 aa overlap 

10 20 30 40 50 60 

ml22 . pep WMIKIRNIHKTFGENTILRGIDLDVCKGQWVILGPSGSGKTTFLRCLNALEMPEDGQI 
I I 1 M I I II I I I N : I I I I I t I : I I I 11 I i I I I I I I I I I I I I 1 I I I I II I I I I 1 I ! M I I 
al22 VVMIKIRNIHKTFGKNTILRGINLDVCKGQVWILGPSGSGKTTFLRCLNALEMPEDGQI 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml22 . pep EFDNERPLK I DFSKKPSKHD I LALRRKSXMVFQQYNLF^HKTALENVMEG PVAVQGKPAA 

I I I I I I t I I I I I I I I I I I I I I I M I I I I I I I I I I I I 11 I I I I I I I I I M I I I I M I I I I 
al22 EFDNERPLKI DFSKKPSKHDILALRRKSGMVFQQYNLFPHKTALENVMEGPVAVQGKPAA 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml22 . pep QAREEALKLLEKVGLGDKVDLYPYQLSGGQQQRVGIARALAIQPELMLFDEPTSALDPEL 
I I I I I I ! I i 1 I I I I I I I M I I I I I I I I I It I I I II M I I 11 I I I I I I I I I 1 I I 1 I I I I I I 
a 122 QAREEALKLLEKVGLGDKVDLYPYQLSGGQQQRVGIARALAIQPELMLFDEPTSALDPEL 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml22 . pep VQDVLDXMKELAQEGWTMVVVTHEIKFALEVATTXVVMDXGVIVEQGSPQDLFDHPKHER 
Mill: I I I I I : I I I I I I I I I I I II I M I I I I I I I I I I I I I I I I I I :: I I I I I I I I I 
a l22 VQDVLNAMKELAREGWTMVWTHEIKFALEVATTVWMDGGVIVEQGSPKELFDHPKHER 

190 200 210 220 230 240 



m!22 .pep 



250 

TRRFLSQIQSTKIX 
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351 



al22 



I 1 I I M I I II I I M 
TRRFLSQIQSTKIX 
250 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 45 1>: 



1 ATGATTAAAA TCCGCAATAT CCATAAGACC TTTGGCGAAA ACACCATTTT 

51 GCGCGGCATC GATTTGGATG TGGGCAAAGG GCAGGTGGTC GTCATCCTCG 

101 GGCCTTCCGG CTCGGGTAAA ACAACATTTC TGCGCTGCCT AAACGCGTTG 

151 GAAATGCCCG AAGACGGACA AATCGAGTTC GACAACGCGC GGCCGTTACG 

201 CATTGATTTT TCCAAAAAAA CAAGCAAACA CGATATTTTG GCACTGCGCC 

251 GCAAGTCCGG AATGGTATTC CAACAATACA ACCTCTTCCC GCATAAAACC 

301 GTGTTGGAAA ACGTGATGGA AGGGCCGGTT GCCGTACAGG GCAAGCCTGC 

351 CGCCCAAGCG CGCGAAGAGG CTTTGAAACT GCTGGAAAAA GTCGGCTTGG 

4 01 GCGATAAAGT GGATTTGTAT CCCTACCAGC TTTCCGGCGG TCAGCAGCAG 

451 CGTGTCGGTA TCGCCCGCGC ACTGGCGATT CAGCCTGAAT TGATGCTGTT 

501 TGACGAACCC ACTTCCGCGC TGGACCCCGA GTTGGTGCAA GACGTGTTGG 

551 ACGCCATGAA GGAATTGGCG CGGGAAGGTT GGACGATGGT CGTCGTTACC 

601 CACGAAATCA AGTTCACGCT GGAAGTTGCC ACGAACGTCG TCGTGATGGA 

651 CGGCGGCGTT ATCGTAGAGC AGGGCAGCCC GAAAGAGTTG TTCGACCACC 

701 TCAAACACGA ACGGACGCGG AGATTTTTAA GCCAAATCCA ATCTGCCAAG 

7 51 ATTTGA 

This corresponds to the amino acid sequence <SEQ ID 452; ORF 122-1. ng>: 

gl22-l .pep 

1 MIKIRNIHKT FGENTILRGI DLDVGKGQVV VILGPSGSGK TTFLRCLNAL 

51 EMPEDGQIEF DNARPLRIDF SKKTSKHDIL ALRRKSGMVF QQYNLFPHKT 

101 VLENVMEGPV AVQGKPAAQA REEALKLLEK VGLGDKVDLY PYQLSGGQQQ 

151 RVGIARALAI QPELMLFDEP TSALDPELVQ DVLDAMKELA REGWTMVWT 

2 01 HEIKFTLEVA TNVWMDGGV IVEQGSPKEL FDHLKHERTR RFLSQIQSAK 

251 I* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 453>: 

ml 22-1 . seq 

1 ATGATTAAAA TCCGCAATAT CCATAAGACC TTTGGCGAAA ACACTATTTT 

51 GCGCGGCATC GATTTGGATG TGTGCAAAGG GCAGGTGGTC GTCATCCTCG 

101 GGCCTTCCGG CTCAGGCAAA ACGACGTTTC TGCGATGCCT AAACGCGTTG 

151 GAAATGCCCG AAGACGGACA AATCGAGTTC GACAACGAGC GACCGCTGAA 

201 AATCGATTTT TCTAAAAAAC CAAGCAAACA CGATATTTTG GCACTGCGCC 

251 GCAAATCAGG CATGGTGTTT CAACAATACA ACCTCTTTCC GCACAAAACC 

301 GCCTTGGAAA ACGTAATGGA AGGACCGGTT GCCGTACAGG GCAAGCCTGC 

351 CGCCCAAGCG CGCGAAGAGG CTCTGAAACT GCTGGAAAAA GTCGGCTTGG 

4 01 GCGACAAAGT GGATTTGTAT CCCTACCAGC TTTCCGGCGG TCAGCAGCAG 

4 51 CGCGTCGGCA TTGCCCGCGC ATTGGCGATT CAGCCTGAAC TGATGCTGTT 

501 TGACGAACCG ACTTCCGCGC TCGATCCTGA ATTGGTGCAA GATGTTTTGG 

551 ATACCATGAA GGAATTGGCG CAAGAAGGCT GGACCATGGT TGTCGTTACG 

601 CATGAAATCA AGTTCGCCTT AGAAGTGGCA ACCACCGTCG TCGTGATGGA 

651 CGGCGGCGTT ATTGTCGAAC AAGGCAGCCC GCAAGATTTG TTCGACCACC 

7 01 CCAAACACGA ACGGACGCGG AGATTTTTAA GCCAAATCCA ATCTACCAAG 

751 ATTTGA 

This corresponds to the amino acid sequence <SEQ ED 454; ORF 122-1>: 

ml22-l .pep 

1 MIKIRNIHKT FGENTILRGI DLDVCKGQVV VILGPSGSGK TTFLRCLNAL 
51 EMPEDGQIEF DNERPLKIDF SKKPSKHDIL ALRRKSGMVF QQYNLFPHKT 
101 ALENVMEGPV AVQGKPAAQA REEALKLLEK VGLGDKVDLY PYQLSGGQQQ 
151 RVGIARALAI QPELMLFDEP TSALDPELVQ DVLDTMKELA QEGWTMVVVT 
201 HEIKFALEVA TTWVMDGGV IVEQGSPQDL FDHPKHERTR RFLSQIQSTK 
251 I* 

ml22-l/gl22-l 94.8% identity in 251 aa overlap 



gl22-l . seq 



10 20 30 40 50 60 



m!22 - 1 .pep 



gl22-l 



MIKIRN I HKTFGENTI LRGI DLDVCKGQVVVI LGPSGSGKTTFLRCLNALEMPEDGQI EF 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

MI K I RNI HKTFGENTI LRGI DLDVGKGQWVI LG PSGSGKTT FLRCLNALEMPEDGQIEF 
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10 



20 



30 



40 



50 



60 



70 80 90 100 110 120 

ml22-l pep DNERPLKIDFSKKPSKHDILALRRKSGMVFQQYNLFPHKTALENVMEGPVAVQGKPAAQA 
M HCIIIIII I I i i I I I I I I I I I I f I I I t I I I I I I I ■ I I I I I I I I I I I I I I I I I I I 
gl22-l DNARPLRIDFSKKTSKHDILALRRKSGMVFQQYNLFPHKTVLENVMEGPVAVQGKPAAQA 
70 80 90 100 110 120 

130 140 150 160 170 180 

ml2 2-l -pep REEALKLLEKVGLGDKVDLYPYQLSGGQQQRVGIARALAIQPELMLFDEPTSALDPELVQ 
I | 1 1 | 1 | I | I I I t I I I I I I I I I I I I I I I I I ! M 1 I I I M I I I I I M I M ! i I I I I I I I I I 
gl22-l REEALKLLEKVGLGDKVDLYPYQLSGGQQQRVGIARALAIQPELMLFDEPTSALDPELVQ 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml22-l . pep DVLDTMKELAQEGWTMVVVTHEIKFALEVATTVVVMDGGVIVEQGSPQDLFDHPKHERTR 
| | | |: I I I I I : f I i I I I I I II I I I I : I I I I I : I I I I I I I I I I I I I I I : - I ! I I I I I I I I 
gl22-l DVLDAMKELAREGWTMVVVTHEIKFTLEVATNVVVMDGGVIVEQGSPKELFDHLKHERTR 
190 200 210 220 230 240 



ml22-l .pep 
gl22-l 



250 

RFLSQIQSTKIX 

I I I I I I I I : 1 I I 
RFLSQIQSAKIX 
250 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 45 5>: 



al22-l . seq 

1 ATGATTAAAA 



51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 



GCGCGGCATC 
GGCCTTCCGG 
GAAATGCCCG 
AATCGATTTT 
GCAAATCAGG 
GCCTTGGAAA 
CGCCCAAGCG 
GCGACAAAGT 
CGCGTCGGCA 
TGACGAACCC 
ACGCCATGAA 
CACGAAATCA 
CGGCGGCGTT 
CCAAACACGA 
ATTTGA 



TCCGCAATAT 
AATTTGGATG 
CTCAGGCAAA 
AAGAC GGAC A 
TCTAAAAAAC 
CATGGTGTTT 
ACGTGATGGA 
CGCGAAGAGG 
GGATTTGTAT 
TTGCCCGAGC 
ACTTCCGCGC 
GGAATTGGCG 
AGTTCGCGCT 
ATCGTAGAGC 
ACGGACGCGG 



CC AT AAGAC C 
TGTGCAAAGG 
ACGACGTTTC 
AATCGAGTTC 
CAAGCAAACA 
CAACAATACA 
AGGACCGGTT 
CTCTGAAACT 
CCCTACCAGC 
ATTGGCGATT 
TTGACCCCGA 
CGGGAAGGTT 
GGAAGTTGCC 
AGGGCAGCCC 
AGATTTTTAA 



TTCGGCAAAA 
GCAGGTGGTC 
TGCGATGCCT 
GACAACGAGC 
CGATATTTTG 
ACCTCTTTCC 
GCCGTACAGG 
GCTGGAAAAA 
TTTCCGGCGG 
CAGCCCGAGC 
GTTGGTGCAA 
GGACGATGGT 
ACGACCGTTG 
GAAAGAGTTG 
GCCAAATCCA 



ATACCATTTT 
GTCATCCTCG 
AAACGCGTTG 
GACCGCTGAA 
GCACTGCGCC 
GCACAAAACC 
GCAAGCCTGC 
GTCGGCTTGG 
TCAGCAGCAG 
TGATGTTGTT 
GACGTGTTGA 
CGTCGTTACC 
TCGTGATGGA 
TTCGACCACC 
ATCTACCAAG 



This corresponds to the amino acid sequence <SEQ ID 456; ORF 122-1. a>: 

al22-l.pep 

1 MIKIRNIHKT FGKNTILRGI NLDVCKGQVV VILGPSGSGK TTFLRCLNAL 

51 EMPEDGQIEF DNERPLKIDF SKKPSKHDIL ALRRKSGMVF QQYNLFPHKT 

101 ALENVMEGPV AVQGKPAAQA REEALKLLEK VGLGDKVDLY PYQLSGGQQQ 

151 RVGIARALAI QPELMLFDEP TSALDPELVQ DVLNAMKELA REGWTMVVVT 

2 01 HEIKFALEVA TTVVVMDGGV IVEQGSPKEL FDHPKHERTR RFLSQIQSTK 

251 I* 



al22-l/ml22-l 



97.2% identity in 251 aa overlap 



10 20 30 40 50 60 

al22-l .pep MIKIRNIHKTFGKNTILRGINLDVCKGQVWILGPSGSGKTTFLRCLNALEMPEDGQIEF 
| | | | | | | | | | | I : I I I I I II : I I I I II I I I I I I I I I I I i I I t II I I I I I I M I I I I I I M 
ml22-l MI KI RN I HKTFGENTI LRGI DLDVCKGQVVVI LGPSGSGKTTFLRCLNALEMPEDGQI EF 

10 20 30 40 50 60 

70 80 90 100 110 120 

al22-l .pep DNERPLKIDFSKKPSKHDILALRRKSGMVFQQYNLFPHKTALENVMEGPVAVQGKPAAQA 
M t I I I I I I I I I I I I M I M M I I I I I I I I I I I I I I I I I I I i I t I I I I I I M I I I I I I I I 
m 122-l DNERPLKIDFSKKPSKHDILALRRKSGMVFQQYNLFPHKTALENVMEGPVAVQGKPAAQA 

70 80 90 100 110 120 



130 140 150 160 170 180 

a 122-1 . pep REEALKLLEKVGLGDKVDLYPYQLSGGQQQRVGIARALAIQPELMLFDEPTSALDPELVQ 
I I I I I I I I I I t I I I I I I I It I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I 
ml22-l REEALKLLEKVGLGDKVDLYPYQLSGGQQQRVGIARALAIQPELMLFDEPTSALDPELVQ 
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130 140 150 160 170 180 

190 200 210 220 230 240 

al22-l pep DVLNAMKELAREGWTMVVVTHEIKFALEVATTVVVMDGGVIVEQGSPKELFDHPKHERTR 
| | !:: I H I I : I I I I M I I I I t I I I ! i I I I I I i I I I I I I 1 i I I I I I I - I M I I I It I I I 
ml22-l DVLDTMKELAQEGWTMVVVTHEIKFALEVATTVVVMDGGVI VEQGSPQDLFDHPKHERTR 

190 200 210 220 230 240 

250 

al22-l .pep RFLSQIQSTKIX 
I I I I I I I I I I I I 
ml22-l RFLSQIQSTKIX 
250 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 45 7>: 

gl25 - seq 

1 ATGTCGGGCA ATGCCTCCTC TCCTTCATCT TCCGCCGCCA TCGGGCTGGT 

51 TTGGTTCGGC GCGGCGGTAT CGATTGCCGA AATCAGCACG GGTACGCTGC 

101 TCGCCCCCTT GGGCTGGCAG CGCGGTCTGG CGGCCCTGCT TTTGGGTCAT 

151 GCCGTCGGCG GCGCGCTGTT TTTTGCGGCG GCGTATATCG GCGCACTGAC 

2 01 CGGACGCAGC TCGATGGAAA GTGTGCGCCT GTCGTTCGGC AAATGCGGTT 

2 51 CAGTGCTGTT TTCCGTGGCG AATATGCTGC AACTGG C CGG CTGGACGGCG 

301 GTGATGATTT ACGTCGGCGC AacggTCAGC TCCGCTTTGG GCAAAGTGTT 

351 GTGGGACggc gaATCCTTTG TCTGGTGGGC ATTGGCAAAC GGCGCACTGA 

4 01 TCGTGCTGTG GCTGGTTTTC GGCGCACGCA GAACGGGCGG GCTGAAAACC 

451 GTTTCGATGC TGCTGATGCT GCTTGCCGTG TTGTGGTTGA GCGTCGAAGT 

501 GTTCGCTTCG TCCGGCACAA ACGCCGCGCC CGCCGTTTCA GACGGCATGA 

551 CCTTCGGAAC GGCAGTCGAA CTGTCCGCCG TCATGCCGCT TTCCTGGCTG 

601 CCGCTGGCCG CCGACTACAC GCGCCAAGCA CGCCGCCCGT TTGCGG CAAC 

651 CCTGACGGCA ACGCTCGCCT ATACGCTGAC GGGCTGCTGG ATGTATGCCT 

701 TGGGTTTGGC GGCGGCTCTG TTTACCGGAG AAAG CGACGT GGCGAAAATC 

751 CTGTTGGGCG CGGGCTTGGG CATAACGGGC ATTCTGG C AG TCGTCCTCTC 

801 CACCGTTACC ACAACGTTTC TCGATACCTA TTCCGCCGGC GCGAGTGCGA 

851 ACAACATTTC CGCGCGTTTT GCGGAAATAC CCGTCGCTGT CGGCGTTACC 

901 CTGatccgca ccgtgcttgc cgtcatgctg cccgttaccg aatataaaaa 

951 cttcctgctg cttatccgct cggtatttgg gccgatggcg ggtggttttg 

1001 attgccgaCT TTTttgtctt AAAACGGCGT GA 

This corresponds to the amino acid sequence <SEQ ID 458; ORF 125.ng>: 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 459>: 

ml25 . seq 

1 ATGTCGGGCA ATGCCTCCTC TCCTTCATCT TCCTCCGCCA TCGGGCTGAT 

51 TTGGTTCGGC GCGGCGGTAT CGATTGCCGA AATCAGCACG GGTACGCTGC 

101 TTGCGCCTTT GGGCTGGCAG CGCGGTCTGG CGGCTCTACT TTTGGGTCAT 

151 GCCGTCGGCG GCGCGCTGTT TTTTGCGGCG GCGTATATCG GCGCACTGAC 

2 01 CGGACGCAGC TCGATGGAAA GCGTGCGCCT GTCGTTCGGC AAACGCGGTT 

2 51 CAGTGCTGTT TTCCGTGGCG AATATGCTGC AACTGGCCGG CTGGACGGCG 

3 01 GTGATGATTT ACGCCGGCGC AACGGTCAGC TCCGCTTTGG GCAAAGTGTT 

3 51 GTGGGACGGC GAATCTTTTG TCTGGTGGGC ATTGGCAAAC GGCGCGCTGA 

4 01 TTGTGCTGTG GCTGGTTTTC GGCGCACGCA AAACAGGCGG GCTGAAAACC 

4 51 GTTTCGATGC TGCTGATGCT GTTGGCGGTT CTGTGGCTGA GTGCCGAAGT 
501 CTTTTCCACG GCAGGCAGCA CCGCCGCACA GGTTTCAGAC GGCATGAGTT 

5 51 TCGGAACGGC AGTCGAGCTG TCCGCCGTGA TGCCGCTTTC CTGGCTGCCG 

6 01 CTTGCCGCCG ACTACACGCG CCACGCGCGC CGCCCGTTTG CGGCAACCCT 

6 51 GACGGCAACG CTCGCCTACA CGCTGACCGG CTGCTGGATG TATGCCTTGG 

7 01 GTTTGGCAGC GGCGTTGTTC ACCGGAGAAA CCGACGTGGC AAAAATCCTG 

7 51 CTGGGCGCAr GTTTGgGTGC GGCAGGCATT TTGGCGGTCG TCCTCTCCAC 

8 01 CGTTACCACA ACGTTTCTCG ATGCCTATTC CGCCGGCGCG AGTGCGAACA 
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8 51 ACATTTCCGC GCGTTTTGCG GAAACACCCG TCGCTGTCrG CGTTACCCTG 

901 ATCGGCACGG TACTTGCCGT CATGCTGCCC GTTACCGAAT ATGAAAACTT 

951 CCTGCTGCTT ATCGGCTCGG TATTTGCGCC GATGGCGGgC GGTTTTGATT 

1001 GCCGACTTTT TCGTCTTGAA ACGGCGTGA 

This corresponds to the amino acid sequence <SEQ ID 460; ORF 125>: 

ml25 . pep 

1 MSGNASSPSS SSAIGLIWFG AAVSIAEIST GTLLAPLGWQ RGLAALLLGH 

51 AVGGALFFAA AYIGALTGRS SMESVRLSFG KRGSVLFSVA NMLQLAGWTA 

101 VMI YAGATVS SALGKVLWDG ES FVWWALAN GALIVLWLV F GARKTGGLKT 

151 VSML LMLLAV LWLSAEVF ST AGSTAAQVSD GMS FGTAVEL SAVMPLSWLP 

201 LAADYTRHAR RP FAATLT AT LAYTLTGCWM YALGLAAALF TGETDVAKIL 

251 LGAXLGAAGI LAWLSTVTT TFLDAYSAGA SANNISARF A ETPVAVXVTL 

3 01 IGTVLAVM LP VTEYENFLLL IGSVFAPMAG GFDCRLFRLE TA* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 125 shows 92.1% identity over a 343 aa overlap with a predicted ORF (ORF 125.ng) 

from N. gonorrhoeae: 
ml25/gl25 

10 20 30 40 50 60 

MSGNAS S P S S S S A I GL I V? FGAAVS I AE I S TGTLLAPLGWQRGLAALLLGHAVGGALF FAA 

MMM M M MMMMMM MM M M M M MMM ,M M MM I MM M MM 

MSGNAS SPSSSAA I GLVWFGAAVS I AE I STGTLLAPLGWQRGLAALLLGHAVGGALFFAA 
10 20 30 40 50 60 



ml 2 5 -pep 
gl25 



70 80 90 100 110 120 

ml 2 5 pep AY I GALTGRSSMESVRLSFGKRGSVLFSVANMLQLAGWTAVM I YAGATVS SALGKVLWDG 

I 1 1 I II I I M 1 1 1 1 1 I II 1 1 1 II I II II II I 1 1 1 1 1 1 1 I 1 1 1 h I II I I I M I I 1 1 1 1 I 

g!2 5 AYIGALTGRSSMESVRLSFGKCGSVLFSVANMLQLAGWTAVMIYVGATVSSALGKVLWDG 

70 80 90 100 110 120 



130 140 150 160 . 170 179 

ml 2 5 pep E S FVWW ALANGAL I VLW LV FGARKTGGL KTV S MLLMLLAVLW L S AE V F S TAGS TAAQ - V S 

||||||IIIIIIMIIIMIM|:|IIIIIMMMIMIMIhllh::|::M II 
gl 2 5 ESFVWWALANGALIVLWLVFGT^RRTGGLKTVSMLLMLIiAVLWLSVEVFASSGTNAAPAVS 

130 140 150 160 170 180 



180 190 200 210 220 230 239 

ml2 5 . pep DGM S F GT AV ELS AVM P L S W LP LAAD YTRHARR P F AAT LT ATLA YT LTGCWM Y ALG LAAAL 

' , : ! , M 1 1 I 1 1 1 1 > ! I i : 1 1 1 1 ; 1 1 1 : . 1 1 i I ! 1 1 1 1 M i 1 1 1 1 1 I i 1 i i 1 1 I I ! 1 1 1 

g 1 2 5 DGMTFGTAVELSAVMPLSWLPLAADYTRQARRPFAATLTATLAYTLTGCWMYALGLAAAL 

190 200 210 220 230 240 



240 250 260 270 280 290 299 

ml25 . pep FTGE TDV AK I LLG AXLG AAG I LA WLS TVTTT FLD A Y S AGAS ANN I S AR F AE T P V AVX VT 

1 1 I I I I I I II I 1 1 I II MIIIIMMIIIIIIhlMIMIIIIIIIIII MM II 

gl 2 5 FTGE TDV AKI LLG AGLG I TG I LAWLS TVTTT FLDT Y S AG AS ANN I SARFAEI PVAVGVT 

250 260 270 280 290 300 



300 310 320 330 340 

ml 2 5 . pep L I GTVLAVML P VT E Y EN FLLL I GS V F APMAGG FDCRL FRLET AX 

II MMMMMMMIMM I I h I I I I I 1 1 I II I hill 

g 1 2 5 LI RTVLAVMLPVTEYKNFLLLI RSVFGPMAGGFDCRLFCLKTAX 

310 320 330 340 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 46 1>: 

al25 . seq 

1 ATGTCGGGCA ATGCCTCCTC TCCTTCATCT TCCGCCGCCA TCGGGCTGAT 

51 TTGGTTCGGC GCGGCGGTAT CGATTGCCGA AATCAGCACG GGTACACTGC 

101 TTGCGCCTTT GGGCTGGCAG CGCGGTCTGG CGGCTCTGCT TTTGGGTCAT 
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151 GCCGTCGGCG GCGCGCTGTT TTTTGCGGCG GCGTATATCG GCGCACTGAC 

201 CGGACGCAGC TCGATGGAAA GCGTGCGCCT GTCGTTCGGC AAACGCGGTT 

2 51 CAGTGCTGTT TTCCGTGGCG AATATGCTGC AACTGGCCGG CTGGACGGCG 

301 GTGATGATTT ACGCCGGCGC AACGGTCAGC TCCGCTTTGG " GCAAAGTGTT 

351 GTGGGACGGC GAATCTTTTG TCTGGTGGGC ATTGGCAAAC GGCGCGCTGA 

4 01 TTGTGCTGTG GCTGGTTTTC GGCGCACGCA AAACAGGCGG GCTGAAAACC 

4 51 GTTTCGATGC TGCTGATGCT GTTGGCGGTT CTGTGGCTGA GTGCCGAAGT 

501 CTTTTCCACG GCAGGCAGCA CCGCCGCACA GGTTTCAGAC GGCATGAGTT 

551 TCGGAACGGC AGTCGAGCTG TCCGCCGTGA TGCCGCTTTC TTGGCTGCCG 

601 CTGGCCGCCG ACTACACGCG CCACGCGCGC CGCCCGTTTG CGGCAACCCT 

651 GACGGCAACG CTCGCCTACA CGCTGACCGG CTGCTGGATG TATGCCTTGG 

7 01 GTTTGGCAGC GGCGTTGTTC ACCGGAGAAA CCGACGTGGC AAAAATCCTG 

7 51 CTGGGCGCAG GTTTGGGTGC GGCAGGCATT TTGGCGGTCG TCCTGTCGAC 

801 CGTTACCACC ACTTTTCTCG ATGCCTACTC CGCCGGCGTA AGTGCCAACA 

851 ATATTTCCGC CAAACTTTCG GAAATACCCA TCGCCGTTGC CGTCGCCGTT 

901 GTCGGCACAC TGCTTGCCGT CCTCCTGCCC GTTACCGAAT AT G AAAACT T 

951 CCTGCTGCTT ATCGGCTCGG TATTTGCGCC GATGGCG . GC GGTTTTGATT 

1001 GCCGACTTTT TCGTCTTGAA ACGGCGTGA 



This corresponds to the amino acid sequence <SEQ ID 462; ORF 125.a>: 

al25 .pep 

1 MSGNASSPSS SAAIGLIWFG AAVSIAEIST GTLLAPLGWQ RGLAALLLGH 

51 AVGG ALFFAA AYIGALTGRS SMESVRLSFG KRGSVLFSVA NMLQLAGWTA 

101 VMIYAGATVS SALGKVLWDG ES FVWWALAN GALIVLWLV F GARKTGGLKT 

151 VSMLLMLLAV LWLSAEV FST AGSTAAQVSD GMSFGTAVEL SAVMPLSWLP 

2 01 LAADYTRHAR RPFAATLTAT LAYTLTGCWM YALGLAAALF TGET DVAKIL 

251 LGAGLGAAGI LAVVL STVTT TFLDAYSAGV SANNISAKLS E IPIAVAVAV 

301 VGTLLAVL LP VTEYEN FLLL IGSVFAPMAX GFD CRLFRLE TA* 

m!25/al25 95.6% identity in 342 aa overlap 

10 20 30 40 50 60 

ml25 .pep MSGNASSPSSSSAIGLIWFGAAVSIAEISTGTLLAPLGWQRGLAALLLGHAVGGALFFAA 

| | | M I I M I I : I I I I I I I I I I I I I II M I I I t I I I I I I I I I I I I I I I I I I I I I I M M I 
a l25 MSGNASSPSSSAAIGLIWFGAAVSIAEISTGTLLAPLGWQRGLAALLLGHAVGGALFFAA 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml25 . pep AYIGALTGRSSMESVRLSFGKRGSVLFSVANMLQLAGWTAVMIYAGATVSSALGKVLWDG 
I I I I I M I I t I I I I I I 1 I j I I I I M I I I I I! I I I I I I M I I I I I I I II I I I I II I II E I 1 
a 12 5 AY I GALTGRS S ME SVRLS FGKRG S VLFS VANMLQLAGWTAVM I YAGATVS SALGKVLWDG 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml25 . pep ESFVWWALANGALIVLWLVFGARKTGGLKTVSMLLMLLAVLWLSAEVFST AGSTAAQVSD 

I I | I I II I ! I II I I I I I M I I I I I M I I I I 1 I I I I I I I I I M I I I M I I I I I I 

a 12 5 E S EVWW ALANG AL I VLWLVFGARKTGGLKT VSMLLMLLAV LWLSAEVFST AGSTAAQVSD 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml 2 5 . pep GMSFGTAVELSAVMPLSWLPLAADYTRHARRPFAATLTATLAYTLTGCWMYALGLAAALF 
I I I I I I I I i I I I I I i I I I I I I It I M I I I I I I I M I I I N I M I I I I I M I I I M i I 1 I I 
a 12 5 GMSFGTAVELSAVMPLSWLPLAADYTRHARRPFAATLTATLAYTLTGCWMYALGLAAALF 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml25 . pep TGETDVAKILLGAXLGAAGILAWLSTVTTTFLDAYSAGASANNISARFAETPVAVXVTL 

I I | I If I I I I I I I I I I I M I I I II I I I I I I I I I I I I I I : I I M I I I : : : I I : I I I : : 
a 12 5 TGETDVAKILLGAGLGAAGILAWLSTVTTTFLDAYSAGVSANNISAKLSEIPIAVAVAV 

250 260 270 280 290 300 



310 320 330 340 

ml2 5 . pep IGTVLAVMLPVTEYEN FLLLIGSVFAPMAGGFDCRLFRLETAX 

: I I : I I I : I II I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 
a 125 VGTLLAVLLPVTEYENFLLLIGSVFAPMAXGFDCRLFRLETAX 
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310 320 330 340 



The following partial DNA sequence was identified in TV. gonorrhoeae <SEQ ID 463>: 

gl26 . seq 

1 AtgccgtcTG AAaccCcaaa ggcACGCCGC CGGCTTTCAG ACGGCATCGC 

51 GTCCGACAAC CAT AC C AAAG AAT C CATC AT GCTCACCctg tacggcGAAA 

101 CTTTCCCTTC GCGGCTGCTg ctcggcacgG cggcctacCC GACCCCTGAA 

151 ATCCTCAAAC AATCCGTCCG AACCGCCCGG CCCGCGATGA ttaccGTCTC 

201 GCTGCGCCGC ACGGGATGCG GCGGCGAGGC GCACGGTCAG GGGTTTTGGT 

2 51 CGCTGCTTCA AGAAACCGGC GTTCCCGTCC TGCCGAACAC GGCAGGCTGC 

3 01 CAAAGCGTGC AGGAAGCGGT AACGACGGCG CAAATGGCGC GCGAAGTGTT 
3 51 TGAAACCGAT TGGATAAAAT TGGAACTCAT CGGCGACGAC G AC AC CTTGC 
401 AG CCGG ACGT GTTCCAACTC GTCGAAGCGG CGGAAATCCT GATTAAAGAC 
451 GGCTTCAAAG TGCTGCCTTA TTGCACCGAA GACCTGATTG CCTGCCGCCG 
501 CCTGCTCGAT GCGGGCTGTC AGGCGTTGAT GCCGTGGGCG GCTCCCATCG 
551 GCACGGGTTT GGGGGCGGTT CACGCCTATG CGCTCAAAAT CCTGCGCGAA 
601 CGCCTGCCCG ACACGCCGCT GAT TAT CG AC GCGGGCTTGG GTTTGCCTTC 
651 CCAAGCGGCA CAAGTGATGG AATGGGGTTT TGACGGCGTA TTGTTAAACA 
701 CCGCCGTTTC CCGCAGCGGC GACCCCGTCA ACATGGCGCG CGCCTTCGCA 
751 CTCGCCGTCG AATCCGGACG GCTGGCATTT GAAGCCGGGC CGGTCGAAGC 
801 G CG AACC AAA GCCCAAGCCA GCACGCCGAC AGTCGGACAA CCGTTTTGGC 
851 ATT CGGCGG A ATATTGA 



This corresponds to the amino acid sequence <SEQ ID 464; ORF 126.ng>: 

gl26.pep 

1 MPSETPKARR RLSDGIASDN HTKESIMLTL YGETFPSRLL LGTAAYPTPE 

51 I LKQSVRTAjR PAMITVSLRR TGCGGEAHGQ GFWSLLQETG VPVLPNTAGC 

101 QSVQEAVTTA QMAREVFETD WIKLELIGDD DTLQPDVFQL VEAAEILIKD 

151 GFKVLPYCTE DLIACRRLLD AGCQALMPWA AP I GTGLGAV HAYALKILRE 

201 RLPDTPLIID AGLGLPSQAA QVMEWGFDGV LLNTAVSRSG DPVNMARAFA 

251 LAVESGRLAF EAGPVEARTK AQASTPTVGQ PFWHSAEY* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 465>: 

ml26.seq (partial) 

1 . . CACTATACAA AGGAACCCAT TATGCTCACC CTATACGGCG AAACTTTCCC 

51 CTCGCGGCTG CTGCTCGGCA CGGCTGCCTA CCCGACCCCC GAAATCCTCA 

101 AACAATCCAT CCAAACCGCC CAGCCTGCGA TGATTACCGT CTCGCTGCGC 

151 CGCGCGGGAA GCGGCGGCGA GGCGCACGGT CAGGGGTTTT GGTCGCTGCT 

201 TCAAGAAACC GGCGTTCCCG TCCTGCCGAA CAC GGCAGG C TGCCAAAGCG 

251 TGCAGGAAGC GGTAACGACG GCGCAAATGG CGCGCGAAGT GTTTGAAACC 

301 GATTGGATAA AATTGGAACT CATCGGAGAT GACGACACCT TGCAGCCGGA 

351 TGTGTTCCAG CTTGTCGAAG CGGCGGAAAT C CTG ATT AAA GACGGCTTCA 

4 01 AAGTGCTGCC TTATTGCACC GAAGACCTGA TTGCCTGCCG CCGCCTGCTC 

4 51 GACGCGGGCT GTCAGGCGTT GATGCCGTGG GCGGCCCCGA TCGGCACGGG 

501 TTTGGGCGCG GTTCACGCCT ACGCGTTGAA CGTCCTGCGC GAACGCCTGC 

551 CCGACACGCC GCTGATTATC GACGCGGGCT TGGGTTTGCC CTCACAGGCG 

6 01 GCACAAGTGA TGGAATGGGG CTTTGACGGC GTGCTTTTGA AT ACTGC CGT 

651 TTCCCGCAGC GGCGATCCGG TCAATATGGC ACGCGCCTTC GCACTCGCCG 

701 TCGAATCCGG ACGGC TGGC A TTTGAAGCCG GACCGGTCGA AGCACGCGAC 

751 AAAGCGCAAG CCAGCACGCC GACAGT CGG A CAACCGTTTT GGCATTCGGC 

801 GGAATATTGA 



This corresponds to the amino acid sequence <SEQ ED 466; ORF 126>: 

ml26 .pep (partial) 

1 . . HYTKEPIMLT LYGETFPSRL LLGTAAYPTP EILKQSIQTA QPAMITVSLR 
51 RAGSGGEAHG QGFWSLLQET GVPVLPNTAG CQSVQEAVTT AQMAREVFET 
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101 DWIKLELIGD DDTLQPDVFQ LVEAAEILIK DGFKVLPYCT EDLI ACRRLL 

151 DAGCQALMPW AAPIGTGLGA VHAYALNVLR ERLPDTPLI I DAGLGLPSQA 

201 AQVMEWGFDG VLLNTAVSRS GDPVNMARAF ALAVESGRLA FEAGPVEARD 

251 KAQASTPTVG QPFWHSAEY* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 126 shows 95.9% identity over a 269 aa overlap with a predicted ORF (ORF 126.ng) 
from N. gonorrhoeae: 

ml26/gl26 

10 20 30 40 

ml26 pep HYTKEP IMLTLYGETFPSRLLLGTAAYPTPE I LKQS I QTAQ 

-III I I I 1 i i I ! I 1 I I I I I I I I 1 I I I 1 I i II 1 ! 1 = = t ! = 
g i2 6 MPS ETPKARRRLSDGIASDNHTKES IMLTLYGETFPSRLLLGTAAYPTPE I LKQSVRTAR 

10 20 30 40 50 60 

50 60 70 80 90 100 

ml2 6 pep PAMITVSLRRAGSGGEAHGQGFWSLLQETGVPVLPNTAGCQSVQEAVTTAQMAREVFETD 

MIMIIIIhl IIIIIIIIMIIMIIMIIIIMIIIIMIIIIIMIIIIIMill 

gl2 6 PAMITVSLRRTGCGGEAHGQGFWSLLQETGVPVLPNTAGCQSVQEAVTTAQMAREVFETD 

70 80 90 100 110 120 

110 120 130 140 150 160 

ml2 6 pep WIKLELIGDDDTLQPDVFQLVEAAEILIKDGFKVLPYCTEDLIACRRLLDAGCQALMPWA 

1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r i f 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r i 

gl2 6 W I KLE L I GDDD T LQ PDV FQ L VE AAE I L I KDG F KVL PYCTEDLI ACRRLLD AGCQ ALM P W A 

130 140 150 160 170 180 

170 180 190 200 210 220 

ml26 pep APIGTGLGAVHAYALNVLRERLPDTPLIIDAGLGLPSQAAQVMEWGFDGVLLNTAVSRSG 

I ' I ! I I I 111 I : ii-l I Ml I III I I I I I I I I 1 I I . i I ' M ' I I : i MM II I I ! 

g i26 APIGTGLGAVHAYALKI LRERLPDTPLI IDAGLGLPSQAAQVMEWGFDGVLLNTAVSRSG 

190 200 210 220 230 240 

230 240 250 260 270 

ml 2 6 . pep DPVNMARAFALAVESGRLAFEAGPVEARDKAQASTPTVGQPFWHSAEYX 

I Ml M M MM | MIMM IMIMI M M I I I I I i M 1 1 I I M M M 

g i26 DPVNMARAFALAVESGRLAFEAGPVEARTKAQASTPTVGQPFWHSAEYX 

250 260 270 280 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 467>: 

al26 . seq 

1 TTGTTAATCC ACTATACAAA GGAACCCATT ATGCTCACCC TGTACAGCGA 

51 AACTTTCCCT TCGCGGCTGC TGCTCGGCAC AGCCGCCTAC CCGACCCCTG 

101 AAATCCTCAA ACAATCCGTC CGAACCGCCC GGCCCGCGAT GATTACCGTC 

151 TCGCTGCGCC GCGCGGGATG CGGCGGCGAG GCGCACGGTC AGGGGTTTTG 

201 GTCGCTGCTT CAAGAAACCG GCGTTCCCGT CCTGCCGAAC ACGGCAGGCT 

251 GCCAAAGCGT GCAGGAAGCG GTAACGACGG CGCAAATGGC GCGCGAAGTG 

301 TTTGAAACCG AT T GG ATT AA ACTCGAACTC ATCGGCGACG ACGACACCTT 

351 GCAGCCGGAT GTGTTCCAAC TTGTCGAAGC GGCGGAAATC CTGATTAAAG 

4 01 ACGGCTTCAA AGTGCTGCCT TATTGCACCG AAGACCTGAT TGCCTGCCGC 

4 51 CGCCTGCTCG ACGCGGGCTG TCAGGCGTTG ATGCCGTGGG CGGCCCCGAT 

501 CGGCACGGGT TTGGGCGCGG TTCACGCCTA CGCGTTGAAC GTCCTGCGCG 

551 AACGCCTGCC CGACACGCCG CTGATTATCG ACGCGGGCTT GGGTTTGCCC 

601 TCACAGGCGG CACAAGTGAT GGAATGGGGC TTTGACGGCG TGCTTTTGAA 

651 TACTGCCGTT TCCCGCAGCG GCGATCCGGT CAATATGGCA CGCGCCTTCG 

701 CACTCGCCGT CGAATCCGGA CGGCTGGCAT TTGAAGCCGG ACCGGTCGAA 

7 51 GCACGCGACA AAGCGCAAGC CAGCACGCCG ACAGTCGGAC AACCGTTTTG 

801 GCATTCGGCG GAATATTGA 
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This corresponds to the amino acid sequence <SEQ ID 468; ORF 126.a>: 

a 12 6. pep 

1 LLIHYTKEPI MLTLYSETFP SRLLLGTAAY PTPEILKQSV RTARPAMITV 

51 SLRRAGCGGE AHGQGFWSLL QETGVPVLPN TAGCQSVQEA VTTAQMAREV 

101 FETDWIKLEL IGDDDTLQPD VFQLVEAAEI LIKDGFKVLP YCTEDLIACR 

151 RLLDAGCQAL MPWAAPIGTG LGAVHAYALN VLRERLPDTP LIIDAGLGLP 

201 SQAAQVMEWG FDGVLLNTAV SRSGDPVNMA RAFALAVESG RLAFEAGPVE 

251 ARDKAQASTP TVGQPFWHS A EY * 

ml26/a!26 98.1% identity in 269 aa overlap 

10 20 30 40 50 

HYTKEPIMLTLYGETFPSRLLLGTAAYPTPEILKQSIQTAQPAMITVSLRRAGSGGE 
I I I I I I I I I I I I : M I I ! I I I M ! I I I II I I I I I M M : It I I I I I M I I I III 
LLIHYTKEPIMLTLYSETFPSRLLLGTAAYPTPEILKQSVRTARPAMITVSLRRAGCGGE 
10 20 30 40 50 60 

60 70 80 90 100 110 

AHGQGFWSLLQETGVPVLPNTAGCQSVQEAVTTAQMAREVFETDWIKLELIGDDDTLQPD 
t | | M I I I t t f I I II I I I I I I I M I I It I M I I I I I I I I I M I 1 I II I I II I I I I I I I I I 
AHGQGFWSLLQETGVPVLPNTAGCQSVQEAVTTAQMAREVFETDW I KLEL IGDDDTLQPD 
70 80 90 100 110 120 

120 130 140 150 160 170 

VFQLVEAAEILIKDGFKVLPYCTEDLIACRRLLDAGCQALMPWAAPIGTGLGAVHAYALN 

M | | | | | | || M I I I I I I I I I I I I II I II I I I I t I I I II I I II M I I I I I I I I I I I I I I I 
VFQLVEAAEILIKDGFKVLPYCTEDLIACRRLLDAGCQALMPWAAPIGTGLGAVHAYALN 
130 140 150 160 170 180 

180 190 200 210 220 230 

VLRERLPDTPLI I DAGLGLPSQAAQVMEWGFDGVLLNTAVSRSGDPVNMARAFALAVESG 

I | | | I | I I U I It I I I I I I I I I M I I I I I I II I t I I II II I II M I I I I II I I I M I M I 
VLRERLPDTPLI I DAGLGLPSQAAQVMEWGFDGVLLNTAVSRSGDPVNMARAFALAVESG 

190 200 210 220 230 240 

240 250 260 270 

RLAFEAGPVEARDKAQASTPTVGQPFWHSAEYX 

II I M I I I I I I I I I M I I I I I I I I M 1 M I I M 
RLAFEAGPVEARDKAQASTPTVGQPFWHSAEYX 

250 260 270 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 469>: 

gl26-l . seq 

1 ATGCTCACCC TGTACGGCGA AACTTTCCCT TCGCGGCTGC TGCTCGGCAC 

51 GGCCGCCTAC CCGACCCCTG AAATCCTCAA ACAATCCGTC CGAACCGCCC 

101 GGCCCGCGAT GATTACCGTC TCGCTGCGCC GCACGGGATG CGGCGGCGAG 

151 GCGCACGGTC AGGGGTTTTG GTCGCTGCTT CAAGAAACCG GCGTTCCCGT 

201 CCTGCCGAAC ACGGCAGGCT GCCAAAGCGT GCAGGAAGCG GTAACGACGG 

251 CGCAAATGGC GCGCGAAGTG TTTGAAACCG AT TGG AT AAA ATTGGAACTC 

301 ATCGGCGACG ACGACACCTT GCAGCCGGAC GTGTTCCAAC TCGTCGAAGC 

351 GGCGGAAATC CTGATTAAAG ACGGCTTCAA AGTGCTGCCT TATTGCACCG 

4 01 AAGACCTGAT TGCCTGCCGC CGCCTGCTCG ATGCGGGCTG TCAGGCGTTG 

4 51 ATGCCGTGGG CGGCTCCCAT CGGCACGGGT TTGGGGGCGG TTCACGCCTA 

501 TGCGCTCAAA ATCCTGCGCG AACGCCTGCC CGACACGCCG CTGATTATCG 

551 ACGCGGGCTT GGGTTTGCCT TCCCAAGCGG CACAAGTGAT GGAATGGGGT 

601 TTTGACGGCG TATTGTTAAA CACCGCCGTT TCCCGCAGCG GCGACCCCGT 

651 CAACATGGCG CGCGCCTTCG CACTCGCCGT CGAATCCGGA CGGCTGGCAT 

701 TTGAAGCCGG GCCGGTCGAA GCGCGAACCA AAGCCCAAGC CAGCACGCCG 

7 51 ACAGTCGGAC AACCGTTTTG GCATTCGGCG GAATATTGA 

This corresponds to the amino acid sequence <SEQ ID 470; ORF 126-l.ng>: 

gl26-l -pep 

1 MLTLYGETFP SRLLLGTAAY PTPEILKQSV RTARPAMITV SLRRTGCGGE 



ml 2 6 . pep 
al26 

ml26 . pep 
al26 

ml2 6 . pep 
al26 

ml2 6 . pep 
al26 

ml26 . pep 
al26 
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51 AHGQGFWSLL QETGVPVLPN 

101 IGDDDTLQPD VFQLVEAAEI 

151 MPWAAPIGTG LGAVHAYALK 

201 FDGVLLNTAV SRSGDPVNMA 

251 TVGQPFWHSA EY* 



TAGCQSVQEA VTTAQMAREV FETDWIKLEL 
LIKDGFKVLP YCTEDLI ACR RLLDAGCQAL 
ILRERLPDTP LIIDAGLGLP SQAAQVMEWG 
RAFALAVESG RLAFEAGPVE ARTKAQASTP 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 47 1>: 

ml26-l . seq 

1 ATGCTCACCC TATACGGCGA AACTTTCCCC TCGCGGCTGC TGCTCGGCAC 

51 GGCTGCCTAC CCGACCCCCG AAATCCTCAA ACAATCCATC CAAACCGCCC 

101 AGCCTGCGAT GATTACCGTC TCGCTGCGCC GCGCGGGAAG CGGCGGCGAG 

151 GCGCACGGTC AGGGGTTTTG GTCGCTGCTT CAAGAAACCG GCGTTCCCGT 

201 CCTGCCGAAC ACGGCAGGCT GCCAAAGCGT GCAGGAAGCG GTAACGACGG 

251 CGCAAATGGC GCGCGAAGTG TTTGAAACCG ATTGGATAAA ATTGGAACTC 

301 ATCGGAGATG ACGACACCTT GCAGCCGGAT GTGTTCCAGC TTGTCGAAGC 

351 GGCGGAAATC CTGATTAAAG ACGGCTTCAA AGTGCTGCCT TATTGCACCG 

401 AAGACCTGAT TGCCTGCCGC CGCCTGCTCG ACGCGGGCTG TCAGGCGTTG 

451 ATGCCGTGGG CGGCCCCGAT CGGCACGGGT TTGGGCGCGG TTCACGCCTA 

501 CGCGTTGAAC GTCCTGCGCG AACGCCTGCC CGACACGCCG CTGATTATCG 

551 ACGCGGGCTT GGGTTTGCCC TCACAGGCGG CACAAGTGAT GGAATGGGGC 

601 TTTGACGGCG TGCTTTTGAA TACTGCCGTT TCCCGCAGCG GCGATCCGGT 

651 CAATATGGCA CGCGCCTTCG CACTCGCCGT CGAATCCGGA CGGCTGGCAT 

701 TTGAAGCCGG ACCGGTCGAA GCACGCGACA AAGCGCAAGC CAGCACGCCG 

751 ACAGTCGGAC AACCGTTTTG GCATTCGGCG GAATATTGA 

This corresponds to the amino acid sequence <SEQ ID 5; ORF 126-1>: 

ml26-l -pep 

1 MLTLYGETFP SRLLLGTAAY PTPEILKQSI QTAQPAMITV SLRRAGSGGE 

51 AHGQGFWSLL QETGVPVLPN TAGCQSVQEA VTTAQMAREV FETDWIKLEL 

101 IGDDDTLQPD VFQLVEAAEI LIKDGFKVLP YCTEDLI ACR RLLDAGCQAL 

151 MPWAAPIGTG LGAVHAYALN VLRERLPDTP LIIDAGLGLP SQAAQVMEWG 

201 FDGVLLNTAV SRSGDPVNMA RAFALAVESG RLAFEAGPVE ARDKAQASTP 

2 51 TVGQPFWHSA EY* 



ml26-l/gl26-l 96.9% identity in 262 aa overlap 

10 20 30 40 50 60 

ml2 6-l pep MLTLYGETFPSRLLLGTAAYPTPEILKQS I QTAQPAMITVSLRRAGSGGE AHGQGFWSLL 
I I I I I M j I I I I I I I I I 1 I 1 I 1 I I I 11 I I :: I I : I I I I I I I I I I : I I I I I I I I 1 I I I I I 
gl2 6-l MLTLYGETFPSRLLLGTAAYPTPEILKQSVRTARPAMITVSLRRTGCGGEAHGQGFWSLL 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml 2 6-1 . pep QETGVPVLPNTAGCQSVQEAVTTAQMAREVFETDWIKLELIGDDDTLQPDVFQLVEAAEI 
I I j I I I I I I I I I I I I I I I I I 1 I M I I I I I I I I I I I I I I I I I I I t I I I I I I I I I I I I I I I I 
g 1 2 6- 1 QETGVPVLPNTAGCQS VQEAVTTAQMAREVFETDWI KLELI GDDDTLQPDVFQLVEAAEI 

70 80 . 90 100 110 120 

130 140 150 160 170 180 

ml 2 6-1 . pep LIKDGFKVLPYCTEDLIACRRLLDAGCQALMPWAAPIGTGLGAVHAYALNVLRERLPDTP 
M i I ! I I I I I 1 I I I I I I ! t I I I I I I I t I I I I I I I I I I I I I I I I I I I I I I I I i I I i I I I 
g 12 6-1 LIKDGFKVLP YCTEDLI ACRRLLDAGCQALMPWAAPIGTGLGAVHAYALKILRERLPDTP 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml2 6-l .pep LI I DAGLGLPSQAAQVMEWGFDGVLLNTAVSRSGDPVNMARAFALAVESGRLAFEAGPVE 
M I I I I I I I I I I I I I 1 I I I i I I I I I I I 1 11 I I I I I I I I I I I I t I I 1 t I I 1 I I I I I I I I I I 
g 12 6-1 LIIDAGLGLPSQAAQVMEWGFDGVLLNTAVSRSGDPVNMARAFALAVESGRLAFEAGPVE 
190 200 210 220 230 240 



250 260 
ml26-l.pep ARDKAQASTPTVGQPFWHSAEYX 
II I I I 1 1 I I I 1 I I I I I I II I I I 
g 1 2 6- 1 ARTKAQASTPTVGQPFWHSAEYX 

250 260 

The following partial DNA sequence was identified in W meningitidis <SEQ ID 473>: 

al26-l . seq 

1 ATGCTCACCC TGTACAGCGA AACTTTCCCT TCGCGGCTGC TGCTCGGCAC 

51 AGCCGCCTAC CCGACCCCTG AAATCCTCAA ACAATCCGTC CGAACCGCCC 

101 GGCCCGCGAT GATTACCGTC TCGCTGCGCC GCGCGGGATG CGGCGGCGAG 
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151 GCGCACGGTC AGGGGTTTTG GTCGCTGCTT CAAGAAACCG GCGTTCCCGT 

201 CCTGCCGAAC ACGGCAGGCT GCCAAAGCGT GCAGGAAGCG GTAACGACGG 

251 CGCAAATGGC GCGCGAAGTG TTTGAAACCG ATTGGATTAA ACTCGAACTC 

301 ATCGGCGACG ACGACACCTT GCAGCCGGAT GTGTTCCAAC TTGTCGAAGC 

351 GGCGGAAATC CTGATTAAAG ACGGCTTCAA AGTGCTGCCT TATTGCACCG 

401 AAGACCTGAT TGCCTGCCGC CGCCTGCTCG ACGCGGGCTG TCAGGCGTTG 

4 51 ATGCCGTGGG CGGCCCCGAT CGGCACGGGT TTGGGCGCGG TTCACGCCTA 

501 CGCGTTGAAC GTCCTGCGCG AACGCCTGCC CGACACGCCG CTGATTATCG 

551 ACGCGGGCTT GGGTTTGCCC TCACAGGCGG CACAAGTGAT GGAATGGGGC 

601 TTTGACGGCG TGCTTTTGAA TACTGCCGTT TCCCGCAGCG GCGATCCGGT 

651 CAATATGGCA CGCGCCTTCG CACTCGCCGT CGAATCCGGA CGGCTGGCAT 

701 TTGAAGCCGG ACCGGTCGAA GCACGCGACA AAGCGCAAGC CAGCACGCCG 

751 ACAGTCGGAC AACCGTTTTG GCATTCGGCG GAATATTGA 

This corresponds to the amino acid sequence <SEQ ID 474; ORF 126- La>: 

al26-l .pep 

1 MLTLYSETFP SRLLLGTAAY PTPEILKQSV RTARPAMITV SLRRAGCGGE 

51 AHGQGFWSLL QETGVPVLPN TAGCQSVQEA VTTAQMAREV FETDWIKLEL 

101 IGDDDTLQPD VFQLVEAAEI LIKDGFKVLP YCTEDLIACR RLLDAGCQAL 

151 MPWAAP I GTG LGAVHAYALN VLRERLPDTP LIIDAGLGLP SQAAQVMEWG 

201 FDGVLLNTAV SRSGDPVNMA RAFALAVESG RLAFEAGPVE ARDKAQASTP 

251 TVGQPFWHSA EY* 

al26-l/ml26-l 98.1% identity in 262 aa overlap 

10 20 30 40 50 60 

a 12 6-1. pep MLTLYSETFPSRLLLGTAAYPTPEILKQSVRTARPAMI TVS LRRAGCGGE AHGQGFWSLL 
I I I I I : I I [ I I I ! I I 1 I I I I I I! I I ! I II ::[ I : i !! I I I I I II I I I I I I I I I I I I I I I 
ml2 6-l MLTLYGETFPSRLLLGTAAYPTPEILKQSIQTAQPAMITVSLRRAGSGGEAHGQGFWSLL 

10 20 30 40 50 60 

70 80 90 100 110 120 

al2 6-l.pep QETGVPVLPNTAGCQSVQEAVTTAQMAREVFETDWIKLELIGDDDTLQPDVFQLVEAAEI 
1 f I 1 ! I II II I I I I I I II I i I I I I I I M I I I I It M I I I I 11 I ! I I li I I I I It I I I I I I 
ml2 6-l QETGVPVLPNTAGCQSVQEAVTTAQMAREVFETDWIKLELIGDDDTLQPDVFQLVEAAEI 
70 80 90 100 110 120 

130 140 150 160 170 180 

al2 6-l.pep LIKDGFKVLPYCTEDLIACRRLLDAGCQALMPWAAPIGTGLGAVHAYALNVLRERLPDTP 
| I I I I I I I M I I I I I I I I i I I I I I I I I I I I I I I I t I 1 I 1 I I I I I I I I I I t I I I I I I I I I I 
ml2 6- 1 LIKDGFKVLPYCTEDLIACRRLLDAGCQALMPWAAPIGTGLGAVHAYALNVLRERLPDTP 
130 140 150 160 170 180 

190 200 210 220 230 240 

a 12 6-1. pep LIIDAGLGLPSQAAQVMEWGFDGVLLNTAVSRSGDPVNMARAFALAVESGRLAFEAGPVE 

I I I I I I I I I I I I I I I I I I I I I I I t I I M I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
ml 2 6-1 LIIDAGLGLPSQAAQVMEWGFDGVLLNTAVSRSGDPVNMARAFALAVESGRLAFEAGPVE 
190 200 210 220 230 240 

250 260 
al26-l.pep ARDKAQASTPTVGQPFWHSAEYX 
I I I I 1 I I I I I I I I I I I I I II I I I 
ml2 6-l ARDKAQASTPTVGQPFWHSAEYX 

250 260 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 475>: 

gl27 . seq 

1 ATGGAAATAT GGAATATGTT GAACACTTGG CCCGATGCCG TCCCGATACG 

51 CGCGGAGGCG GCCGAATCCG TGGCGGCGGT CGCGGCTTTG CTGCTGGCGC 

101 GCGCCCTTCT GTTGAAT AT C CACTT CAG AC GGCATCCGGA TTTCGGCATC 

151 GAAAGCAAGC GGCGGTTTTT GGTTGCCAGC CGCAATATAA CGCTGCTTTT 

201 GGTGCTGTTT TCGCTGGCAT TTATCTGGTC GGCGCAAATT CAAACGCTGG 

251 CTTTGTCGAT GTTTGCGGTG GCGGCGGCGG TCGTCGTGGC GACAAAAGAA 

3 01 CTGATTATGT GTCTGTCGGG CAGTATTTTA aggtCtGCCA CCCAGCAATA 
351 CTCGGTCGGC GACTATATCG AAATCAACGG CCTGCGCGGG CGCGTGGTCG 

4 01 AC AT CAAT CT GTTGAACACG CTGATGATGC AGGTCGGTCC GAACCCCTTG 
4 51 GTCGGACAGC TTGCGGGAAC CACCGTTTCT TTCCCCAACA GCCTGTTGTT 
501 GAGCCACCCC GTGCGCCGCG ACAATATTTT GGGCGACTAT GTCATCCATA 
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551 CGGTCGAAAT CCCCGTTCCC ATCCATTTGG ATTCGGATGA AGCCGTATGC 

601 CGTCTGAAAG CCGTACTCGA GCCCTTGTGC GCGCCCTACA TCCCCGCCAT 

651 TCAGCGGTAT TTGGAAAACG TGCAGGCGGA AAAACTGTTT ATCACGCCCG 

701 CCGCCAGGCC GCGCGTTACC CGCGTACCGT ACGACGACAA GG C AT AC C G C 

751 ATCATCGTCC GCTTCGCCTC CCCCGTTTCA AAGCGGCTGG AAATCCAACA 

801 GG CGGTT AT G GACGAATTTT TGCGCGTACA ATACCGCCTG TTAAATCATC 

851 CCGCCGgctC cgAAACACTT TAA 



This corresponds to the amino acid sequence <SEQ ID 476; ORF 127.ng>: 

gl27 . pep 

1 ME I WNMLNTW PDAVP I RAE A AES VAAVAAL LLARALLLNI HFRRHPDFGI 
51 ESKRRFLVAS RNITLLLVLF SLAFIWSAQI Q TLALSMFAV AAAVWAT KE 



101 LIMCLSGSIL RSATQQYSVG DYIEINGLRG RWDINLLNT LMMQVGPNPL 

151 VGQLAGTTVS FPNSLLLSHP VRRDNILGDY VIHTVEIPVP IHLDSDEAVC 

201 RLKAVLEPLC APYIPAIQRY LENVQAEKLF I TPAARPRVT RVPYDDKAYR 

251 IIVRFASPVS KRLEIQQAVM DEFLRVQYRL LNHPAGSETL * 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 477>: 



1 ATGGAAATAT GGAATATGTT GGACACTTGG CTCGGTGCCG TCCCGATACG 

51 TGCGGAGGCG GTCGAATCCG TGGCGGCGGT TGCGGCTTTG CTGCTGGCGC 

101 GCGCCCTTCT GTTGAATATC CACTTCAAAC GGCATCCGGA TTTCGG CAT C 

151 GAAAGCAAGC GGCGGTTTTT GGTTGCCAGC CGCAATATAA CGCTGCTTTT 

201 GGTGCTGTTT TCGCTGGCAT TTATCTGGTC GGCGCAAATC CAAACGCTGG 

2 51 CTTTGTCGAT GTTTGCGGTG GCGGCGGCGG TCGTCGTGGC GACGAAGGAA 

3 01 CTGATTATGT GTCTGTCGGG CAGTATTTTA AGGTCTGCCA CCCAGCAATA 
351 CTCGGTCGGC GACTATATCG AAATCAACGG CCTGCGCGGG CGCGTGGTCG 

4 01 ACATCAACCT GTTGAACACG CTGATGATGC AGGTCGGTCC GAACCCCTTG 

4 51 GTCGGACAGC TTGCGGGAAC CACCGTTTCT TTCCCCAACA GCCTGTTGTT 

5 01 GAGCCACCCC GTGCGCCGCG ACAATATTTT GGGCGACTAT GTCATCCATA 
551 CGGTCGAAAT CCCCGTTCCC ATCCATTTGG ATTCGGATGA AGCCGTATGC 

6 01 CGTCTGAAAG CCGTACTCGA GCCCTTGTGC GCGCCCTACA TCCCCGCCAT 
651 CCAACGGsAT TTGGAAAACG TGCAGGCGGA AAAACTGTTT ATCACGCCCG 

7 01 CCGCCAGACC GCGCGTTACC CGCGTGCCGT ACGATGACAA GGCATACCGC 

7 51 ATCATCGTCC GCTTCGCTTC CCCCGTTTCA AAGCGGCTGG AAATCCAACA 

8 01 GGCGGTTATG GACGAATTTT TGCGCGTACA ATACCGCCTG TTAAATCACC 
8 51 CCGCCGGCTC CGAAACACTT TAA 



This corresponds to the amino acid sequence <SEQ ID 478; ORF 127>: 

ml27 -pep 

1 MEIWNMLDTW LGAVPIRAEA VES VAAVAAL LLARALLLNI HFKRHPDFGI 



51 ESKRRFLVAS R NITLLLVLF SLAFIWSA QI Q TLALSMFAV AAAVWAT KE 

101 LIMCLSGSIL RSATQQYSVG DYIEINGLRG RWDINLLNT LMMQVGPNPL 

151 VGQLAGTTVS FPNSLLLSHP VRRDNILGDY VIHTVEIPVP IHLDSDEAVC 

2 01 RLKAVLEPLC APYIPAIQRX LENVQAEKLF I TPAARPRVT RVPYDDKAYR 

251 IIVRFASPVS KRLEIQQAVM DEFLRVQYRL LNHPAGSETL * 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 127 shows 97.9% identity over a 290 aa overlap with a predicted ORF (ORF 127.ng) 
from TV. gonorrhoeae: 

ml27/gl27 



ml27 . seq 



ml2 7 .pep 



10 20 30 40 50 60 
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The following partial DNA sequence was identified in N. meningitidis <SEQ ID 479>: 

al27 . seq 

1 ATGGAAATAT GGAATATGTT GGACACTTGG CTCGGTGCCG TCCCGATACG 

51 TGCGGAGGCG GTCGAATCCG TGGCGGTGGT CGCGGCTTTG CTGCTGGCGC 

101 GCGCCCTTCT GTTGAATATC CACTTCAAAC GGCATCCGGA TTTCGGCATC 

151 GAAAGCAAGC GGCGGTTTTT GGTTGCCAGC CGCAATATAA CGCTGCTTTT 

201 GGTGCTGTTT TCGCTGGCAT TTATCTGGTC GGCGCAAATC CAAACGCTGG 

251 CTTTGTCGAT GTTTGCGGTG GCGGCGGCGG TCGTCGTGGC GACGAAGGAA 

301 CTGATTATGT GTCTGTCGGG CAGCATTTTA AGGTCTGCCA CCCAGCAATA 

351 CTCGGTCGGC G ACT AT AT CG AAATCAACGG CCTGCGCGGG CGCGTGGTCG 

4 01 ACATCAACCT GTTGAACACG CTGATGATGC AGGTCGGTCC GAACCCCTTG 

4 51 GTCGGACAGC TTGCGGGAAC CACCGTTTCT TTCCCCAACA GCCTGTTGTT 

501 GAGCCACCCC GTGCGCCGCG ACAATATTTT GGGCGACTAC GTCATCCATA 

551 CGGTCGAAAT CCCGGTTCCC ATCCATTTGG ATTCGGATGA AGCCGTATGC 

601 CGTCTGAAAG CCGTACTCGA GCCCTTGTGC GCGCCCTACA TCCCCGCCAT 

651 CCAACGGCAT TTGGAAAACG TGCAGGCGGA AAAACTGTTT ATCACGCCCG 

7 01 CCGCCAAACC GCGCGTTACC CGCGTGCCGT ACGATGACAA GGCATACCGC 

7 51 ATCATCGTCC GCTTCGCCTC CCCCGTTTCA AAGCGGCTGG AAAT CCAACA 

801 GGCGGTTATG GACGAATTTT TGCGCGTACA ATACCGCCTG TTAAATTACC 

851 CCGCCGGCTC CGAAACACTT TAA 

This corresponds to the amino acid sequence <SEQ ID 480; ORF I27.a>: 

al27 . pep 

1 MEIWNMLDTW LGAVPIRAEA VE S V A WAAL LLARALLLN I HFKRHPDFGI 

51 ESKRRFLVAS RN ITLLLVLF SLAFIWSA QI Q TLALSMFAV AAAVWAT KE 

101 LIMCLSGSIL RSATQQYSVG DYIEINGLRG RWDINLLNT LMMQVGPNPL 

151 VGQLAGTTVS FPNSLLLSHP VRRDNILGDY VIHTVEIPVP IHLDSDEAVC 

201 RLKAVLEPLC APYIPAIQRH LENVQAEKLF ITPAAKPRVT RVPYDDKAYR 

251 IIVRFASPVS KRLEIQQAVM DEFLRVQYRL LNYPAGSETL * 

ml27/al27 98.6% identity in 290 aa overlap 
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The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 48 1>: 

gl28 . seq 

1 atgattgaca acgCActgct ccacttgggc gaagaaccCC GTTTTaatca 

51 aatccaaacc gaagACAtca AACCCGCCGT CCAAACCGCC ATCGCCGAAG 

101 CGCGCGGACA AATCGCCGCC GTCAAAGCGC AAACGCACAC CGGCTGGGCG 

151 AACACCGTCG AGCGTCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 

201 GGGCGTCGTG TCCCATCTCA ACTCCGTCGT CGACACGCCC GAACTGCGCG 

251 CCGTCTATAA CGAACTGATG CCTGAAATCA CCGTCTTCTT CACCGAAATC 

3 01 GGACAAGACA TCGAACTGTA CAACCGCTTC AAAACCATCA AAAATTCCCC 

3 51 CGAATTTGCA ACGCTTTCCC CCGCACAAAA AACCAAGCTC GATCACGACC 

4 01 TGCGCGATTT CGTATTGAGC GGCGCGGAAC TGCCGCCCGA ACGGCAGGCA 
4 51 GAACTGGCAA AACTGCAAAC CGAAGGCGCG CAACTTTCCG CCAAATTCTC 
501 CCAAAACGTC CTAGACGCGA CCGACGCGTT CGGCATTTAC TTTGACGATG 
551 CCGCACCGCT TGCCGGCATT CCCGAAGACG CGCTCGCCAT GTTTGCCGCC 
601 GCCGCGCAAA GCGAAGGCAA AACAGGTTAC AAAATCGGCT TGCAGATTCC 
651 GCACTACCTT GCCGTTATCC AATACGCCGG CAACCGCGAA CTGCGCGAAC 
701 AAATCTACCG CGCCTACGTT ACCCGTGCCA GCGAACTTTC AAACGACGGC 
751 AAATTC G AC A ACACCGCCAA CATCGACCGC ACGCTCGAAA ACGCATTGAA 
801 AACCGccaaa cTGCTCGGCT TTAAAAATTA CGCCGAATTG TCGCTGGCAA 
851 CCAAAATGGC GGACACGCCC GAACAGGTTT TAAACTTCCT GCACGACCTC 
901 GCCCGCCGCG CCAAACCCTA CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 
951 CTTCGCCCGC GAACACCTCG GTCTCGCCGA CCCGCAGCCG TGGGACTTGA 

1001 GCTACGCCGG CGAAAAACTG CGCGAAGCCA AATACGCATT CAGCGAAACC 

1051 GAAGTCAAAA AATACTTCCC CGTCGGCAAA GTTCTGGCAG GCCTGTTCGC 

1101 CCAAATCAAA AAACTCTACG GCATCGGATT CGCCGAAAAA ACCGTTCCCG 

1151 TCTGGCACAA AGACGTGCGC TATTTTGAAT TGCAACAAAA CGGCAAAACC 

1201 ATCGGCGGCG TTTATATGGA TTTGTACGCA CGCGAAGGCA AACGCGGCGG 

1251 CGCGTGGATG AACGACtaca AAGGCCGCCG CCGCTTTGCC GACGgcacGC 

1301 TGCAACTGCC CACCGCCTAC CTCGTCTGCA ACTTCGCCCC GCCCGTCGGC 

13 51 GGCAAAGAAG CGCGTTTAAG CCACGACGAA ATCCTCACCC TCTTCCACGA 

14 01 AacCGGCCAC GGACTGCACC ACCTGCTTAC CCAAGTGGAC GAACTGGGCG 
14 51 TGTC CGG CAT CAAcggcgtA GAATGGGACG CGGT CGAACT GCCCAGCCAG 
1501 TTTATGGAAA ACTTCGTTTG GGAATACAAT GTATTGGCAC AAATGTC CGC 
1551 CCACGAAGAA AccgGCGAGC CCCTGCCGAA AGAACTCTTC GACAAAATGC 
1601 TcgcCGCCAA AAACTTCCAG CGCGGTATGT TCCTCGTCCG GCAAATGGAG 
1651 TTCGCCCTCT TCGATATGAT GATTTACAGT GAAAGCGACG AATGCCGTCT 
1701 GAAAAACTGG CAGCAGGTTT TAGACAGCGT GCGCAAAGAA GTcGCCGTCA 
17 51 TCCAACCGCC CGAATACAAC CGCTTCGCCA ACAGCTT CGG CCacatCtTC 
1801 GCcggcGGCT ATTCCGCAGG CTATTACAGC TACGCATGGG CCGAAGTCCt 
1851 cAGCACCGAT GCCTACGCCG CCTTTGAAGA AAGcGACGac gtcGCCGCCA 
1901 CAGGCAAACG CTTCTGGCAA GAAAtccttg ccgtcggcgg ctCCCGCAGC 
1951 gcgGCGGAAT CCTTCAAAGC CTTCCGCGGA CGCGAACCGA GCATAGACGC 
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2001 ACTGCTGCGC CAaagcggtT TCGACAACGC gGCttgA 

This corresponds to the amino acid sequence <SEQ ID 482; ORF 128.ng>: 

gl28-pep 

1 MIDNALLHLG EEPRFNQIQT EDIKPAVQTA IAEARGQIAA VKAQTHTGWA 

51 NTVERLTGIT ERVGRIWGW SHLNSWDTP ELRAVYNELM PEITVFFTEI 

101 GQDIELYNRF KTI KNSPEFA TLSPAQKTKL DHDLRDFVLS GAELPPERQA 

151 ELAKLQTEGA QLSAKFSQNV LDATDAFG I Y FDD AA P LAG I PEDALAMFAA 

201 AAQSEGKTGY KIGLQIPHYL AVIQYAGNRE LREQI YRAYV TRAS ELSNDG 

2 51 KFDNTANIDR TLENALKTAK LLGFKNYAEL SLiATKMADTP EQVLNFLHDL 

3 01 ARRAKPYAEK DLAEVKAFAR EHLGLADPQP WDLSYAGEKL REAKYAFSET 

3 51 EVKKYFPVGK VLAGLFAQIK KLYGIGFAEK TVPVWHKDVR YFELQQNGKT 
401 IGGVYMDLYA REG KRGGAWM NDYKGRRRFA DGTLQLPTAY LVCNFAPPVG 

4 51 GKEARLSHDE ILTLFHETGH GLHHLLTQVD ELGVSGINGV EWDAVELPSQ 
501 FMENFVWEYN VLAQMSAHEE TGEPLPKELF DKMLAAKNFQ RGMFLVRQME 
551 FALFDMMIYS ESDECRLKNW QQVLDSVRKE VAVIQPPEYN RFANSFGHIF 
601 AGGYSAGYYS YAWAEVLSTD AYAAFEESDD V AATGKR FWQ EILAVGGSRS 
651 AAESFKAFRG REPSIDALLR QSGFDNAA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 483>: 

ml28.seq (partial) 

1 ATGACTGACA ACGCACTGCT CCATTTGGGC GAAGAACCCC GTTTTGATCA 

51 AATCAAAACC GAAGACATCA AACCCGCCCT GCAAACCGCC ATCGCCGAAG 

101 CGCGCGAACA AATCGCCGCC AT CAAAGC C C AAACGCACAC CGGCTGGGCA 

151 AACACTGTCG AACCCCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 

201 GGGCGTGGTG TCGCACCTCA ACTGCGTCGC CGACACGCCC GAACTGCGCG 

251 CCGTCTATAA CGAACTGATG CCCGAAATCA CCGTCTTCTT CACCGAAATC 

3 01 GGACAAGACA TCGAGCTGTA CAACCGCTTC AAAACCATCA AAAATTCCCC 

351 CGAATTCGAC ACCCTCTCCC CCGCACAAAA AACCAAACTC AACCAC 

1 TACGCCAGCG AAAAACTGCG CGAAGCCAAA TACGCGTTCA GCGAAACCGA 

51 wGTCAAAAAA TAyTTCCCyG TCGGCAAwGT ATTAAACGGA CTGTTCGCCC 

101 AAmTCAAAAA ACTmTACGGC AT CGG ATTT A CCGAAAAAAC yGTCCCCGTC 

151 TGG CAC AAAG ACGTGCGCTA TTkTGAATTG CAACAAAACG GCGAAmCCAT 

2 01 AGGCGGCGTT TATATGGATT TGTACGCACG CGAAGGCAAA CGCGGCGGCG 
251 CGTGGATGAA CGACTACAAA GGCCGCCGCC GTTTTTCAGA CGGCACGCTG 

3 01 CAAyTGCCCA CCGCCTACCT CGTCTGCAAC TTCGCCCCAC CCGTCGGCGG 

3 51 CAGGGAAGCC CGCyTGAGCC ACGACGAAAT CCTCATCCTC TTCCACGAAA 

4 01 CCGGACACGG GCTGCACCAC CTGCTTACCC AAGTGGACGA ACTGGGCGTA 

4 51 TCCGGCATCA ACGGCGTAkA ATGGGACGCG GTCGAACTGC CCAGCCAGTT 
501 TATGGAAAAT TTCGTTTGGG AATACAATGT CTTGGCACAA mTGTCAGCCC 

5 51 ACGAAGAAAC CGGcgTTCCC yTGCCGAAAG AACTCTT sGA CAAAwTGCTC 

6 01 GCCGCCAAAA ACTTCCAAsG CGGCATGTTC yTsGTCCGGC AAwTGGAGTT 
651 CGCCCTCTTT GATATGATGA TTTACAGCGA AGACGACGAA GGCCGTCTGA 

7 01 AAAACTGGCA ACAGGTTTTA GACAGCGTGC GCAAAAAAGT CGCCGTCATC 
7 51 CAGCCGCCCG AATACAACCG CTTCGCCTTG AGCTTCGGCC ACATCTTCGC 
801 AGGCGGCTAT TCCGCAGCTn ATTACAGCTA CGCGTGGGCG GAAGTATTGA 
851 GCGCGGACGC ATACGCCGCC TTTGAAGAAA GCGACGATGT CGCCGCCACA 
901 GGCAAACGCT TTTGGCAGGA AATCCTCGCC GTCGGGGnAT CGCGCAGCGG 
951 nGCAGAATCC TTCAAAG CCT TCCGCGGCCG CGAACCGAGC ATAGACGCAC 

1001 TCTTGCGCCA CAGCGGTTTC GACAACGCGG TCTGA 

This corresponds to the amino acid sequence <SEQ ED 484; ORF 128>: 

ml28.pep (partial) 

1 MTDNALLHLG EEPRFDQIKT EDIKPALQTA IAEAREQIAA I KAQTHTGWA 

51 NTVEPLTGIT ERVGRIWGW SHLNCVADTP ELRAVYNELM PEITVFFTEI 

101 GQDIELYNRF KTIKNSPEFD TLSPAQKTKL NH 



// 



1 YASEKLREAK YAFSETXVKK YFPVGXVLNG LFAQXKKLYG IGFTEKTVPV 

51 WHKDVRYXEL QQNGEXIGGV YMDLYAREGK RGGAWMND Y K GRRRFSDGTL 

101 QLPTAYLVCN FAPPVGGREA RLSHDEILIL FHETGHGLHH LLTQVDELGV 

151 SGINGVXWDA VELPSQFMEN FVWEYNVLAQ XSAHEETGVP LPKELXDKXL 

201 AAKNFQXGMF XVRQXEFALF DMMIYSEDDE GRLKNWQQVL DSVRKKVAVI 

2 51 QPPEYNRFAL SFGHIFAGGY SAAXYSYAWA E VL S AD A Y AA FEES DD V AAT 
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301 GKRFVJQEILA VGXSRSGAES FKAFRGREPS IDALLRHSGF DNAV* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 128 shows 91 .7% identity over a 475 aa overlap with a predicted ORF (ORF 128.ng) 
from N. gonorrhoeae: 

ml28/gl28 
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m i2 8 XVRQXEFALFDMMIYSEDDEGRLKNWQQVLDSVRKKVAVIQPPEYNRFAXiSFGHIFAGGY 
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Ih M i 1 1 1 1 ! I 1 : 1 1 1 1 1 M I M I 1 1 1 1 ! 1 1 i 1 1 M 1 1 1 I llhlMMIIMIIM 
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280 290 300 310 320 330 
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The following partial DNA sequence was identified in N. meningitidis <SEQ ID 485>: 

al28 . seq 

1 ATGACTGACA ACGCACTGCT CCATTTGGGC GAAGAACCCC GTTTTGATCA 

51 AATCAAAACC GAAGACATCA AACCCGCCCT GCAAACCGCC ATTGCCGAAG 

101 CGCGCGAACA AATCGCCGCC ATCAAAGCCC AAACGCACAC CGGCTGGGCA 

151 AACACTGTCG AACCCCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 

201 GGGCGTGGTG TCGCACCTCA ACTCCGTCAC CGACACGCCC GAACTGCGCG 

251 CCGCCTACAA TGAATTAATG CCCGAAATTA CCGTCTTCTT CACCGAAATC 

301 GGACAAGACA TCGAGCTGTA CAACCGCTTC AAAAC C AT C A AAAACTCCCC 

351 CGAGTTCGAC ACCCTCTCCC ACGCGCAAAA AACCAAACTC AACCACGATC 

4 01 TGCGCGATTT CGTCCTCAGC GGCGCGGAAC TGCCGCCCGA ACAGCAGGCA 

4 51 GAATTGGCAA AACTGCAAAC CGAAGGCGCG CAACTTTCCG CCAAATTCTC 

501 CCAAAACGTC CTAGACGCGA CCGACGCGTT CGGCATTTAC TTTGACGATG 

551 CCGCACCGCT TGCCGGCATT CCCGAAGACG CGCTCGCCAT GTTTGCCGCT 

601 GCCGCGCAAA GCGAAGGCAA AACAGGCTAC AAAATCGGTT TGCAGATTCC 

651 GCACTACCTC GCCGTCATCC AATACGCCGA CAACCGCAAA CTGCGCGAAC 

7 01 AAATCTACCG CGCCTACGTT ACCCGCGCCA GCGAGCTTTC AGACGACGGC 

7 51 AAATTCGACA ACACCGCCAA CATCGACCGC ACGCTCGAAA ACGCCCTGCA 

801 AACCGCCAAA CTGCTCGGCT TCAAAAACTA CGCCGAATTG TCGCTGGCAA 

851 CCAAAATGGC GGACACCCCC GAACAAGTTT TAAACTTCCT GCACGACCTC 

901 GCCCGCCGCG CCAAACCCTA CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 

951 CTTCGCCCGC GAAAGCCTCG GCCTCGCCGA TTTGCAACCG TGGGACTTGG 

1001 GCTACGCCGG CGAAAAACTG CGCGAAGCCA AATACGCATT CAGCGAAACC 

1051 GAAGTCAAAA AATACTTCCC CGTCGGCAAA GTATTAAACG GACTGTTCGC 

1101 C C AAAT C AAA AAACTCTACG GCATCGGATT TACCGAAAAA ACCGTCCCCG 

1151 TCTGGCACAA AGACGTGCGC TATTTTGAAT TGCAACAAAA CGGCGAAACC 

1201 ATAGGCGGCG TTTATATGGA TTTGTACGCA CGCGAAGGCA AACGCGGCGG 

1251 CGCGTGGATG AACGACTACA AAGGCCGCCG CCGTTTTTCA GACGGCACGC 

1301 TGCAACTGCC CACCGCCTAC CTCGTCTGCA ACTTCACCCC GCCCGTCGGC 

1351 GGCAAAGAAG CCCGCTTGAG CCATGACGAA ATCCTCACCC TCTTCCACGA 

1401 AACCGGACAC GGCCTGCACC ACCTGCTTAC CCAAGTCGAC GAACTGGGCG 

14 51 TATCCGGCAT CAACGGCGTA GAATGGGACG CAGTCGAACT GCCCAGTCAG 

1501 TTTATGGAAA ATTTCGTTTG GGAATACAAT GTCTTGGCGC AAATGTCCGC 

1551 CCACGAAGAA ACCGGCGTTC CCCTGCCGAA AGAACTCTTC GACAAAATGC 

1601 TCGCCGCCAA AAACTTCCAA CGCGGAATGT TCCTCGTCCG CCAAATGGAG 

1651 TTCGCCCTCT TT GAT AT GAT GATTTACAGC GAAGACGACG AAGGCCGTCT 

1701 GAAAAACTGG CAACAGGTTT TAGACAGCGT GCGCAAAGAA GTCGCCGTCG 

17 51 TCCGACCGCC CGAATACAAC CGCTTCGCCA ACAGCTTCGG CCACATCTTC 

1801 GCAGGCGGCT ATTCCGCAGG CTATTACAGC TACGCGTGGG CGGAAGTATT 

1851 GAGCGCGGAC GCATACGCCG CCTTTGAAGA AAGCGACGAT GTCGCCGCCA 

1901 CAGGCAAACG CTTTTGGCAG GAAATCCTCG CCGTCGGCGG ATCGCGCAGC 

1951 GCGGCAGAAT CCTTCAAAGC CTTCCGCGGA CGCGAACCGA GCATAGACGC 

2001 ACTCTTGCGC CACAGCGGCT TCGACAACGC GGCTTGA 

This corresponds to the amino acid sequence <SEQ ID 486; ORF 128.a>: 

al28 . pep 

1 MTDNALLHLG EEPRFDQIKT EDIKPALQTA IAEAREQIAA IKAQTHTGWA 
51 NTVEPLTGIT ERVGRIWGW SHLNSVTDTP ELRAAYNELM PEITVFFTEI 
101 GQDIELYNRF KTIKNSPEFD TLSHAQKTKL NHDLRDFVLS GAELPPEQQA 
151 ELAKLQTEGA QLSAKFSQNV LDATDAFGIY FDDAAPLAGI PEDALAMFAA 
201 AAQSEGKTGY KIGLQIPHYL AVIQYADNRK LREQI YRAYV TRASELSDDG 
251 KFDNTANIDR TLENALQTAK LLGFKNYAEL SLATKMADTP EQVLN FLHDL 
301 ARRAKPYAEK D LAE VKA FAR ESLGLADLQP WDLGYAGEKL REAKYAFSET 
351 EVKKYFPVGK VLNGLFAQIK KLYGIGFTEK TVPVWHKDVR YFELQQNGET 
4 01 IGGVYMDLYA REGKRGGAWM NDYKGRRRFS DGTLQLPTAY LVCNFTPPVG 
4 51 GKEARLSHDE ILTLFHETGH GLHHLLTQVD ELGVSGINGV EWDAVELPSQ 
501 FMENFVWEYN VLAQMSAHEE TGVPLPKELF DKMLAAKNFQ RGMFLVRQME 
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551 FALFDMMIYS EDDEGRLKNW QQVLDSVRKE VAVVRPPEYN RFANSFGHIF 
601 AGGYSAGYYS YAWAEVLSAD AYAAFEESDD VAATGKRFWQ EILAVGGSRS 
651 AAESFKAFRG REPSIDALLR HSGFDNAA* 

ml28/al28 66.0% identity in 677 aa overlap 

10 20 30 40 50 60 

ml28 pep MTDNALLHLGEEPRFDQIKTEDIKPALQTAIAEAREQIAAIKAQTHTGWANTVEPLTGIT 

M | | | M ! I I I M I I I I M I M I I 1 I It I M It I I I I i I i i i I M I M I M M I I i I! I I 
al28 MTDNALLHLGEEPRFDQIKTEDIKPALQTAIAEAREQIAAIKAQTHTGWANTVEPLTGIT 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml28 pep ERVGRIWGWSHLNCVADTPELRAVYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD 
| | | | | | | t f 1 I II I I : 1 I M M I : I II I M I I I t ! I I I I I I I M It I I M I I I I I I I I I 
al2 8 ERVGRIWGVVSHLNSVTDTPELRAAYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD 

70 80 90 100 110 120 



130 

ml 2 8 .pep TLS PAQKTKLNH- 



al28 



ml28 .pep 



ml28 .pep 



111 I I M I I I I 

TLSHAQKTKLNHDLRDFVLSGAELPPEQQAELAKLQTEGAQLSAKFSQNVLDATDAFGIY 
130 140 150 160 170 180 



al2 8 FDDAAPLAGIPEDALAMFAAAAQSEGKTGYKIGLQIPHYLAVIQYADNRKLREQIYRAYV 

190 200 210 220 230 240 



al2 8 TRASELSDDGKFDNTANIDRTLENALQTAKLLGFKNYAELSLATPCMADTPEQVLNFLHDL 

250 260 270 280 290 300 

140 150 

ml 2 8 pep YASEKLREAKYAFSETXVKKYFPVGX 

I I : I I I II I I II I I I I ( I I I I I I I 
al2 8 ARRAKPYAEKDIxAEVKAFARESLGLADLQPWDLGYAGEKLREAKYAFSETEVKKYFPVGK 

310 320 330 340 350 360 

160 170 180 190 200 210 

ml 2 8 pep VLNGLFAQXKKL YG I GFTEKT V PVWHKDVRYXE LQQNGEX I GGV YMDL Y AREGKRGGAWM 

MM I I I M I II M I I M M M I I I I I M I II M M I I I I I M M 1 II It M I 

al2 8 VLNGLFAQIKKLYGIGFTEKTVPVWHKDVRYFELQQNGETIGGVYMDLYAREGKRGGAWM 

370 380 390 400 410 420 

220 230 240 250 260 270 

ml28 pep NDYKGRRRFSDGTLQLPTAYLVCNFAPPVGGREARLSHDEILILFHETGHGLHHLLTQVD 

M I II II I I I I I M I I II I M I I I I : I II M M I M I I I I I I I II I II I I I I M 

al2 8 NDYKGRRRFSDGTLQLPTAYLVCNFTPPVGGKEARLSHDEILTLFHETGHGLHHLLTQVD 

430 440 450 4 60 470 480 

280 290 300 310 320 330 

ml 2 8 pep ELGVSGINGVXWDAVELPSQFMENFVWEYNVLAQXSAHEETGVPLPKELXDKXLAAKNFQ 
II M I I M M I I M I I I I I I I I II I II I M I I I II II I I M I II I M M I I I M I I 
al2 8 ELGVSGINGVEWDAVELPSQFMENFVWEYNVLAQMSAHEETGVPLPKELFDKMLAAKNFQ 

490 500 510 520 530 540 

340 350 360 370 380 390 

ml28.pep XGMFXVRQXEFALFDMMIYSEDDEGRLKNWQQVLDSVRKKVAVIQPPEYNRFALSFGHIF 
Ml III I I II II I I I M I M II I M I I I II M 11 II M II : M I II M I I MINI 
a 128 RGMFLVRQME FALFDMMI Y SEDDEGRLKNWQQVLDSVRKEVAWRPPE YNRFAN S FGH I F 

550 560 570 580 590 600 
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400 410 420 430 440 450 

AGGYSAAXYSYAWAEVLSADAYAAFEESDDVAATGKRFWQEILAVGXSRSGAESFKAFRG 

| M | I I : I I I! I M I I I I I M I M I I I I I I I M M I I I I M I I II I M : I I M I II I I 
AGGYSAGYYSYAWAEVLSADAYAAFEESDDVAATGKRFWQEILAVGGSRSAAESFKAFRG 
610 620 630 640 650 660 

460 470 
REPSIDALLRHSGFDNAVX 

I I I I I II I I I I I I I I I I : 
REPSIDALLRHSGFDNAAX 
670 



The following partial DNA sequence was identified in N, gonorrhoeae <SEQ ID 487>: 

gl28-l.seq (partial) 

1 ATGATTGACA ACGCACTGCT CCACTTGGGC GAAGAACCCC GTTTTAATCA 

51 AATCAAAACC GAAGACATCA AACCCGCCGT CCAAACCGCC ATCGCCGAAG 

101 CGCGCGGACA AATCGCCGCC GTCAAAGCGC AAACGCACAC CGG CTGGGCG 

151 AACACCGTCG AG CGTCTG AC CGGCATCACC GAACGCGTCG GCAGGATTTG 

201 GGGCGTCGTG TCCCATCTCA ACTC CGTCGT CGACACGCCC GAACTGCGCG 

251 CCGTCTATAA CGAACTGATG CCTGAAATCA CCGTCTTCTT CACCGAAATC 

301 GGACAAGACA TCGAACTGTA CAACCGCTTC AAAACCATCA AAAATTCCCC 

351 CGAATTTGCA ACGCTTTCCC CCGCACAAAA AACCAAGCTC GATCACGACC 

401 TGCGCGATTT CGTATTGAGC GGCGCGGAAC TGCCGCCCGA ACGGCAGGCA 

4 51 GAACTGGCAA AACTGCAAAC CGAAGGCGCG CAACTTTCCG CCAAATTCTC 

501 CCAAAACGTC CTAGACGCGA CCGACGCGTT CGGCATTTAC TTTGACGATG 

551 CCGCACCGCT TGCCGGCATT CCCGAAGACG CGCTCGCCAT GTTTGCCGCC 

6 01 GCCGCGCAAA GCGAAGGCAA AACAGGTTAC AAAATCGGCT TGCAGATTCC 
651 GCACTACCTT GCCGTTATCC AATACGCCGG CAACCGCGAA CTGCGCGAAC 

7 01 AAATCTACCG CGCCTACGTT ACCCGTGCCA GCGAACTTTC AAACGACGGC 
751 AAATT CGAC A ACACCGCCAA CATCGACCGC ACGCTCGAAA ACGCATTGAA 
801 AACCGCCAAA CTGCTCGGCT TTAAAAATTA CGCCGAATTG TCGCTGGCAA 
851 CCAAAATGGC GGACACGCCC GAACAGGTTT TAAACTTCCT GCACGACCTC 
901 GCCCGCCGCG CCAAACCCTA CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 
951 CTTCGCCCGC GAACACCTCG GTCTCGCCGA CCCGCAGCCG TGGGACTTGA 

1001 GCTACGCCGG CGAAAAACTG CGCGAAGCCA AATACGCATT CAGCGAAACC 

1051 GAAGTCAAAA AATACTTCCC CGTCGGCAAA GTTCTGG C AG GCCTGTTCGC 

1101 CCAAATCAAA AAACTCTACG GCATCGGATT CGCCGAAAAA ACCGTTCCCG 

1151 TCTGGCACAA AGACGTGCGC TATTTTGAAT TGCAACAAAA CGGCAAAACC 

1201 ATCGGCGGCG TTTATATGGA TTTGTACG C A CGCGAAGGCA AACGCGGCGG 

12 51 CGCGTGGATG AACGACTACA AAGGCCGCCG CCGCTTTGCC GACGGCACGC 
1301 TGCAACTGCC CACCGCCTAC CTCGTCTGCA ACTTCGCCCC GCCCGTCGGC 

13 51 GGCAAAGAAG CGCGTTTAAG CCACGACGAA ATCCTCACCC TCTTCCACGA 

14 01 AACCGGCCAC GGACTGCACC ACCTGCTTAC CCAAGTGGAC GAACTGGGCG 
1451 TGTCCGGCAT CAACGGCGTA AAA 

This corresponds to the amino acid sequence <SEQ ID 488; ORF 128-1. ng>: 

gl28-l.pep (partial) 

1 MIDNALLHLG EEPRFNQIKT EDIKPAVQTA IAEARGQIAA VKAQTHTGWA 

51 NTVERLTGIT ERVGRIWGW SHLNSWDTP ELRAVYNELM PEITVFFTEI 

101 GQDIELYNRF KTIKNSPEFA TLSPAQKTKL DHDLRDFVLS GAELPPERQA 

151 ELAKLQTEGA QLSAKFSQNV LDATDAFGIY FDDAAPLAGI PEDALAMFAA 

201 AAQSEGKTGY KIGLQIPHYL AVIQYAGNRE LREQIYRAYV TRASEL.SNDG 

251 KFDNTANIDR TLENALKTAK LLGFKNYAEL SLATKMADTP EQVLNFLHDL 

301 ARRAKPYAEK DLAEVKAFAR EHJJGLADPQP WDLSYAGEKL REAKYAFSET 

3 51 EVKKYFPVGK VLAGLFAQIK KLYGIGFAEK TVPVWHKDVR YFELQQNGKT 
401 IGGVYMDLYA REGKRGGAWM NDYKGRRRFA DGTLQLPTAY LVCNFAPPVG 

4 51 GKEARLSHDE ILTLFHETGH GLHHLLTQVD ELGVSGINGV K 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 489>: 

ml 2 8 - 1 . seq 

1 ATGACTGACA ACGCACTGCT CCATTTGGGC GAAGAACCCC GTTTTGATCA 
51 AATCAAAACC GAAGACATCA AACCCGCCCT GCAAACCGCC ATCGCCGAAG 
101 CGCGCGAACA AATCGCCGCC ATCAAAGCCC AAACGCACAC CGGCTGGGCA 
151 AACACTGTCG AACCCCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 



ml28 . pep 
al28 

ml28 .pep 
a!28 
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2 01 GGGCGTGGTG TCGCACCTCA ACTCCGTCGC CGACACGCCC GAACTGCGCG 

2 51 CCGTCTATAA CGAACTGATG CCCGAAATCA CCGTCTTCTT CACCGAAATC 

3 01 GGACAAGACA TCGAGCTGTA CAACCGCTTC AAAACCATCA AAAATTCCCC 

3 51 CGAATTCGAC ACCCTCTCCC CCGCACAAAA AACCAAACTG AACCACGATC 

4 01 TGCGCGATTT CGTCCTCAGC GGCGCGGAAC TGCCGCCCGA ACAGCAGGCA 

4 51 GAACTGGCAA AACTGCAAAC CGAAGGCGCG CAACTTTCCG CCAAATTCTC 

5 01 CCAAAACGTC CTAGACGCGA CCGACGCGTT CGG CATTT AC TTTGACGATG 
551 CCGCACCGCT TGCCGGCATT CCCGAAGACG CGCTCGCCAT GTTTGCCGCC 

6 01 GCCGCGCAAA GCGAAAGCAA AACAGGCTAC AAAATCGGCT TGCAGATTCC 
6 51 ACACTACCTC GCCGTCATCC AATACGCCGA CAACCGCGAA CTGCGCGAAC 
701 AAATCTACCG CGCCTACGTT ACCCGCGCCA GCGAACTTTC AGACGACGGC 
751 AAATTCGACA ACACCGCCAA CATCGACCGC ACG CTCGCAA ACGCCCTGCA 
801 AACCGCCAAA CTGCTCGGCT TCAAAAACTA CGCCGAATTG TCGCTGGCAA 
8 51 CCAAAATGGC GGACACGCCC GAACAAGTTT TAAACTTCCT GCACGACCTC 
901 GCCCGCCGCG CCAAACCCTA CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 
951 CTTCGCCCGC GAAAGCCTGA ACCTCGCCGA TTTGCAACCG TGGGACTTGG 

1001 GCTACGCCAG CGAAAAACTG CGCGAAGCCA AATACGCGTT CAGCGAAACC 

1051 GAAGTCAAAA AATACTTCCC CGTCGGCAAA GTATTAAACG GACTGTTCGC 

1101 CCAAATCAAA AAACTCTACG G CAT CGG ATT TACCGAAAAA ACCGTCCCCG 

1151 TCTGGCACAA AGACGTGCGC TATTTTGAAT TGCAACAAAA CGGCGAAACC 

1201 ATAGG CGGCG TTTATATGGA TTTGT ACG C A CGCGAAGGCA AACGCGG CGG 

1251 CGCGTGGATG AACGACTACA AAGGCCGCCG CCGTTTTTCA GACGGCACGC 

1301 TGCAACTGCC CACCGCCTAC CTCGTCTGCA ACTTCGCCCC ACCCGTCGGC 

1351 GGCAGGGAAG CCCGCCTGAG CCACGACGAA ATCCTCATCC TCTTCCACGA 

14 01 AACCGGACAC GGG CTGCACC ACCTGCTTAC C C AAGTGG AC GAACTGGGCG 

14 51 TATCCGGCAT CAACGGCGTA GAATGGGACG CGGTCGAACT GCCCAGCCAG 

15 01 TTTATGGAAA ATTTCGTTTG GGAATACAAT GTCTTGGCAC AAATGTCAGC 

15 51 CCACGAAGAA ACCGGCGTTC CCCTGCCGAA AG AACTCTT C GACAAAATGC 
1601 TCGCCGCCAA AAACTTCCAA CGCGGCATGT TCCTCGTCCG GCAAATGGAG 

16 51 TTCGCCCTCT TTGATATGAT GATTTACAGC GAAGACGACG AAGGCCGTCT 
1701 GAAAAACTGG CAACAGGTTT TAGACAGCGT GCGCAAAAAA GTCGCCGTCA 
1751 TCCAGCCGCC CGAATACAAC CGCTTCGCCT TGAGCTTCGG CCACATCTTC 
18 01 GCAGGCGGCT ATTCCGCAGG CTATTACAGC TACGCGTGGG CGGAAGTATT 
1851 GAGCGCGGAC GCATACGCCG CCTTTGAAGA AAGCGACGAT GTCGCCGCCA 
1901 CAGGCAAACG CTTTTGGCAG GAAATCCTCG CCGTCGGCGG ATCGCGCAGC 
1951 GCGG CAGAAT CCTTCAAAGC CTTCCGCGGC CGCGAACCGA GCATAGACGC 
2001 ACTCTTGCGC CACAGCGGTT TCGACAACGC GGTCTGA 

This corresponds to the amino acid sequence <SEQ ID 490; ORF 128-1>: 

ml28-l.pep. 

1 MTDNALLH LG EEPRFDQIKT EDIKPALQTA IAEAREQIAA IKAQTHTGWA 

51 NTVEPLTGIT ERVGRIWGW SHLNSVADTP ELRAVYNELM PEITVFFTEI 

101 GQDIELYNRF KTIKNSPEFD TLS PAQKTKli NHDLRD FVLS GAELPPEQQA 

151 E LAKIiQTEG A QLSAKFSQNV LDATDAFGIY FDDAAPLAGI PEDAIAMFAA 

2 01 AAQSESKTGY KIGLQIPHYL AVIQYADNRE LREQIYRAYV TRASELSDDG 

2 51 KFDNTANIDR TLANALQTAK LLGFKNYAEL SLATKMADTP EQVLNFLHDL 

3 01 ARRAKPYAEK DLAEVKAFAR ESLNLADLQP WDLGYASEKL REAKYAFSET 
351 EVKKYFPVGK VLNGLFAQIK KLYGIGFTEK TVPVWHKDVR YFELQQNGET 

4 01 I GG VYMDLYA REGKRGGAWM NDYKGRRRFS DGTLQLPTAY LVCNFAPPVG 

4 51 GREARLSHDE ILILFHETGH GLHHLLTQVD ELGVSGINGV EWDAVELPSQ 

5 01 FMENFVWEYN VLAQMSAHEE TGVPLPKELF DKMLAAKNFQ RGMFLVRQME 
5 51 FALFDMMI YS EDDEGRLKNW QQVLDSVRKK VAVIQPPEYN RFALSFGHIF 
601 AGGYSAGYYS YAWAEVLSAD AYAAFEESDD VAATGKRFWQ EILAVGGSRS 
651 AAE S FKAFRG REPS IDALLR HSGFDNAV* 

ial28- l/gl28-l 94.5% identity in 491 aa overlap 

10 20 30 40 50 60 

Q128 - 1 pep M I DNALLH LG E E PRFNQ I KTE D I KP A VQT AI AE ARGQ I AAVKAQTHTGWANTVERLTG I T 

I IMIIIIIMIIhllllllllihllllllll lllhlllllllllllll Hill 

ml28 - 1 MTDNALLH LG E E PRFDQ I KTE D I K P ALQTAI AEAREQ I AA I KAQTHTGW ANTVE PLTG I T 

10 20 30 40 50 60 

70 80 90 100 110 120 

gl28-l pep ERVGRIWGWSHLNSWDTPELRAVYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFA 

Miii in hum ihini i ii hum iniiiii in i mill! mi i ii 1 1 1 

ml28-l ERVGRIWGVVSHLNSVADTPELRAVYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD 
70 80 90 100 110 120 
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130 140 150 160 170 180 

ql28-l pep TLSPAQKTKlJDHDLRDFVLSGAELPPERQAEIAKIiQTEGAQLSAKFSQNVLDATDAFGIY 

I I I M Ml M M I I M M I I M M MMMM MM M II MM M MmM M I MM 1 1 

ml28-l TLSPAQICTKLNHDLRDFVLSGAELPPEQQAELAKLQTEGAQLSAKFSQNVLDATDAFGIY 
130 140 150 160 170 180 

190 200 210 220 230 240 

ql 2 8 - 1 pep FDD AA P LAG I PEDALAMFAAAAQSEGKTGYKIGLQI PHYLAVIQYAGNRELREQI YRAYV 

lllllllllllllllllillllllhlllllllllllllllllM! liMIIIIIIIH 

ml 2 8 - 1 FDD AA P LAG I PEDALAMFAAAAQSESKTGYKIGLQI PHYLAVIQYADNRELREQI YRAYV - 

190 200 210 220 230 240 

250 260 270 280 290 300 

ql28-l pep TRASELSNDGKFDNTANIDRTLENALKTAKLLGFKNYAELS LATKMADTPEQVLNFLHDL 

I I I II MMI Mi MMM M I ll>:|lllllllllllllll!llllllkllll!llll 

ml 2 8 - 1 TPJ^SELSDDGKFT3OTANIDRTLANALQTAKLIX3FKNYAELSIAT 

250 260 270 280 290 300 

310 320 330 340 350 360 

ql28-l pep ARRJUCPYAEKDIAEVKAFAREHLGLADPQPWDLSYAGEKIJ^EAK 

lllllllllllllllllllll hill IIIIMhlllllllMMIIIIIIUMII 

m i28 - 1 ARRAKPYAEKI)LAE\nCAFARESLNLADLK3PWDLGYASEKLREAKYAFSETE 

310 320 330 340 350 360 

370 380 390 400 410 420 

ql 2 8 - 1 pep VIAGLFAQIKKLYGIGFAEKTVPVWHKDVRYFELQQNGKTIGGVYMDLYAREGKRGGAWM 

II lllllllllllllhl MUM MMI I I I M MM III I MUM II M i : 

ml28-l VLNG L FAQ I KKL YG I G FT E KTV P WJHKD VRY F E LQQNG ET I GG VYMDL Y AREG KRGG AWM 

370 380 390 400 410 420 

430 440 450 460 470 480 

ql2 8 - 1 pep NDYKGRRRFATK3TLQLPTAYLVCNFAPPVGGKEARLSHDEILTLFHETGHGLHHLLTQVD 

MIMMMMIMMMMIMI Ml MMMMIMM I 1 1 II I 1 1 1 I I I I I I I I 

ml 2 8 - 1 NDYKGRRRFSDGT3jQLPTAYXiVCNFAP PVGGREARLSHDE I LI LFHETGHGLHHLLTQVD 

430 440 450 460 470 480 



490 
E LGVSG I NGVK 

MIIIMMM 

E LGVSG I NG VEWDAVE L P SQFMEN FVWE YNVLAQMS AHE ETG VPL P KE L FDKMLAAKN FQ 
490 500 510 520 530 540 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 491>: 

al28-l.seq 

1 ATGACTGACA ACGCACTGCT CCATTTGGGC GAAGAACCCC GTTTTGATCA 

51 AATCAAAACC GAAGACATCA AACCCGCCCT GCAAACCGCC ATTGCCGAAG 

101 CGCGCGAACA AATCGCCGCC ATCAAAGCCC AAACGCACAC CGGCTGGGCA 

151 AACACTGTCG AACCCCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 

201 GGG CGTGGTG TCGCACCTCA ACTCCGTCAC CGACACGCCC G AACTG CG CG 

251 CCGCCTACAA TGAATTAATG CCCGAAATTA CCGTCTTCTT CAC CG AAATC 

301 GGACAAGACA TCGAGCTGTA CAACCGCTTC AAAACCATCA AAAACTCCCC 

3 51 CGAGTTCGAC ACCCTCTCCC ACGCGCAAAA AACCAAACTC AACCACGATC 

4 01 TGCGCGATTT CGTCCTCAGC GGCGCGGAAC TGCCGCCCGA ACAGCAGGCA 
4 51 GAATTGGCAA AACTGCAAAC CGAAGGCGCG CAACTTTCCG CCAAATTCTC 
501 CCAAAACGTC CTAGACGCGA CCGACGCGTT CGGCATTTAC TTTGACGATG 
551 CCGCACCGCT TGCCGGCATT CCCGAAGACG CGCTCGCCAT GTTTGCCGCT 
601 GCCGCGCAAA GCGAAGGCAA AACAGGCTAC AAAATCGGTT TGCAGATTCC 
651 GCACTACCTC GCCGTCATCC AATACGCCGA CAACCGCAAA CTGCGCGAAC 
701 AAATCTACCG CGCCTACGTT ACCCGCGCCA GCG AGCTTT C AGACGACGGC 
751 AAATTCGACA ACACCGCCAA CATCGACCGC ACG CTCGAAA ACGCCCTGCA 
801 AACCGCCAAA CTGCTCGGCT TCAAAAACTA CGCCGAATTG TCG CTGGCAA 
851 CCAAAATGGC GGACACCCCC GAACAAGTTT TAAACTTCCT GCACGACCTC 
901 GCCCGCCGCG CCAAACCCTA CG C CG AAAAA GACCTCGCCG AAGTCAAAGC 
951 CTTCGCCCGC GAAAGCCT CG GCCTCGCCGA TTTGCAACCG TGGGACTTGG 

1001 GCTACGCCGG CGAAAAACTG CGCGAAGCCA AATACGCATT CAGCGAAACC 

1051 GAAGTCAAAA AATACTTCCC CGTCGGCAAA GTATTAAACG GACTGTTCGC 

1101 CCAAATCAAA AAACTCTACG GCATCGGATT TACCGAAAAA ACCGTCCCCG 

1151 TCTGGCACAA AGACGTGCGC TATTTTGAAT TGCAACAAAA CGGCGAAACC 

1201 ATAGG CGGCG TTTATATGGA TTTGTACGCA CGCGAAGGCA AACGCGGCGG 
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ml28-l 
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12 51 CGCGTGGATG AACGACTACA AAGGCCGCCG CCG TTTTTCA GACGGCACGC 
1301 TGCAACTGCC CACCGCCTAC CTCGTCTGCA ACTTCACCCC GCCCGTCGGC 

13 51 GGCAAAGAAG CCCGCTTGAG CCATGACGAA ATCCTCACCC TCTTCCACGA 
. 14 01 AACCGGACAC GGCCTGCACC ACCTG CTT AC CCAAGTCGAC GAACTGGGCG 

14 51 TATCCGGCAT CAACGGCGTA GAATGGGACG CAGTCGAACT GCCCAGTCAG 
1501 TTTATGGAAA ATTTCGTTTG GGAATACAAT GTCTTGGCGC AAATGTCCGC 
1551 CCACGAAGAA AC CGG CGTT C CCCTGCCGAA AG AACTCTT C GACAAAATGC 
1601 TCGCCGCCAA AAACTTCCAA CGCGGAATGT TCCTCGTCCG CCAAATGGAG 
1651 TTCGCCCTCT TTGATATGAT GATTT AC AG C GAAGACGACG AAGGCCGTCT 
1701 GAAAAACTGG CAACAGGTTT TAGACAGCGT GCGCAAAGAA GTCGCCGTCG 
1751 TCCGACCGCC CGAATACAAC CGCTTCGCCA ACAGCTTCGG CCACATCTTC 
1801 GCAGGCGGCT ATTCCGCAGG CTATTACAGC TACGCGTGGG CGGAAGTATT 
1851 GAGCGCGGAC GCATACGCCG CCTTTGAAGA AAGCGACGAT GTCGCCGCCA 
1901 CAGGCAAACG CTTTTGGCAG GAAATCCTCG CCGTCGGCGG ATCGCG CAGC 
1951 GCGGCAGAAT CCTTCAAAGC CTTCCGCGGA CGCGAACCGA GCATAGACGC 
2001 ACTCTTGCGC CACAGCGGCT TCGACAACGC GGCTTGA 

This corresponds to the amino acid sequence <SEQ ED 492; ORF 128-l.a>: 

al28-l.pep 

1 MTDNALLHIiG EEPRFDQIKT EDIKPALQTA IAEAREQIAA IKAQTHTGWA 

51 NTVEPLTGIT ERVGRIWGW SHLNSVTDTP ELRAAYNELM PEITVFFTEI 

101 GQDIELYNRF KTIKNSPEFD TLSHAQKTKL NHDLRDFVLS GAELPPEQQA 

151 ELAKLQTEGA QLSAKFSQNV LDATDAFG I Y FDDAAPLAGI P EDALAMFAA 

201 AAQSEGKTGY KIGLQIPHYL AVIQYADNRK LREQIYRAYV TRASELSDDG 

251 KFDNTANIDR TLENALQTAK LLGFKNYAEL SLATKMADTP EQVLNFLHDL 

301 ARRAKPYAEK DLAEVKAFAR ESLGLADLQP WDLGYAGEKL REAKYAFSET 

3 51 EVKKYFPVGK VLNGLFAQIK KLYGIGFTEK TVPVWHKDVR YFELQQNGET 

4 01 I GG VYMDLYA REGKRGGAWM NDYKGRRRFS DGTLQLPTAY LVCNFTPPVG 
451 GKEARLSHDE ILTLFHETGH GLHHLLTQVD ELGVSGINGV EWDAVELPSQ 
501 FMENFVWEYN VLAQMSAHEE TGVPLPKELF DKMLAAKNFQ RGMFLVRQME 
551 FALFDMMIYS EDDEGRLKNW QQVLDSVRKE VAWRPPEYN RFANSFGHIF 
6 01 AGGYSAGYYS YAWAEVLSAD AYAAFEESDD VAATGKRFWQ EILAVGGSRS 
651 AAESFKAFRG REPSIDALLR HSGFDNAA* 

ml28-l/al28-l 97.8% identity in 677 aa overlap 

10 20 30 40 50 60 

al28 - 1 pep MTDNALLHLGE E PRFDQ I KTED I KP AliQT AI AE AREQ I AAI KAQTHTG W ANTVE PLTG I T 

|| I I || I I I I I II Mill Ml II I III 'M i I II Mil IN Ml II ! I I il I III !l! II 

Ttll2 8 - 1 MTDNALLHLGEE PRFDQ I KTED I KPALQTAI AEAREQ I AAI KAQTHTGWANTVE PLTG I T 

10 20 30 40 50 60 

70 80 90 100 110 120 

al28-l pep ERVGRIWGWSHLNSVTDTPELRAAYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD 

IMIMMIMM MMIIMIMMIMIMIMMIMMM II MIMIIIIMIM 

ml 2 8 - 1 ERVGRIWGWSHLNSVADTPELRAVYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD 
70 80 90 100 110 120 

130 140 150 160 170 180 

al28-l pep TLSHAQKTKI<NHDLRDFVIiSGAELPPEQQAELAKIiQTEGAQLSAJCFSQNvli 

Ml Ml I I I Ml M Ml III M I IM II IM I I MM MMIIII II! Ml III M I M 

ml2 8 - 1 TLSPAQKTKI^mDLRDFVLSGAELPPEC^AEIAKl^EGAQLSAKFSQNVLDATDAFGIY 
130 140 150 160 170 180 

190 200 210 220 230 240 

al 2 8 - 1 pep FDDAAPLAGI P ED ALAMFAAAAQS EG KTGYKIGLQ I PHYLAVIQYADNRKLREQI YRAYV 
I I I | 1 I I I I I I 1 I I 1 1 I I I I I 1 I I I = I I 1 1 I I I I 1 I 1 t I I 1 1 1 1 I 1 I I I = 1 I I 1 I I 1 1 t I 
ml28-l FDD AA P LAG I P ED A1AM FAAAAQ S ES KTG YKI G LQ I PHYLA VI Q YADN RELR E Q I YRAYV 

190 200 210 220 230 240 

250 260 270 280 290 300 

al2 8-l pep TRASELSDDGKFDtn , ANIDRTLENALQTAKI J IX5FKNYAELSLATKMADTPEQV^ 

Ml M I MM Ml MM IIMi 1 1 II 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 I I I I 

ml 2 8 - 1 TRAS E LS DDG KFDNT AN I DRT LANALQT AKLLG FKNY AE LS LAT KMADT PEQ VLNF LHD L 

250 260 270 280 290 300 

310 320 330 340 350 360 

al2 8 - 1 - pep AR RAKPYAEKDLA£VXA FARES 1X3 LAD I^PWDZ^ 

MM MM IM M I Ml! II I I MIM M M II MIMMMIMM M II M IM MM 
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ml28-l ARRAKPYAEKDLAEVKAFARESLNLADLQPWDI^YASEKIjREIAKYAFSETEVKKYFPVGK 
310 320 330 340 350 360 

370 ' 380 390 400 - 410 4~20 

al 2 8 - 1 pep VLNGL F AQ I KXL YG I G FT E KTVP VWHKD VR Y F E 1/QQNG E T I GG VYMD L Y ARE G KRGG AWM 
| I 1 | 1 t I 1 I I t 1 1 1 I 1 I 1 I 1 1 I 1 I 1 1 I I I 1 1 t I 1 t 1 I I I t I 1 t I t I I 1 I i 1 1 1 I I 1 1 1 1 I 
ml28-l vi^GLFAQIKKLYGIGFTEKTVPVWHKDVRYFEU^NGETIGGVYMDLYAREGKRGGAWM 
370 380 390 400 410 420 

430 440 450 460 470 480 

al2 8 - 1 pep ND Y KG RRR FSDGTLQL PTAYL V CN FT P P VGG KEARLS HDE I LTLFH E TGHG LHHLLTQ VD 

I f 1 I [ 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 f ^ f 1 1 1 1 1 ! 1 M 1 1 1 1 1 lllllllllllllllll 

ml 2 8 - 1 ND YKGRRR FS DGT LiQL PT A YIj V CN F A P P VGGREARLiS HD E I L I LFH ETGHG LHH LLTQ VD 

430 440 450 460 470 480 

490 500 510 520 530 540 

al28-l pep E LG VS G I NGVEWD AVE LPS QFMEN FVWE YNVLAQMS AHE ETG VP LP KE LFDKMLAAKN FQ 

IMIIIIIIIMIIIIIIIIINIMIIIIIIIIlllMlllllllllllllllllllll 

ml28-l EixsvSGINGVEWDAVELPSQFMENFWEYNVLAQMSAHEETGVPLPKELFT^KMLAAKNFQ 
490 500 510 520 530 540 

550 560 570 580 590 600 

al2 8 - 1 pep RGMFLVRQMEFALFDMMI YSEDDEGRLKNWQQVLDSVRKEVAWRPPEYNRFANSFGHIF 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 = 1 1 1 = = I i i 1 1 i 1 1 llllll 

ml2 8 - 1 RGMFLVRQMEFALFDMMI YSEDDEGRLKNWQQVLDSVRKKVAVIQPPEYNRFALSFGHIF 

550 560 570 580 590 600 

610 620 630 640 650 660 

al2 8 - 1 pep aggysagyysyawaevlsadayaafeesddvaatgkrfwqeilavggsrsaaesfkafrg 
lllllillllllMIIMIIIIIIMIiillllllllMIIIIIIIIIIIMIIIIIMI 
mi28-i aggysagyysyawaevlsadayaafeesddvaatgkrfwqeilavggsrsaaesfkafrg 

610 620 630 640 650 660 

670 679 
al28-l .pep REPSIDALLRHSGFDNAAX 

1 k 1 1 1 1 L f 1 1 1 1 1 1 1 1 1 ^ 

ml28-l REPSIDALLRHSGFDNAVX 
670 

al28-l/ P44573 

sp|P44573|OPDA_HAEIN OLIGOPEPTIDASE A >gi | 1075082 | pir | | C64 055 oligopeptidase A (prlC) homolog 
- Haemophilus influenzae (strain Rd KW20) 

>gi | 1573174 (U32706) oligopeptidase A (prlC) [Haemophilus influenzae Rd] Length = 681 
Score = 591 bits (1507), Expect = e-168 

Identities = 309/677 (45%), Positives = 415/677 (60%) , Gaps = 4/677 (0%) 

Query: 4 NALLHLGEEPRFDQIKTEDIKPALQTXXXXXXXXXXXXXXXTHTGWANTVEPLTGITERV 63 

N LL+ + P F QIK E I + PA++ H W N + PLT +R+ 

Sbjct: 5 NPLLNIQGLPPFSQIKPEHIRPAVEKLIQDCRNTIEQVLKQPHFTWENFILPLTETNDRL 64 

Query: 64 GRIWGWSHLNSVTDTPELRAAYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFDTLS 123 

R W VSHLNSV ++ ELR AY +P ++ + T +GQ LYN + +KNS EF S 
Sbjct: 65 NRAWSPVSHLNSVKNSTELREAYQTCLPLLSEYSTWVGQHKGLYNAYLALKNSAEFADYS 124 

Query: 124 HAQKTKLNHDIJIDFVLSGAELPPEQQAELAKMTEGA 183 

AQK + + LRDF LSG L E+Q ++ ++L+ ++FS NVLDAT + ++ 
Sbjct: 125 IAQKKAIENSLRDFELSGIGLSEEKQQRYGEIVARLSELNSQFSNNVLDATMGWEKLIEN 184 

Query: 184 AAPIAGIPEDALAMFAAAAQSEGKTGYKIGLQIPHYLAVIQYADNRKLREQIYRAYVTRA 243 

A LAG+PE AL +A+S+G GY+ L+IP YL V+ Y +NR LRE++YRAY TRA 

Sbjct: 185 EAEIAGLPESALQAAQQSAESKGLKGYRFTLEIPSYLPVMTYCENRALREEMYRAYATRA 244 

Query: 244 SELSDD-GKFDNTANIDRTLENAI^AKLLGFKNYAEI^IA^ 302 

SE + GK+DN+ ++ L ++ AKLLGF Y ELSLATKMA+ P+QVL+FL LA 
Sbjct: 24 5 S EQG PNAGKWDNS KVME E I LTLR VELAKLLG FNT YTELS LATKMAEN PQQVLDFLDHLAE 304 

Query: 303 RAKP YAEKDLAEVKAFARES l^LMLQPWDLGYAGEKLREAKYAFS ETEVKKYF PVGKVL 362 

RAKP EK+L E+K + + G+ +L PWD+G+ EK ++ YA ++ E++ YFP +V+ 
Sbjct: 305 RAKPQGE KE LQE LKG YCEKE FG VTE LAPWD I GFY S E KQ KQHLY A I ND EELRPYFP ENRV I 364 
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Query: 363 NG L FAQ I KKL YG I G FT E - KTVP VWH KD VR Y F E L - QQNG E T I GG VYMDL Y AREG KRGGAWM 420 

+GLF IK+ + + I E K V WHKDVR+F+L +N + G Y+DLYARE KRGGAWM 
Sbjct: 365 SGLFELIKRIFNIRAVERKGVDTWHKDVRFFDLIDENDQLRGSFYLDLYAREHKRGGAWM 424 

Query: 421 ^3DYKGRRRFSDGTLQLPTAYLVCNFTPPVGGKEARLSHDErXXXXXXXXXXXXXXXXQVD 4 80 

+D GR+R DG++ + P AYL CNF P+G K A +H+E+ Q+D 
Sbjct: 425 DDC I GRKRKliDG S I ET PV A YLTCNFN AP I GN KP AL F THNE VTTLFH EFGHG I HHMLTQ I D 4 84 

Query: 481 ELGVSGINGVEWDAVELPSQFMENFWEY>mAQMSAHEETGVPLPKELFDKMIAAKNFQ 540 

V+GINGV WDAVELPSQFMEN+ WE LA +S H ETG PLPKE ++L AKNFQ 
Sbjct: 485 VSDVAGINGVPWDAVELPSQFMENWCWEEEALAFISGHYETGEPLPKEKIiTQLLKAKNFQ 544 

Query: 541 RGMFLVRQMEFALFDMMIYSEDDEGRIJaWC^VLDSVRKEVAVVRPPEYNRFANSFGHIF 600 

MF++RQ+EF +FD + + D + L SV+ +VAV++ ++ R +SF HIF 

Sbjct: 545 AAMFILRQLEFGIFDFRLHHTFDAEKTNQILDTLKSVKSQVAVIKGVDWARAPHSFSHIF 604 

Query: 6 01 XXXXXXXXXXXXWAEVLSADAYAAFEESDDV-AATGKRFWQEIIiAVGGSRSAAESFKAFR 65 9 

WAEVLSADAY+ FEE TGK F EIL GGS E FK FR 

Sbjct: 605 AGGYAAGYYSYLWAEVLSADAYSRFEEEGIFNPITGKSFLDEILTRGGSEEPMELFKRFR 664 

Query: 660 GREPSIDALLRHSGFDN 676 

GREP +DALLRH G N 
Sbjct: 665 GREPQLDALLRHKGIMN 681 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 493>: 

g!29 . seq 

1 ATGCTTTCAC CTCCTCGGCG TAAAACGGCG GCACATCAAT CAAGCCGTCT 

51 TTCATTTGCG TGCGGAAAAA ATGCGGCGTG TTGCCGTGAT CAAAATCAAT 

101 ATCGTGCAGC ATCCAGCCCA AATCGCGGTT TGCCTCGCTT TCCGATAACG 

151 CCGACGGCGG CAGCGGTTCA CCCTTATCCG CGCTTTCGCC ATTTGCCCTT 

2 01 TCAGGCTGCG GGCATAGGGG CGGAACAGGC GGCGGTCGAA TCCTGTTTCA 

2 51 TCCGGACAAA CGCGTTGGCA GTCGGAAAAT CCGGCCGGCC GTGTCAAATA 

3 01 ATGCGTTACT TTGGCCGGGT CTTGTCCTTT GTAAGCGGCG GTCTTTTTTT 
351 GCGCGCCATC CGCATCTGTT TGGGCGCATG GCAAACGGCG GCTGCCGTAC 

4 01 AATCAAAATG TTTGGCGATT TCATGCAGAC AGGCATCCGG ATGCCGCCCG 
4 51 ACATATCGAG CCGGTTTTTG CCTATCCGAT TTGGCGGCAT TTAGGCCGGT 
501 AACTTGA 

This corresponds to the amino acid sequence <SEQ ID 494; ORF 129.ng>: 

gl29 -pep 

1 MLSPPRRKTA AHQSSRLSFA CGKNAACCRD QNQYRAASSP NRGLPRFPIT 

51 PTAAAVHPYP RFRHLPFQAA GIGAEQAAVE SCFIRTNALA VGKSGRPCQI 

101 MRYFGR VLSF VSGGLFLRAI RIC LGAWQTA AAVQSKCLAI SCRQASGCRP 

151 TYRAGFCLSD LAAFRPVT* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 495>: 

ml29.seq (partial) 

1 . . TATCTGCGCT TTCACTATTT GCCCTTTCAG GCTGCGGGCA TAGGGACGGA 

51 ACAGGTAGCG GTCAAATCCT GTTTCATCCA AATAAACACG TTGGTAGTCG 

101 GAAAATTCGG CCGGCTGTGT CAAATAATGC GTTACTTTGG CCGGGTCTTG 

151 TTCTTTGTAA GTGGTGGTCT TTTTTTGCGC GTTATCCCCA T CTGTTTGAG 

201 TGCATAGCAA ATGGTGG CTG CCGTACAATC AAAATGTTTG GCGATTTCAT 

2 51 GCAGATAGGC ATCCGGGTGT TGCCCAACAT ATTGAGCCGG TTTTTGCCTA 

301 TCCGATTTGA CGGCATTTAG ACCGGTAACT TGA 

This corresponds to the amino acid sequence <SEQ ID 496; ORF 129>: 

ml29.pep (partial) 

1 . YLRFHYLPFQ AAGIGTEQVA VKSCFIQINT LWGKFGRLC QIMRYFGRVL 

51 FFVSGGLFLR VIPIC LSAXQ MVAAVQSKCL AISCRXASGC CPTYXAGFCL 

101 SDLTAFRPVT * 

Computer analysis of this amino acid sequence gave the following results: 
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Homology with a predicted ORF from N. gonorrhoeae 

ORF 129 shows 79.1% identity over a 1 10 aa overlap with a predicted ORP (ORF 129.ng) 
from N. gonorrhoeae: 

m!29/gl29 

10 20 30 

ml 2 9 pep YLRFHYLPFQAAGIGTEQVAVKSCFIQINT 

I I h = II I I I I I M : I I : I h M I h I = 
gl2 9 rdqnqYRAASSPNRGLPRFPITPTAAAVHPYPRFRHLPFQAAGIGAEQAAVESCFIRTNA 
30 40 50 60 70 80 

40 50 60 70 80 90 

ml2 9 pep LWGKFGRLCQIMRYFGRVLFFVSGGLFLRVIPICLSAXQMVAAVQSKCLAISCRXASGC 

hill II IMIIIMIII Mllllllhl llhl I ^ M I I I M I 1 1 M I I I I I 

g ! 2 9 LAVGKSGRPCQIMRYFGRVLS FVSGGLFLRAI RI CLGAWQTAAAVQS KCLAI S CRQASGC 

90 100 110 120 130 140 

100 no 
ml 2 9 . pep CPTYXAGFCLSDLTAFRPVTX 

III 1 1 1 1 1 1 1 I = 1 1 1 1 M I 

g 1 2 9 RPTYRAGFCLSDLAAFRPVTX 
150 160 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 497>: 

al29.seq (partial) 

1 TATCTGCGCT TTCACTATTT GCCCTTTCAG GCTGCGGGCA TAGGGACGGA 

51 ■ ACAGGTAGCG GTCAAATCCT GTTTCATCCA AATAAACACG TTGGTAGTCG 

101 GAAAATTCGG CCAGCTGTGT CAAATAATGC GTTACTTTGG CCGGGTCTTG 

151 TTCTTTGTAA GTGGTGGTCT TTTTTTGCGC GTTATCCCCA TCTGTTTGAG 

201 TGCATAGCAA ATGGTGGCTG CCGTACAATC AAAATGTTTG GCGATTTCAT 

251 GCAGATAGGC ATCCTGGTGT TGCCCAACAT ATTGAGCCGG TTTTTGCCTA 

301 TCCGATTTGA CGGCATTTAG ACCGGTAACT TGA 

This corresponds to the amino acid sequence <SEQ ID 498; ORF 129.a>: 

al29.pop (partial) 

1 YLRFHYLPFQ AAGIGTEQVA VKSCFIQINT LWGKFGQLC QIMRYFGRVL 

51 FFVSGGLFLR VIPIC LSA*Q MVAAVQSKCL AISCR*ASWC CPTY*AGFCL 

101 SDLTAFRPVT * 

ml29/al29 98.2% identity in 1 10 aa overlap 

10 20 30 40 50 60 

ml2 9 pep YLRFHYLPFQAAGIGTEQVAVKSCFIQINTLWGKFGRLCQIMRYFGRVLFFVSGGLFLR 
| M | I I I M L I M I I I I I I I I I I I M I M M I I I M I : M I I I I I I I I I I I t M I I M M 
al 2 9 YLRFHYLPFQAAGIGTEQVAVKSCFIQINTLWGKFGQLCQIMRYFGRVLFFVSGGLFLR 

10 20 30 40 50 60 

70 80 90 100 110 

ml29 pep VIPICLSAXQMVAAVQSKCLAISCRXASGCCPTYXAGFCLSDLTAFRPVTX 
| | | | | | t | I I 1 I I ! I ! I I I I I I I I I M I I I M I I 1 i I M I I I I M I M I I 
al 29 VIPICLSAXQMVAAVQSKCLAISCRXASWCCPTYXAGFCLSDLTAFRPVTX 

70 80 90 100 110 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 499>: 

gl30 . seq 

1 AT G AAA C AAC TCCGCGACAA CAAAGCCCAA GGCTCTGCAC TGTTTACCCT 

51 TGTGAGCGGT ATCGTTATTG TTATTGCAGT CCTTTATTTC CTGATTAAGC 

101 TGGCGGGCAG TGGATCGTTC GGCGATGTCG ATGCCACTAC GGAAGCGGCA 

151 ACGCAGACCC GCATCCAGCC TGT CGGACAA TTGACGATGG GTGACGGCAT 

201 CCCCGTCGGC GAACGCCAAG GCGAACAGAT TTTCGGCAAA ATCTGTATCC 
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2 51 AATGCCACGC GGCGGACAGC AATGTGCCGA ACGCTCCGAA ACTGGAACAC 

3 01 AACGGCGACT GGGCGCCGCG TATCGCGCAA GGCTTCGATA CCTTGTTCCA 

3 51 ACACGCGCTG AACGGCTTTA ACGCCATGCC TGCCAAAGGC GGTGCGGCAG 

4 01 ACCTGACCGA TCAGGAACTC AAACGGGCGA TTACCTACAT GGCGAATAAA 
4 51 AGCGGCGGTT CTTTCCCGAA T C CTGATG AG GCTGCGCCTG CCGACAATGC 
501 CGCTTCAGGA ACAGCTTCTG CTCCTGCCGA TAGTGCAGCT CCGGCAGAAG 
551 CGAAGGCAGA AGACAAGGGT GCGGCAGCCC CTGCGGTCGG CGTTGACGGT 
601 AAAAAAGTCT TCGAAGCAAC CTGTCAGGTG TGCCACGGCG GTTCGATTCC 
6 51 CGGTATTCCC GGCATAGGCA AAAAAGACGA TTGGGCACCG CGTATCAAAA 
701 AAGGCAAAGA AACCTTGCAC AAACATGCCC TTGAAGGCTT TAACGCGATG 
751 CCGGCCAAAG GCGGCAATGC AGGTTTGAGC GATGACGAAG TCAAAGCGGC 
801 TGTTGACTAT ATGGCAAACC AATCCGGTGC AAAATTCTAA 

This corresponds to the amino acid sequence <SEQ ID 500; ORF 130.ng>: 

gl3 0 .pep 

1 MKQLRDNKAQ GSALF TLVSG IVIVIAVLYF LI KLAGSGSF GDVDATTEAA 

51 TQTRIQPVGQ LTMGDGIPVG ERQGEQIFGK ICIQCHAADS NVPNAPKLEH 

101 NGDWAPRIAQ GFDTLFQHAL NGFNAMPAKG GAADLTDQEL KRAITYMANK 

151 SGGSFPNPDE AAPADNAASG TASAPADSAA PAEAKAEDKG AAAPAVGVDG 

201 KKVFEATCQV CHGGSIPGIP GIGKKDDWAP RIKKGKETLH KHALEGFNAM 

251 PAKGGNAGLS DDEVKAAVDY MANQSGAKF* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 501 >: 

ml3 0 . seq (partial) 

1 . . GGCGAACAGA TTTTCGGCAA AATCTGTATC CAATGCCACG CGGCGGACAG 

51 CAATGTGCCG AACGCTCCGA AACTGGAACA CAACGGCGAT TrGGCACCGC 

101 GTATCGgCAA GGCTTCGATA CCTTGTTCCA ACACGCGCTG AACGGCTTTA 

151 ACGCCATGCC TGCAAAAGGC GGTGCGGCAG ACCTGACCGA TCAGGAACTT 

201 AAACGGGCGA TTACTTAGAT GG CGAAC AAA AGCGGCGGTT CTTTCCCGAA 

2 51 TCCTGATGAG GCTGCGCCTG CCGACAATGC CGCTTCAGGA ACAGCTTCTG 
301 CTCCTGCCGA TAGTGCAGCT CCGGCAGAAG CGAAGGCAGA AGACAAGGGT 

3 51 GCGGCACCCC TGCGGTCGGC GTTGACGGTA AAAAAGTCTT CGAAGCAACC 

4 01 TGTCAGGTGT GCCACGGCGG TTCGATTCCC GGTATTCCCG GCATAGGCAA 
4 51 AAAAGACGAT TGGGCACCGC GTATCAAAAA AGGCAAAGAA AC C TTGC AC A 
501 AACACGCCCT TGAAGGCTTT AACGCGATGC CTGCCAAArG CGgCAATGCA 
551 GGTTTGAGCG ATGACGAAgT CAAAGCGGCT GTTGACTATA TGGCAAACCA 
601 ATCCGGTGCA AAATTCTAA 

This corresponds to the amino acid sequence <SEQ ID 502; ORF 130>: 

ml3 0 .pep (partial) 

1 . . GEQIFGKICI QCHAADSNVP NAPKLEHNGD X APR I QG FDT LFQHALNGFN 
51 AMPAKGGAAD LTDQELKRAI TYMANKSGGS FPNPDEAAPA DNAASGTASA 

101 PADSAAPAEA KAEDKGAAPA VGVDGKKVFE ATCQVCHGGS IPGIPGIGKK 
151 DDWAPRIKKG KETLH KHALE GFNAMPAKXG NAGLSDDEVK AAVDYMANQS 

201 GAKF* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 130 shows 98.1% identity over a 206 aa overlap with a predicted ORF (ORF 130.ng) 
from N. gonorrhoeae: 

ml30/g!30 



ml3 0 .pep 



10 20 30 

GEQIFGKICIQCHAADSNVPNAPKLEHNGD 

ill III Nil INN INI I INN I I II I 

gl30 DATTEAATQTRIQPVGQLTMGDGIPVGERQGEQIFGKICIQCHAADSNVPNAPKLEHNGD 
50 60 70 80 90 100 

40 50 60 70 80 89 

ml3 0 pep XAPR I - QGFDTLFQHALNGFN AM PAXGGAADLTDQELKRA I TYMANKSGGS FPNPDEAAP 

I I M I II Mil i I I I I I INI Nil I N INI II M N Nil N III M III N I I I I 

gl3 0 WAPRIAQGFDTLFQHALNGFNAMPAKGGAADLTDQELKRAI TYMANKSGGS FPNPDEAAP 

110 120 130 140 150 160 
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90 100 110 120 130 140 

ml 3 0 pep ADNAASGTASAPADSAAPAEAKAEDKGAA- P AVG VDG KKV F E ATCQVCHGG SIPGIPGIG 

Mill Mil > M Ml HIM M IN II II II I 1 1 M I II M I M 1 1 1 1 1 1 1 1 M M I 1 1 

gl30 ADNAAS GTASAP AD SAAPAEAKAEDKGAAAPAVGVDGKKVFE ATCQVCHGG SIPGIPGIG 

170 180 190 200 210 220 

150 160 170 180 190 200 

ml 3 0 pep KKDDWAPRIKXGKETLHKHALEGFNAMPAKXGNAGLSDDEVKAAVDYMANQSGAKFX 

1 1 1 1 1 M 1 1 1 M II 1 1 1 1 M I H 1 1 1 1 1 II 1 1 I M 1 1 M M I M 1 1 1 1 1 1 1 II I M 

gl3 0 KKDDWAPRIKKGKETLHKHAIjEGFNAMPAKGGNAGLSDDEVKAAVDYMANQSGAKFX 
230 240 250 260 270 280 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 503>: 

al30.seq 

1 ATGAAACAAC tccgcgacaa caaagcccaa ggctctgcac tgtttaccct 

51 TGTGAGCGGT atcgttattg ttattgcagt cctttatttc ctgattaagc 

101 tggcgggcag cggctcgttc ggcgatgtcg atgccactac ggaagcagca 

151 acgcagaccc gtatccagcc tgtcggacaa ttgacgatgg gcgacggcat 

201 ccccgtcggc gaacgccaag gcgaacagat tttcggcaaa atctgtatcc 

251 aatgccacgc ggcggacagc aatgtgccga acgctccgaa actggaacac 

301 aacggcgatt gggcgccgcg tatcgcgcaa ggcttcgata ccttgttcca 

351 acacgcgctg aacggcttta acgccatgcc tgccaaaggc ggtgcggtag 

4 01 ACCTGACCGA TCAGGAACTC AAACGGGCGA TTACTTACAT GGCGAACAAA 

4 51 AGCGGCGGTT CTTTCCCGAA TCCTGATGAG GCTGCGCCTG CCGACAATGC 

501 CGCTTCAGGA ACAGCTTCTG CTCCTGCCGA TAGTGCAGCT CCGGCAGAAG 

551 CGAAGGCAGA AGACAAGGGT GCGGCAGCCC CTGCGGTCGG CGTTGACGGT 

601 AAAAAAGTCT TCGAAGCAAC CTGTCAGGTG TGCCACGGCG GTTCGATTCC 

651 CGGTATTCCC GGCATAGGCA AAAAAGACGA TTGGGCACCG CGTATCAAAA 

7 01 AAGGCAAAGA AACCTTGCAC AAACACGCCC TTGAAGGCTT TAACGCGATG 

7 51 CCTGCCAAAG GCGGCAATGC AGGTTTGAGC GATGACGAAG TCAAAGCGGC 

8 01 TGTTGACTAT ATGGCAAACC AATCCGGTGC AAAATTCTAA 

This corresponds to the amino acid sequence <SEQ ID 504; ORF 130.a>: 

al30 .pep 

1 MKQLRDNKAQ GSAL FTLVSG IVIVIAVLYF LI KLAGSGSF GDVDATTEAA 

51 TQTRIQPVGQ LTMGDGIPVG ERQGEQIFGK ICIQCHAADS NVPNAPKLEH 

101 NGDWAPRIAQ GFDTLFQHAL NGFNAMPAKG GAVDLTDQEL KRAITYMANK 

151 SGGSFPNPDE AAPADNAASG TASAPADSAA PAEAKAEDKG AAA P AVG VDG 

201 KKVFEATCQV CHGGSIPGIP GIGKKDDWAP RIKKGKETLH KHALEGFNAM 

251 PAKGGNAGLS DDEVKAAVDY MANQSGAKF* 



ml 30/al 30 97.6% identity in 206 aa overlap 

10 20 30 

GEQI FGKI C IQCHAADSNVPNAPKLEHNGD 

1 M I M ! If I ! I II I I I I I 1 I I I M I I I 1 I 
DATTEAATQTR I QPVGQLTMGDG I PVGERQGEQI FGKI C IQCHAADSNVPNAPKLEHNGD 
50 60 70 80 90 100 



ml 30. pep 
a!30 



40 50 60 70 80 89 

ml 30 pep XAPRI-QGFDTLFQHALNGFT^AMPAKGGAADLTDQELKRAITYMANKSGGSFPNPDEAAP 
|| I I I I I I I I I M I I I M I I I I I I I M : I I I I I I I I M II I I I I I I II I II I I I I I M 
a l 30 WAPRIAQGFDTLFQHALNGFNAMPAKGGAVDLTDQELKRAITYMANKSGGSFPNPDEAAP 
110 120 130 140 150 160 

90 100 110 120 130 140 

ml 30 pep ADNAASGTASAPADSAAPAEAKAEDKGAA- PAVGV DG KKV FEATCQVCHGGS I PGIPGIG 

| | I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I M M I I I I I I I 1 I I I I I I M I I ! 
a 130 A DN AAS GT AS A P ADS AAP AE AKAE DKG AAA P A VGVDGKKV FEATCQVCHGGS I PGIPGIG 

170 180 190 200 210 220 



ml30 .pep 



150 160 170 180 190 200 

KKDDWAPRIKKGKETLHKHALEGFNAMPAKXGNAGLSDDEVKAAVDYMANQSGAKFX 
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| I | I I M | I I I ! I I I I I I I I I I I I II I I I I I I M I I 1 I I M M I M I I I II M I M 
KKDDWAPRIKKGKETLHKHALEGFNAMPAKGGNAGLSDDEVKAAVDYMANQSGAKFX 
230 240 250 260 270 280 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 505>: 

gl32 . seq 

1 ATGGAAGCCT TCAAAACCCT AATTTGG AT T ATTAATATTA TTTCCGCTTT 

51 GGCCGTCATC GTGTTAGTAT TGCTCCAACA CGGCAAAGGC GCGGATGCCG 

101 GCGCGACCTT CGGATCGGGA AGCGGCAGCG CGCAAGGCGT ATTCGGCTCT 

151 GCCGGCAACG CCAACTTcct CAgccGCTCG AccGccGTTG CAGCAACAtt 

201 tttcttTGca acctgcAtgg gctatggTgt atattcacac CCACACGACA 

251 AAACACGGTT TGGACTtcag caacataCGA CAGACTCAGC AagcACCCAA 

301 ACCcgtAAGC AATACCGAAC CTTCTGCCCC TGTTCCTCAG CAGCAGAAAT 

351 AACagtTTTT CAAATgccga caTGgtga 

This corresponds to the amino acid sequence <SEQ ID 506; ORF 132.ng>: 

gl32 .pep 

! MEAFKTLIWI INIISALA VI VLVLLQHGKG ADAGATFGSG SGSAQGVFGS 

51 AGNANFLSRS TAVAATFFFA TCMGYGVYSH PHDKTRFGLQ QHTTDSASTQ 

101 TRKQYRTFCP CSSAAEITVF QMPTW* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 507>: 

ml32.seq (partial) 

1 ATGGAACCCT TCAAAACCTT AATTTGGATT GTTAATTTAA TTTCCGCTTT 

51 GGCCGTCTTC GTGTTAGTAT TGCTCCAACA CGGCAAAGGC GCGGATGCCG 

101 GCGCGACTTT CGGA . . . 

This corresponds to the amino acid sequence <SEQ ID 508; ORF 132>: 

ml32.pep (partial) 

1 MEPFKTLIWI VNLISALAV F VLVLLQHGKG ADAGATFG . . . 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from K gonorrhoeae 

ORF 132 shows 89.5% identity over a 38 aa overlap with a predicted ORF (ORF 132.ng) 
from N. gonorrhoeae: 

ml32/gl32 

10 20 30 

ml32 pep MEPFKTLIWIVNLISALAVFVLVLLQHGKGADAGATFG 

M I I I I ! I I : I = I I ! I II = I II I II I I 1 II I I I M M 
gl 3 2 MEAFKTLIWI INI I SALAVI VLVLLQHGKGADAGATFGSGSGSAQGVFGS AGNANFLSRS 

10 20 30 40 50 60 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 509>: 

al32 . seq 

1 ATGGAAGCCT TCAAAACCCT AATTTGGATT GTTAATATAA TTTCCGCTTT 

51 GGCCGTCATC GTGTTAGTAT TGCTCCAACA CGGCAAAGGC GCGGATGCCG 

101 GCGCGACTTT CGGATCGGGA AGCGGCAGCG CGCAAGGCGT ATTCGGCTCT 

151 GCCGGCAACG CTAACTTCCT CAGCCGCTCG ACCGCCGTTG CAGCAACATT 

201 TTTCTTTGCA ACCTGCATGg GCTATGGTGT ATATTCACAC CCACACGACA 

251 AAACACGGTT TGGACTTCAG CAACGTACAA CAAACTCAGC AAGCACCCAA 

301 ACCCGTAAGC AATACCGAAC CTTCTGCCCC TGTTCCTCAG CAGCAGAAAT 

351 AACAGTTTTT CAAATGCCGA CATGGTGA 

This corresponds to the amino acid sequence <SEQ ID 510; ORF 132.a>: 

al32 .pep 

1 MEAFKTLIWI VNIISALA VI VLVLLQHGKG ADAGATFGSG SGSAQGVFGS 

51 AGNANFLSRS TAVAATFFFA TCMGYGVYSH PHDKTRFGLQ QRTTNSASTQ 

101 TRKQYRTFCP CSSAAEITVF QMPTW* 



ml32/al32 92.1% identity in 38 aa overlap 
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10 20 30 

ml32 .pep MEPFKTLIWIVNLISALAVFVLVLLQHGKGADAGATFG 
II I I M I I i I I : f f I If I : i f M I I t f II I I M I M I 
al32 MEAFKTLIWIVNIISALAVIVLVLLQHGKGADAGATFGSGSGSAQGVFGSAGNANFLSRS 

10 20 30 40 50 60 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 51 1>: 

gl34 . seq 

1 ATGTCCCAAG AAATCCTCGA CCAAGTGCGC CGCCGCCGCA CGTTTGCCAT 

51 CATCTCCCAC CCCGATGCGG GTAAAACCAC GCTGACCGAA AAACTGCTGC 

101 TGTTTTCGGG CGCGATTCAA AGCGCAGGCA CGGTGAAAGG TAAGAAAACC 

151 GGCAAATTCG CCACCTCCGA CTGGATGGAC ATCGAGAAGC AGCGCGGCAT 

201 TTCCGTGGCA TCAAGCGTGA TGCAGTTCGA CTACAAAGAC CACACCGTCA 

251 ACCTCTTGGA CACGCCGGGA C AC CAAGACT TCTCCGAAGA CACCTACCGC 

3 01 GTTTTAACCG CAGTGGACAG CGCCTTGATG GTCATCGACG CGGCAAAAGG 

3 51 CGTGGAAGCG CAAACCATCA AACTCTTGAA CGTCTGCCGC CTGCGCGATA 

4 01 CGCCGATTGT TACCTTCATG AACAAATACG ACCGCGAAGT GCGCGATTCT 
451 TTGGAACTCT TGGACGAAGT GGAAGACATC CTGCAAATCC GCTGCGCGCC 

5 01 CGTTACCTGG CCGATCGGTA TGGGCAAAAA CTTCAAGGGC GTGTACCACA 
551 TCCTGAACGA CGAAATCTAT CTCTTTGAAG CGGGCGGCGA ACGCCTGCCG 
601 CACGAGTTCG ACATCATCAA AGGCATAAAC AATCCCGAAT TGGAACAACG 
651 CTTTCCGTTG GAAATCCAGC AGTTGCGCGA CGAAATCGAA TTGGTGCAGG 
701 CGGCTTCCAA CGAATTTAAT CTCGacgaAT TTCTCGccgG CGAACTCACG 
751 CCAGTGTTCT TCGGCTCTGC GATTAACAAC TTCGGCATTC AGGAAATCCT 
801 CAATT CATTG ATTGACTGGG CACCCGCACC GAAACCGCGC GACGCGACCA 
851 TGCGCATGGT CGGGCCGGAC GAGCCGAAAT TTTCCGGATT T AT CTTT AAA 
901 ATCCAAGCCA AT ATGGACC C GAAACACCGC GACCGTATCG CCTTCTTGCG 
951 CGTCTGCTCC GGTAAATTCG AGCGCGGCAT GAAGATGAAA CACCTGCGTA 

1001 TCAACCGCGA AATCGCCGCC TCCAGCGTAG TAACCTTCAT GTCGCACGAC 

1051 CGCGAACTGG CGGAAGAAGC CTACGCCGGC GACATCATCG GCATCCCGAA 

1101 CCACGGCAAC ATCCAAATCG G CGAC AG C TT CTCCGAAGGC GAACAACTGG 

1151 CGTTTACCGG CATCCCATTC TTCGCGCCCG AACTGTTCCG CAGCGTCCGC 

1201 AT C AAAAAC C CGCTGAAAAT CAAACAACTG CAAAAAGGTT TGCAACAACT 

1251 CGGCGAAGAA GGTGCGGTTC AAGTATTCAA ACCGATGAGC GGCG CGGATT 

1301 TGATTTTGGG TGCGGTCGGC GTGTTGCAGT TTGAAGTCGT AACCTCACGC 

13 51 CTCGCCAACG AATACGGCGT GGAAGCCGTG TTCGACAGCG CATC CAT CTG 

1401 GTCGGCGCGC TGGGTATCGT GCGACGACAA GAAAAAACTG GCGGAATTTG 

1451 AAAAAGCCAA CGCAGGCAAC CTCGCCATCG ACGCAGGCGG CAACCTCGCC 

1501 TACCTCGCCC CCAACCGCGT GAATTTGGGG TTGACGCAAG AACGCTGGCC 

1551 GGACATCGTG TTCCACGAAA CGCGCGAACA TTCGGTCAAA CTCTAA 

This corresponds to the amino acid sequence <SEQ ID 512; ORF 134.ng>: 

gl34 .pep 

1 MSQEILDQVR RRRTFAIISH PDAGKTTLTE KLLLFSGAIQ SAGTVKGKKT 

51 GKFATSDWMD IEKQRGISVA SSVMQFDYKD HTVNLLDTPG HQDFSEDTYR 

101 VLTAVDSALM VIDAAKGVEA QTIKLLNVCR LRDTPIVTFM NKYDREVRDS 

151 LELLDEVEDI LQIRCAPVTW PIGMGKNFKG VYHILNDEIY LFEAGGERLP 

2 01 HEFDIIKGIN NPELEQRFPL EIQQLRDEIE LVQAASNEFN LDEFLAGELT 

251 PVFFGSAINN FGIQEILNSL IDWAPAPKPR DATMRMVGPD EPKFSGFIFK 

301 IQANMDPKHR DRIAFLRVCS GKFERGMKMK HLRINREIAA SSWTFMSHD 

351 RELAEEAYAG DIIGIPNHGN IQIGDSFSEG EQLAFTGIPF FAPELFRSVR 

4 01 IKNPLKIKQL QKGLQQLGEE GAVQVFKPMS GADLILGAVG VLQFEW TSR 

4 51 LANEYGVEAV FDSASIWSAR WVSCDDKKKL AEFEKANAGN LAI DAGGNLA 

501 YLAPNRVNLG LTQERWPDIV FHETREHSVK L* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 513>: 

ml34,seq 

1 ATGTCCCAAG AAATCCTCGA CCAAGTGCGC CGCCGCCGCA CGTTTGCCAT 

51 CATCTCCCAC CCTGACGCAG GTAAAACCAC GTTGACTGAA AAA CTCTTGC 

101 TGTTTTCGGG CGCGATTCAG AGCGCGGGTA CGGTAAAAGG CAAGAAAACC 

151 GGCAAATTCG CCACTTCCGA CTGGATGGAA ATCGAGAAGC AGCGCGGCAT 
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2 01 TTCCGTGGCA T CAAGTGTG A TGCAGTTCGA TTACAAAGAC CACACCGTCA 
251 ACCTCTTGGA CACGCCGGGA CACCAAGACT TCTCCGAAGA CACCTACCGC 

3 01 GTTTTAACCG C C GTGG AC AG CGCATTAATG GTCATCGACG CGGCAAAAGG 

3 51 CGTGGAAGCG CAAACCATCA AGCTCTTAAA CGTCTGCCGC CTGCGCGATA 

4 01 CACCGATTGT TACGTTTATG AACAAATACG ACCGCGAAGT GCGCGATTCC 
4 51 CTGGAACTTT TGGACGAAGT GGAAAACATT TTAAAAATCC GCTGCGCGCC 
501 CGTTACCTGG CCGATCGGTA TGGGCAAAAA CTTCAAGGGC GTGTACCACA 
551 TCCTGAACGA TGAAATTTAT CTCTTTGAAG CTGGCGGCGA ACGCCTGCCG 
601 CACGAGTTCG AC AT CAT C AA AGGCATCGAT AATCCTGAAT TGGAACAACG 

6 51 CTTTCCGTTG GAAATCCAGC AGTTGCGCGA CGAAATCGAA TTGGTGCAGG 

7 01 CGGCTTCCAA CGAGTTTAAT CTCGACGAAT TCCTCGCCGG CGAACTCACG 
751 CCCGTATTCT TCGGCTCTGC GATTAACAAC TTCGGTATTC AGGAAATCCT 
801 CAATTCATTG ATTGACTGGG CGCCCGCGCC GAAACCGCGC GACGCGACCG 
851 TACGTATGGT CGAGCCGGAC GAG C CGAAGT TTTCCGGATT TATCTTCAAA 
901 ATCCAAGCCA ATATGGACCC GAAACACCGC GACCGTATTG CCTTCTTGCG 
951 CGTCTGCTCC GGCAAATTCG AGCGCGGCAT GAAGATGAAA CACCTGCGTA 

1001 TCAACCGCGA AATCGCCGCC TCCAGCGTGG TTACCTTCAT GTCGCACGAC 

1051 CGCGAGCTGG TTGAAGAAGC CTACGCCGGC GACAT T AT CG GCATCCCGAA 

1101 CCACGGCAAC AT C C AAATCG GCGACAGCTT CTCCGAAGGC GAACAACTGG 

1151 CGTTCACCGG CATCCCATTC TTCGCACCCG AACTGTTCCG CAGCGTACGC 

1201 ATCAAAAACC CGCTGAAAAT CAAACAACTG CAAAAAGGCT TGCAACAGCT 

12 51 CGGCGAAGAA GGCGCGGTGC AGGTGTTCAA ACCGATGAGC GGCGCGGATT 

13 01 TGATTTTGGG CGCGGTCGGC GTGTTGCAGT TTGAAGTCGT TACCTCGCGC 

13 51 CTCGCCAACG AATACGGCGT AGAAGCCGTG TTCGACAGCG CAT C CAT C TG 

14 01 GTCGGCGCGC TGGGTATCGT GCGACGACAA GAAAAAACTG GCTGAATTTG 
1451 AAAAAGCCAA CGCGGGCAAC CTCGCCATCG ACGCAGGCGG CAACCTCGCC 

15 01 TACCTCGCCC CCAACCGCGT GAATTTGGGA CTCACGCAAG AACGTTGGCC 
1551 GGACATCGTG TTCCACGAAA CACGCGAACA TTCGGTCAAA CTGTAA 

This corresponds to the amino acid sequence <SEQ ID 514; ORF 134>: 

ml34 -pep 

1 MSQEILDQVR RRRTFAIISH PDAGKTTLTE KLLLFSGAIQ SAGTVKGKKT 

51 GKFATSDWME IEKQRGISVA SSVMQFDYKD HTVNLLDTPG HQDFSEDTYR 

101 VLTAVDSALM V I D AAKGVE A QTIKLLNVCR LRDTPIVTFM NKYDREVRDS 

151 LELLDEVENI LKIRCAPVTW PIGMGKNFKG VYHILNDEIY LFEAGGERLP 

201 HEFDIIKGID NPELEQRFPL EIQQLRDEIE LVQAASNE FN LDEFLAGELT 

251 PVFFGSAINN FGIQEILNSL IDWAPAPKPR DATVRMVEPD EPKFSGFIFK 

301 IQANMDPKHR DRIAFLRVCS GKFERGMKMK HLRINREIAA SSWTFMSHD 

351 REL.VEEAYAG DIIGIPNHGN IQIGDSFSEG EQLAFTGIPF FAPELFRSVR 

4 01 IKNPLKIKQL QKGLQQLGEE GAVQVFKPMS GADL I LGAVG VLQFEWT SR 

4 51 LANEYGVEAV FDSASIWSAR WVSCDDKKKL AE FE KAN AGN LAI DAGGNLA 

501 YLAPNRVNLG LTQERWPDIV FHETREHSVK L* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from TV. gonorrhoeae 

ORF 134 shows 98.7% identity over a 531 aa overlap with a predicted ORF (ORF 134.ng) 
from N. gonorrhoeae: 

ml34/gl34 

10 20 30 40 50 60 

ml34 . pep MSQEILDQVRRRRTFAI ISHPDAGKTTLTEKLLLFSGAIQSAGTVKGKKTGKFATSDWME 

II Mill Ml II Mill I I Ml MUM M II i I I M M MM I M I II MMMIMM 

g i34 MSQEILDQVRRRRTFAI ISHPDAGKTTLTEKLLLFSGAIQSAGTVKGKKTGKFATSDWMD 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml34 pep IEKQRGISVASSVMQFDYKDHTVNLLDTPGHQDFSEDTYRVLTAVDSALMVIDAAKGVEA 

II MMI I II II M IM II Ml MMI I M Mi MM M ,MII I I I M I IMMMI II 

g i34 IEKQRGISVASSVMQFDYKDHTVNLLDTPGHQDFSEDTYRVLTAVDSALMVIDAAKGVEA 

70 80 90 100 110 120 

130 140 150 160 170 180 
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ml 3 4 pep QTI KLLNVCRLRDTP I VTFMNKYDREVRDSLELLDEVEN I LKI RCAPVTWPIGMGKNFKG 

1 1 E 1 1 i j 1 1 1 1 i 1 1 1 1 1 i l i 1 i 1 1 1 1 1 1 1 1 i 1 1 1 1 1 i ( 1 1 = l i 1 1 1 1 i 1 1 1 1 1 1 1 i 1 1 1 

gl34 qt I KLLNVCRLRDT P I VT FMNKYDREVRDSLELLDEV ED I LQ I RCAPVTWPIGMGKNFKG 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml34 pep VYHILNDEIYLFEAGGERLPHEFDI IKGIDNPELEQRFPLEIQQLRDEIELVQAASNEFN 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 i I i ! I M I i 1 1 ^ 1 1 1 1 1 i 1 i i 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 M M I ! 

gl34 VYHILNDEIYLFEAGGERLPHEFDI I KG I NNPELEQRFPLE I QQLRDE I ELVQAASNE FN 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml34 pep LDEFLAGELTPVFFGSAINNFGIQEILNSLIDWAPAPKPRDATVRMVEPDEPKFSGFIFK 

I 1 1 I M I I M M 1 1 I M i 1 1 1 It M M i M 1 1 1 I I M 1 1 1 1 1 I = i 1 1 IIIMIIMIII 

gl34 LDEFLAGELTPVFFGSAINNFGIQEILNSLIDWAPAPKPRDATMRMVGPDEPKFSGFIFK 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml 3 4 pep IQANMDPKHRDRI AFLRVCSGKFERGMKMKHLRINREIAASSWTFMSHDRELVEEAYAG 

I M M 1 1 1 1 M M 1 1 ! I M 1 1 1 1 i M 1 1 1 f 1 1 1 M 1 1 M 1 1 1 1 1 1 1 1 1 1 M 1 1 = I ! I 1 11 

gl34 IQANMDPKHRDRIAFLRVCSGKFERGMKMKHLRINREIAASSVVTFT^SHDRELAEEAYAG 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml34 pep DIIGIPNHGNIQIGDSFSEGEQLAFTGIPFFAPELFRSVRIKNPLKIKQLQKGLQQLGEE 

|| || IN I 1 1 1 1 1 1 1 1 1 Mil MM IN I MINI Mill MINI III MINIM I II 

gl34 DI IGI PNHGNIQIGDSFSEGEQLAFTGI PFFAPELFRSVRI KNPLKI KQLQKGLQQLGEE 

370 380 390 400 410 420 

43 0 440 450 460 470 480 

ml34 pep GAVQVFKPMSGADLILGAVGVLQFEVVTSRLANEYGVEAVFDSASIWSARWSCDDKKKL 

M 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 I II 1 1 1 II I II 1 1 I I 1 1 M I 1 1 1 1 1 1 1 I I M 1 1 1 1 I M 1 1 [ 

gl34 QAVQVFKPMSGADLILGAVGVLQFEVVTSRLANEYGVEAVFDSASIWSARWVSCDDKKKL 

4 30 440 450 460 470 480 



ml34 -pep 
gl34 



490 500 510 520 530 

AEFEKANAGNLAIDAGGNLAYLAPNRVNLGLTQERWPDIVFHETREHSVKLX 

II || 1 1 II I II M II II II M M I M M II I M 1 1 II I M M M II M II II 

AEFEKANAGNLAIDAGGNLAYLAPNRVNLGLTQERWPDIVFHETREHSVKLX 
490 500 510 520 530 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 515>: 

al34 . seq 

1 ATGTCCCAAG AAATCCTCGA CCAAGTGCGC CGCCGCCGCA CGTTTGCCAT 

51 CATCTCCCAC CCTGACGCAG GTAAAACCAC GTTGACTGAA AAACTCTTGC 

101 TGTTTTCAGG TGCGATTCAA AGCGCGGGTA CGGTAAAAGG CAAGAAAACC 

151 GGCAAATTCG CCACCTCCGA CTGGATGGAC ATCGAGAAGC AGCGCGGCAT 

201 TTCCGTGGCA TCAAGCGTGA TGCAGTTCGA CTATAAAGAC CACACCGTCA 

251 ACCTTTTGGA CACGCCGGGA CACCAAGACT TCTCCGAAGA CACCTACCGC 

301 GTTTTGACCG CCGTCGATAG TGCCTTGATG GTCATCGACG CGGCAAAAGG 

351 CGTGGAAGCG C AAAC CAT CA AACTCTTGAA CGTCTGCCGC CTGCGCAATA 

4 01 CGCCGATTGT TACGTTCATG AACAAATACG ACCGCGAAGT GCGCGATTCC 

4 51 CTGGAATTGC TGGACGAAGT GGAAAAC AT C CTGCAAATCC GCTGCGCGCC 

501 CGTAACCTGG CCGATCGGCA TGGGCAAAAA CTTCAAAGGC GTGTACCACA 

551 TCCTGAACGA CGAAATCTAT CTCTTTGAAG CGGGCGGCGA ACGCTTGCCG 

601 CACGAGTTCG ACATCATCAA AGGCATCGAT AATCCCGAAT TGGAACAACG 

651 CTTTCCGTTA GAAATACAGC AGTTGCGCGA CGAAATCGAA TTGGTGCAGG 

7 01 CGGCTTCCAA CGAGTTCAAT CTCGACGAAT TCCTCGCCGG CGAACTCACG 

7 51 CCCGTATTCT TCGGCTCTGC GATTAACAAC TTCGGTATTC AGGAAATCCT 

801 CAATTCATTG ATTGAATGGG CGCCCGCGCC GAAACCACGC GATGCGACCG 

851 TGCGTATGGT CGAGCCGGAC GAGCCGAAGT TTTCCGGATT TATCTTCAAA 

901 ATCCAAGCCA ATATGGACCC GAAACACCGC GACCGTATTG CCTTCTTGCG 
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951 CGTCTGCTCC GGCAAATTCG 

1001 TCAACCGCGA AATCGCCGCC 

1051 CGCGAGCTGG TTGAAGAAGC 

1101 CCACGGCAAC ATCCAAATCG 

1151 CGTTTACCGG CATCCCATTC 

1201 AT C AAAAAC C CGCTGAAAAT 

1251 TGGCGAAGAA GGTGCGGTGC 

1301 TGATTTTGGG CGCGGTCGGC 

1351 CTTGCCAACG AATACGGCGT 

14 01 GTCGGCGCGC TGGGTATCGT 

14 51 AAAAAGCCAA CGCGGGCAAC 

1501 TACCTCGCCC CTAACCGCGT 

1551 GGACATCGTG TTCCACGAAA 



AGCGCGGCAT GAAAATGAAA CACCTGCGTA 
TCCAGCGTGG TAACCTTCAT GTCCCACGAC 
CTACGCCGGC GACATTATCG GTATCCCAAA 
GCGACAGCTT CTCCGAAGGC GAACAACTGA 
TTCGCGCCCG AACTGTTCCG CAGCGTTCGC 
CAAGCAACTG CAAAAAGGTT TGCAACAGCT 
AGGTGTTCAA ACCAATGAGC GGCGCGGATT 
GTGTTGCAGT TTGAAGTCGT TACCTCGCGC 
GGAAGCCGTG TTCGACAACG CATCCATCTG 
GCGACGACAA GAAAAAACTG GCGGAATTTG 
CTCGCCATCG ACGCGGGCGG CAACCTCGCC 
GAATCTGGGA CTCACGCAAG AACGCTGGCC 
CGCGCGAGCA TTCGGTCAAA CTTTAA 



This corresponds to the amino acid sequence <SEQ ID 516; ORF 134.a>: 

al34 .pep 

1 MSQEILDQVR RRRTFAIISH PDAGKTTLTE KLLLFSGAIQ SAGTVKGKKT 

51 GKFATSDWMD IEKQRGISVA SSVMQFDYKD HTVNLLDTPG HQDFSEDTYR 

101 VLTAVDSALM VI DAAKGVEA QTIKLLNVCR LRNTPIVTFM NKYDREVRDS 

151 LELLDEVENI LQIRCAPVTW PIGMGKNFKG VYHILN DEIY LFEAGGERLP 

201 HEFDIIKGID NPELEQRFPL EIQQLRDEIE LVQAASNEFN LDEFLAGELT 

251 PVFFGSAINN FGIQEILNSL IEWAPAPKPR DATVRMVEPD EPKFSGFIFK 

301 IQANMDPKHR DRIAFLRVCS GKFERGMKMK HLRINREIAA SSVVTFMSHD 

351 RELVEEAYAG DIIGIPNHGN IQIGDSFSEG EQLT FTGIPF FAPELFRSVR 

4 01 IKNPLKIKQL QKGLQQLGEE GAVQVFKPMS GADLILGAVG VLQFEW TSR 

451 LANEYGVEAV FDNASIWSAR WVSCDDKKKL AEFEKANAGN LAIDAGGNLA 

501 YLAPNRVNLG LTQERWPDIV FHETREHSVK L* 

ml34/al34 98.9% identity in 531 aa overlap 

10 20 30 40 50 60 

MSQEILDQVRRRRTFAIISHPDAGKTTLTEKLLLFSGAIQSAGTVKGKKTGKFATSDWME 

M It I I I M I I I I ! I I ! I I M I! M I M i t I I I I I I I I I I I I I M II I I I I II II II I I : 
MSQEILDQVRRRRTFAIISHPDAGKTTLTEKLLLFSGAIQSAGTVKGKKTGKFATSDWMD 
10 20 30 40 50 60 



ml 3 4 . pep 
al34 



70 80 90 100 110 120 

ml34 pep I EKQRGISVASSVMQFDYKDHTVNLLDTPGHQDFSEDTYRVLTAVDSALMVI DAAKGVEA 

M | | I M I M I I I I I I I I I I I I I I I M I I I I M M I I I I I II M I I I I I II I M I I I M I 
al34 IEKQRGISVASSVMQFDYKDHTVNLLDTPGHQDFSEDTYRVLTAVDSALMVI DAAKGVEA 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 34 pep QTIKLLNVCRLRDTPIVTFMNKYDREVRDSLELLDEVENILKIRCAPVTWPIGMGKNFKG 
( I M I I I I I I I I : I I I I I I I I I I I I I I I I I I 11 I I I I M M : I i I I I I I M I I II I I I I I 
ai3 4 QTIKLLNVCRLRNTPIVTFMNKYDREVRDSLELLDEVENILQIRCAPVTWPIGMGKNFKG 

130 140 150 160 170 180 

190 200 210 220 : 230 240 

ml3 4 pep VYH I LNDEIYLFEAGGERLPHEFDIIKGIDNPELEQRFPLEIQQLR.DE IE LVQAASNEFN 

I | | I I I I I I I I I I I I I I I I 1 I I I M I II I I I I I I I I I I M I I I I I II I I M I I II I I I I I 
a 13 4 VYHILNDEIYLFEAGGERLPHEFDIIKGIDNPELEQRFPLEIQQLRDEIELVQAASNEFN 

190 200 210 220 .230 240 

250 260 270 280 290 300 

ml3 4 pep LDE FLAGELTPVFFGSAINNFGIQEILNSLIDWAPAPKPRDATVRMVEPDEPKFSGFIFK 

I | | | I I | i | I | | II M II I I I I I I I I I I I M : I M I I I I I I I I I I I I II I I M I I I I I I I 
a 134 LDE FLAGELT PVFFGSAINNFGIQEILNSLIEWAPAPKPRDATVRMVEPDEPKFSGFIFK 

250 260 270 280 290 300 



310 320 330 340 350 360 

ml 3 4 . pep IQANMDPKHRDRIAFLRVCSGKFERGMKMKHLRINREIAASS WTFMSHDRELVEEAYAG 

I I I I I I I I I I I I I t 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

al3 4 I QANMDPKHRDR I AFLRVCSGKFERGMKMKHLRINRE I AASS WTFMSHDRELVEEAYAG 

310 320 330 340 350 360 
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370 380 390 400 410 420 

ml34 pep DIIGIPNHGNIQIGDSFSEGEQLAFTGIPFFAPELFRSVRIKNPLKIKQLQKGLQQLGEE 
I I I | M I M I I I ! M I t I I i M I : I I M I M I I I I I I I i I M I I M I I I M II 1 1 I I I M 
al34 dugiPNHGNIQIGDSFSEGEQLTFTGIPFFAPELFRSVRIKNPLKIKQLQKGLQQLGEE 

370 380 390 400 410 420 

430 440 450 460 470 480 

ml 3 4 pep GAVQVFKPMSGADLILGAVGVLQFEWTSRLAMEYGVEAVFDSASIWSARWVSCDDKKKL 
| | | | | M I II I M I I I 1 I I M I I I I 1 I I I I I I I I M M i I I I : I I I I I I I I I I I I M I I I 
al34 GAVQVFKPMSGADLILGAVGVLQFEWTSRLANEYGVEAVFDNASIWSARWVSCDDKKKL 

430 440 450 460 470 480 

490 500 510 520 530 

ml34 pep AEFEKANAGNLAIDAGGNLAYLAPNRVNLGLTQERWPDIVFHETREHSVKLX 

| | | | U I I I I I I I I I I M I i I I I I M I I II I I M I I I I II I M I I I I I I I I I 
al34 AEFEKANAGNLAIDAGGNLAYLAPNRVNLGLTOERWPDIVFHETREHSVKLX 

490 500 510 520 530 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 517>: 

g!35.seq 

1 ATGAAATACA AAAGAATCGT ATTTAAAGTC GGCACAT CTT CGATTACCCG 

51 TTCGGAcgGC AGCCTCTCGC GCGGCAAAAT CCAAACCATC ACCCGCCAGC 

101 TTGCCGCATT GCATCATGCG GGACACGAGC TGGTCTTGGT GTCTTCCGGC 

151 GCGGTTGCTG CAGGGTTCGG CGCGCTGGGT TTCAAAAAAC GTCCGGTCAA 

2 01 AATCGCCGAC AAACAGGCTT CCGCCGCCGT CGGGCAGGGG CTGCTGATGG 
251 AAGAATATAC GGCAAACCTG TCTTCAGACG GCATCGTGTC CGCACAAATC 

3 01 CTGCTCAGCC GTGCCGACTT TGC CGACAAA CGCCGCTACC AAAATGCCGG 

3 51 CGGCGCACTT TCCGTGCTGC TGCAACGCCG CGCGATTCCC AT CAT C AAT G 

4 01 AAAACGACAC GGTTTCGGTT GAGGAGTTGA AAATCGGCGA C AAC G AC AC A 
4 51 TTGAGTGCGC AAGTGGCGGC GATGATACAG GCAGACCTCT TGGTGCTGCT 
501 G AC CG AC AT A GACGGTCTTT ACACCGGCAA CCCGAACAGC AATCCCGATG 
551 CCGTACGGCT GGACAAAATC G AAC AC AT CA ACCATGAAAT CAT C G AAAT G 
601 GCGGGCGGCT CGGGTTCGGC AAACGGCACG GGCGGTATGC TGACCAAAAT 
651 CAAAGCGGCA ACCATCGCCG CCGAATCCGG CGTACCGGTG TATATCTGTT 
701 CCTCACTCAA AC C CG AT T C A TTGGCCGAAG CCGCCGAACA TCAGGCGGAC 
7 51 GGCTCGTTTT TCGTcccCcg tgCCAAAGGT TTGCGGACAC AGAAGCAATG 
801 GctggCGTTC TATTCcgaaa gcggGGgcag cgttTAtgtg gacgaaagtg 
851 cggaacacgc tTtgtccgaa caagggaaag cctgCTGA 

This corresponds to the amino acid sequence <SEQ ID 518; ORF 135.ng>: 

gl3 5 .pep 

1 MKYKRIVFKV GTSSITRSDG SLSRGKIQTI TRQLAALHHA GHE LVLVSSG 

51 AVAAGFGALG FKKRPVKI AD KQASAAVGQG LLMEEYTANL SSDGIVSAQI 

101 LLSRADFADK RRYQNAGGAL SVLLQRRAIP IINENDTVSV EELKIGDNDT 

1 51 LSAQVAAMIQ ADLLVLL TD I DGLYTGNPNS NPDAVRLDKI EHINHEIIEM 

201 AGGSGSANGT GGMLTKIKAA TIAAESGVPV YICSSLKPDS LAEAAEHQAD 

251 GSFFVPRAKG LRTQKQWLAF YSESGGSVYV DESAEHALSE QGKAC* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 519>: 

ml35 . seq 

1 ATGAAATACA AAAGAATCGT ATTTAAAGTC GGCACATCTT CGATTACCCA 

51 TTCGGACGGC AGTCTCTCGC GCGGCAAAAT CCAAACCATC ACCTGCCAGC 

101 TTGCCGCATT GCATCATGCG GGACACGAGC TGGTCTTGGT GTCTTCCGGC 

151 GCGGTTGCGG CAGGGTTCGG TGCGCTGGGT TTCAAAAAAC GTCCGGTCAA 

201 AATCGCCGAC AAACAGGCTT CCGCCGCCGT CGGGCAGGGG CTGCTGATGG 

251 AAGAATATAC GGCAAACCTG TCTTCAGACG GCATCGTGTC CGCGCAAATC 

3 01 CTGCTCAGCC GCGCCGACTT TGCCGACAAA CGCCGCTACC AAAATGCCGG 

3 51 CGGCGCACTT TCCGTGCTGC TGCAACGCCG CGCCGTCCCC AT C ATC AATG 

4 01 AAAACGATAC GGTTTCGGTT GAGGAATTGA AAATCGGCGA CAACGACACA 
4 51 TTGAGTGCGC AAGTGGCGGC GATGATACAG GCAGACCTCT TGGTGCTGCT 



WO 99/057280 



383 



PCT/US99/09346 



501 GACCGACATA GACGGTCTTT ACACGGGCAA CCCGAACAGC AATCCCGATG 

551 CCGTACGGCT GGACAAAATC GAACACATCA ACCATGAAAT CATCGAAATG 

6 01 GCGGGCGGCT CGGGTTCGGC AAACGGCACG GGCGGTATGC TGACCAAAAT 

6 51 CAAAGCGGCA ACCATCGCCG CCGAATCCGG CGTACCGGTG ' TATATCTGTT 

7 01 CCTCGCTCAA ACCCGATGCA CTTGCCGAAG CTGCCGAACA TCAGGCGGAC 
751 GGCTCGTTTT TCGTCCCCCG TGCCAAAGGT TTGCGGACGC AGAAG CAATG 

8 01 GCTGGCGTTC T ATT C CG AAA GCCGGGGCAG CGTTTATGTG GACGAAGGTG 
851 CGGAACACGC TTTGTCCGAA CAGGGGAAAA GCCTGCTGAT GTCGGGCATT 
901 GCCGGAATCG AAGGGCATTT TTCCCGTATG GACACCGTAA CCGTGTACAG 
951 CAAGGCAACC AAACAGCCCC TGGGCAAAGG GCGCGTCCTG TTCGGCTCTG 

1001 CCGCCGCCGA AGACCTGCTC AAATCGCGTA AGGCGAAAGG CGTGTTCATC 

1051 CATCGGGACG ACTGGATTTC CATCACGCCC G AAAT ACG C C TGCTTCTGAC 

1101 CGAATTTTAG 

This corresponds to the amino acid sequence <SEQ ID 520; ORF 135>: 

ml3 5 .pep 

1 MKYKRIVFKV GTSSITHSDG SLSRGKIQTI TCQLAALHHA GHE LVLVSSG 

51 AVAAGFGALG FKKRPVKIAD KQASAAVGQG LLMEEYTANL SSDGIVSAQI 

101 LLSRADFADK RRYQNAGGAL SVLLQRRAVP IINENDTVSV EELKIGDNDT 

151 LSAQVAAMIQ ADLLVLL TDI DGLYTGNPNS NPDAVRLDKI EHINHEIIEM 

201 AGGSGSANGT GGMLTKI KAA TIAAESGVPV YICSSLKPDA LAEAAEHQAD 

251 GSFFVPRAKG LRTQKQWLAF YSESRGSVYV DEGAEHALSE QGKSLLMSGI 

301 AGIEGHFSRM DTVTVYS KAT KQPLGKGRVL FGSAAAEDLL KSRKAKGVFI 

351 HRDDWISITP EIRLLLTEF* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 135 shows 97.6% identity over a 294 aa overlap with a predicted ORF (ORF 135.ng) 
from N. gonorrhoeae: 

ml35/gl35 

10 20 30 40 50 60 

ml 3 5 pep MKYKRIVFKVGTSSITHSDGSLSRGKIQTITCQLAALHHAGHELVLVSSGAVAAGFGALG 

I M 1 1 1 1 M I M 1 1 1 1 : 1 1 1 1 1 II 1 1 1 II 1 1 II I II 1 1 1 1 1 1 1 II 1 1 II 1 1 1 M 1 1 1 1 I 

gl35 MKYKRIVFKVGTSSITRSDGSLSRGKIQTITRQLAALHHAGHELVLVSSGAVAAGFGALG 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml3 5 pep FKKRPVKIADKQASAAVGQGLLMEEYTANLSSDGIVSAQILLSRADFADKRRYQNAGGAL 

I I M I 1 1 I i 1 1 ! t ! i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 i 1 1 M I 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 

g i3 5 FKKRPVKIADKQASAAVGQGLLMEEYTANLSSDGIVSAQILLSRADFADKRRYQNAGGAL 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 3 5 pep SVLLQRRAVP 1 1 NENDTV S VE ELKIGDNDTLSAQVAAM I Q ADLLVLLTDI DGLYTGNPNS 

I ! I I I I I h I I III Ml I I II I Mil I I IMM1 Ml !! I , II I I MM I II ,! II MM 

g ! 3 5 S VLLQRRAI P 1 1 NENDTVS VE E LKI GDNDTLS AQVAAM I QADLLVLLTD I DGLYTGN PNS 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml 3 5 pep NPDAVRLDKI EHINHEII EMAGGSGS ANGTGGMLTKI KAAT I AAESGV P VY I CS S LKPDA 

Ml III! Ml IMIMMM IMM M MMIM MIMMMIMM II Mill M M 

g i3 5 NPDAVRLDKI EH I NHE 1 1 EMAGGSGS ANGTGGMLTKI KAAT I AAESGVPVY I CSSLKPDS 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml3 5 pep IJVEAAEHQADGSFFVPRAKGLRTQKQWLAFYSESRGSVYVDEGAEHALSEOGKSLLMSGI 

Mill MMMIMIMM Ml MIIIMMMI I I M M M I M 1 1 M 1 1 M 

g 135 LAEAAEHQADGSFFVPRAKGLRTQKQWLAFYSESGGSVYVDESAEHALSEQGKACX 

250 260 270 280 290 

310 320 330 340 350 360 
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ml3 5 .pep AGIEGHFSRMDTVTVYSKATKQPLGKGRVLFGSAAAEDLLKSRKAKGVFIHRDDWISITP 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 52 1>: 

al35 . seq 

1 ATGAAATACA AAAGAATCGT ATTTAAAGTC GGCACATCTT CGATTACCCA 

51 TTCGGACGGC AGTCTCTCGC GCGGCAAAAT CCAAACCATC ACCCGCCAGC 

101 TTGCCGCATT GCATCATGCG GGACACGAGC TGGTCTTGGT GTCTTCCGGC 

151 GCGGTTGCGG CAGGGTTCGG TGCGCTGGGT TTCAAAAAAC GTCCGGTCAA 

201 AATCGCCGAC AAACAGGCTT CCGCCGCCGT CGGGCAGGGG CTGCTGATGG 

251 AAGAATATAC GGCAAACCTG TCTTCAGACG GCATCGTGTC CGCACAAATC 

301 CTGCTCAGCC GCGCCGACTT TGCCGACAAA CGCCGCTACC AAAATGCCGG 

351 CGGCGCACTT TCCGTGCTGC TGCAACGCCG CGCCGTCCCC ATCATCAATG 

4 01 AAAACGATAC GGTTTCGGTT GAGGAATTGA AAATCGGCGA CAACGACACA 

4 51 TTGAGTGCGC AAGTGGCGGC GATGATACAG GCAGACCTCT TGGTGCTGCT 

501 GACCGACATA GACGGTCTTT ACACCGGCAA CCCGAACAGC AATCCCGATG 

551 CCGTACGGCT GGACAAAATC GAACACATCA ACCATGAAAT CATCGAAATG 

601 GCGGGCGGCT CGGGTTCGGC AAACGGCACA GGCGGTATGC TGACTAAAAT 

651 CAAAGCGGCG ACGATTGCGA CCGAGTCCGG CGTACCGGTC TATATCTGTT " 

7 01 CCTCGCTCAA ACCCGATGCA CTTGCCGAAG CGGCAGATAA TCAGGCGGAC 
751 GGCTCGTTTT TCGTCCCCCG TGCCAAAGGT TTGCGGACGC AGAAGCAATG 
801 GCTGGCGTTC TATTCCGAAA GCAGGGGCGG CGTTTATGTG GACGAAGGTG 

8 51 CGGAACACGC TTTGTCCGAA CAGGGAAAAA GCCTGCTGAT GTCGGGCATT 
901 GCCGGAATCG AAGGGCATTT TTCCCGTATG GACACCGTAA CCGTGTACAG 
951 CAAGGCAACC AAACAGCCTT TGGGCAAAGG GCGAGTCCTG TTCGGCTCTG 

1001 CCGCCGCCGA AGACCTGCTC AAATTGCGTA AGGCGAAAGG CGTGTTCATC 

1051 CATCGGGACG ACTGGATTTC CATCACGCCC GAAATACGCC TGCTTCTGAC 

1101 CGAATTTTAG 

This corresponds to the amino acid sequence <SEQ ID 522; ORF 135.a>: 

al35 . pep 

1 MKYKRIVFKV GTSSITHSDG SLSRGKIQTI TRQLAALHHA GHE LVLVSSG 

51 AVAAG FG AIXj FKKRPVKIAD KQASAAVGQG LLMEEYTANL SSDGIVSAQI 

101 LLSRADFADK RRYQNAGGAL SVLLQRRAVP IINENDTVSV EELKIGDNDT 

151 LSAQVAAMIQ ADLLVLL TDI DGLYTGN PNS NPDAVRLDKI EHINHEIIEM 

201 AGGSGSANGT GGMLTKIKAA TIATESGVPV YICSSLKPDA LAEAADNQAD 

251 GSFFVPRAKG LRTQKQWLAF YSESRGGVYV DEGAEHALSE QGKSLLMSGI 

301 AGIEGHFSRM DTVTVYSKAT KQPLGKGRVL FGSAAAEDLL KLRKAKGVFI 

351 HRDDWISITP EIRLLLTEF* 

m!35/al35 98.4% identity in 369 aa overlap 

10 20 30 40 50 60 

ml 35 pep MKYKRIVFKVGTSSITHSDGSLSRGKIQTITCQLAALHHAGHELVLVSSGAVAAGFGALG 

I | | | | | M I II I I I ! I I M I I I I I II M I I I I I 1 I I I I I I I I I I I M II I I I I 

a 135 MKYKRIVFKVGTSSITHSDGSLSRGKIQTITRQLAALHHAGHELVLVSSGAVAAGFGALG 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml35 pep FKKRPVKIADKQASAAVGQGLLMEEYTANLSSDGIVSAQILLSRADFADKRRYQNAGGAL 

I I M I I I I I I I I M I I I I I II 1 I I I I I I II I I I I I II I M I I I I II I I 1 I I I I I I 

a 135 FKKRPVKIADKQASAAVGQGLLMEEYTANLSSDGIVSAQILLSRADFADKRRYQNAGGAL 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 3 5 pep SVLLQRRAVPIINENDTVSVEELKIGDNDTLSAQVAAMIQADLLVLLTDI DGLYTGN PNS 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M 1 I I I I I I I I i I I I I I I I I I I M I I I I I I I 
al 35 SVLLQRRAVPIINENDTVSVEELKIGDN DTLSAQVAAMIQADLLVLLTDIDGLYTGN PNS 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml35.pep NPDAVRLDKIEHINHEIIEMAGGSGSANGTGGMLTKIKAATIAAESGVPVYICSSLKPDA 

II I I I I I I I I I I I I I I I I I I I I I I I M M I I I I I I I I I I I I I I : I I M II II II M I I M 
a i3 5 NPDAVRLDKIEHINHEIIEMAGGSGSANGTGGMLTKIKAATIATESGVPVYICSSLKPDA 

190 200 210 220 230 240 
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250 260 270 280 290 300 

LAEAAEHQADGSFFVPRAKGLRTQKQWLAFYSESRGSVYVDEGAEHALSEQGKSLLMSGI 

| | | I | : : I M I I I I II I I II I I I I I II I I I I : I I M I M M II I I I I I II I I I I I 

LAEAADNQADGSFFVPRAKGLRTQKQWLAFYSESRGGVYVDEGAEHALSEQGKSLLMSGI 

250 260 270 280 290 300 

310 320 330 340 350 360 

AGIEGHFSRMDTVTVYSKATKQPLGKGRVLFGSAAAEDLLKSRKAKGVFIHRDDWISITP 

I | | | I I I I I I M I I M II II M I I I I I I M I II I I I I! I I I I I Ill 

AGIEGHFSRMDTVTVYSKATKQPLGKGRVLFGSAAAEDLLKLRKAKGVFIHRDDWISITP 
310 320 330 340 350 360 

370 

EIRLLLTEFX 
I I I I I I I I I I 
EIRLLLTEFX 
370 



The following partial DNA sequence was identified in jV. gonorrhoeae <SEQ ID 523>: 

gl36 . seq 

1 ATGGAAATCC GGTTTCAGAC AGCATTTTTA CGTTTGGTTC AGatgaAAAC 

51 AAACGCTtca aTTCTtaccg caACACGCCT TGTATTTCCT GccgCTGCCG 

101 CACGGACAGG GATCGTTCCT GCCGgtTTTT TCCCCTTCCC TGCGGACGGT 

151 TTGCGGTTTG TTGATGACCG CCTGCCAGTA GCGGTAGATG TCtgccagcg 

201 cgTAAGGCag tTCGGAcgca agttccgcca gctcgccttc ggTGAATTGC 

251 AGgcggataa cgccgtttTC CTCTTCGTCg taaatgccgc ccactgccat 

301 cacgGGGTAA AACAGCTCTT CAAACGCTTC ATCATCGGCG GCTTCAAACC 

3 51 AATCGGTCGG CACAATGTCC AAACCGTAAA GATAGGCGTT GCACCAAGTG 

4 01 TAAAAATCGC TGCCGCCCTC GCCGTCGTCG TAGAGCCACA AATCGGGCAG 
451 CTTTTTATCC GACATCGCGG CGGTTGTTTC CATCGCCATT GCCAAAACCA 
501 GCCGTTCGAT TTCGGAACGT TCGGCGGCGG TAAATTGCGA TTCGTCGCCC 
551 AACACTTCGG GCAGCCAGTC GAGCGGTGCC AATTTGTCCG GCCCGCTCAA 
601 CAGCGCCGTC AT AAAAC CTT GAACCTCGTC GCAACGCATC GTGTTGCCTT 
651 GTTCGCTTTT GGC AT CCAAT AA 

This corresponds to the amino acid sequence <SEQ ID 524; ORF 136.ng>: 

gl3 6 . pep 

1 MEIRFQTAFL RLVQMKTNAS ILTATRLVFP AAAARTGIVP AGFFPFPADG 

51 LRFVDDRLPV AVDVCQRVRQ FGRKFRQLAF GELQADNAVF LFWNAAHCH 

101 HGVKQLFKRF IIGGFKPIGR HNVQTV KIGV APSVKIAAAL AVW EPQIGQ 

151 LFIRHRGGCF HRHCQNQPFD FGTFGGGKLR FVAQHFGQPV ERCQFVRPAQ 

201 QRRHKTLNLV ATHRVALFAF GIQ* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 525>: 
ml36 . seq 

1 ATGGAAACAA ACGCTTCAAT TCTTACCGCA ACACGCCTTG TATTTTCTGC 

51 CGCTGCCGCA CGGACAGGGA TCGTTCCTGC CTGTTTTTTC GCCTTCCCTG 

101 CGGACGGTTT GCGGTTTGTT GATGACTGCC TGCCAGTAGC GGT AG AT AT C 

151 CGCCAATGCA TAAGGCAACT CGGATT CC AG TTCCGCCAGC TCGCCTTCTG 

201 TGAATTGCAG ACGGATAGCG CCGTTTTCCT CTTCGTCGTA AATACCGCCC 

251 AATGCCATGA TGGGATAAAA CAACTCTTCA AACGCTTCAT CATCGACGGC 
301 TTCAAACCAA TCGGTCGGCA CAATATCCAA ACCGTAAAGA T AAG CATTGC 

3 51 AC CATGTGTA AAAATCGCTG CCGCCGTCTT CGTTTTCATA CAG C C AC AAA 
401 TCGGGCAGTT TTTTATCCGA CATCGCGGCG GTTGTTTCCA TCGCCATTGC 
451 C AAAAC CAG C CGTTCGATTT CGGAACGTTC GGCGGCGGTA AATTGCGATT 
501 CGTCGCCCAA CACTTCGGGC AGCCAGTCGA GCGGTGTCAA TTTGTCCGGC 
551 CCGCTCAACA GCGCCGTCAT AAAAC CTTGA ACCTCGTCGC AACGCATCGT 
601 GTTGCCTTGT TCGCTTTTGG CAT CCAACAA TTCGCTCAAC CGCCGTTTGG 
651 ATGCTTCGGT AAATTTTCGG GAATCCATCA TTTTCCTTTT CAAATGGGTT 
701 TTGCGCCCTA TTATCGCCGC AATGCCGTCT GA 

This corresponds to the amino acid sequence <SEQ ID 526; ORF 136>: 



ml35 . pep 
al35 

ml35 . pep 
a!35 

ml35 . pep 
a!35 
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ml36 . pep 

1 METNASILTA TRLVFSAAAA RTGIVPACFF AFPADGLRFV DDCLPVAVDI 

51 RQCIRQLGFQ FRQLAFCELQ TDSAVFLFW NTAQCHDGIK QLFKRFI IDG 

101 FKPIGRHNIQ TVK ISIAPCV KIAAAVFV FI QPQIGQFFIR HRGGCFHRHC 

151 QNQPFDFGTF GGGKLRFVAQ HFGQPVERCQ FVRPAQQRRH KTLNLVATHR 

201 VALFAFGIQQ FAQPPFGCFG KFSGIHHFPF QMGFAPYYRR NAV* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 136 shows 85.6% identity over a 209 aa overlap with a predicted ORF (ORF 136.ng) 
from N. gonorrhoeae: 

10 20 30 40 

METNASILTATRLVFSAAAARTGIVPACFFAFPADGLRFVDDCLPV 

hllltlilllllll 1 1 1 1 1 i 1 1 1 1 1 II IIMMIMII III 

ME I RFQT AF LRL VQM KTN AS I L T AT RL»V F P AAAARTG I V PAG F F P F P AD G LR FVDDRL P V 
10 20 30 40 50 60 



ml36/gl36 

ml3 6 .pep 
gl36 



50 60 70 80 90 100 

AVDIRQCIRQLGFQFRQLAFCELQTDSAVFLFWNTAQCHDGIKQLFKRFIIDGFKPIGR 

IN: | :||:| :|||lll M I* I • I I I I I I I I : I : I I hlllllllll MIIMI 
AVDVCQRVRQFGRKFRQLAFGELQADNAVFLFWNAAHCHHGVKQLFKRFIIGGFKPIGR 

70 80 90 100 110 120 



110 120 130 140 150 160 

HNI QTVKI S I APCVKI AAAVFVF I QPQ I GQ FF I RHRGGCFHRHCQNQPFDFGTFGGGKLR 

||:|||||::|| I I I U M I - I I I I I = I I I I I I I I I M I I I I I I I M I I I II I I i I 
HNVQTVKIGVAPSVKIAAALAVVVEPQ I GOLF I RHRGGCFHRHCQNQPFDFGTFGGGKLR 

130 140 150 160 170 180 



170 180 190 200 210 220 

ml 3 6 pep FVAQHFGQPVERCQFVRPAQQRRHKTLNLVATHRVALFAFGIQQFAQPPFGCFGKFSGIH 

M M II III II MM I IIMIIMI I M II MUM MMM 

g 1 3 6 FVAQHFGQPVERCQFVRPAQQRRHKTLNLVATHRVALFAFGI QX 

190 200 210 220 



230 240 

ml 3 6 . pep HFPFQMGFAPYYRRNAVX 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 527>: 

al36 . seq 

1 ATGGAAACAA ACGCTTCAAT TCTTACCGCA ACACGCCTTG TATTTTCTGC 

51 CGCTGCCGCA CGGACAGGGA TCGTTCCTGC CTGTTTTTTC GCCTTCCCTG 

101 CGGACGGTTT GCGGCTTGTT GATGACCGCC TGCCAGTAGC GGTAGATATC 

151 CGCCAATGCA TAAGGCAACT CGGATTCCAG TTCCGCCAGC TCGCCTTCTG 

201 TGAATTGCAG ACGGATAGTG CCGTTGTCCT CTTCGTCGTA AATACCGCCC 

251 AATGCCATGA TGGGATAAAA CAACTCTTCA AACGCTTCAT CATCGACGGC 

301 TTCAAACCAA TCGGTCGGCA CAATATCCAA ACCGTAAAGA TAAGCATTGC 

351 ACCATGTGTA AAAATCGCTG CCGCCGTCTT CGTTTTCATA CAGCCACAAA 

4 01 TCGGGCAGTT TTTTATCCGA CATCGCGGCG GTTGTTTCCA TCGCCATTGC 

4 51 CAAAACCAGC CGTTCGATTT CGGAACGTTC GGCGGCGGTA AATTGCGATT 

501 CGTCGCCCAA CACTTCGGGC AGCCAGTCGA GCGGTGTCAA TTTGTCCGGC 

551 CCGCTCAACA GCGCCGTCAT AAAACCTTGA ACCTCGTCGC AACGCATCGT 

601 GTTGCCTTGT TCGCTTTTGG CATCCAACAA TTCGCTCAAC CGCCGTTTGG 

651 ATGCTTCGGT AAATTTTCGG GAATCCATCA TTTTCCTTTT CCAATGGGTT 

7 01 TTGCGCCCTA TTATAGTGGA TTAAATTTAA ATCAGGACAA GGCGACGAAG 

7 51 CCGCAGACAG TACAAATAGT ACGGCAAGGC GAGGCAACGC CGTACTGGTT 

8 01 TAAATTTAAT CCACTATATC GCCGCAATGC CGTCTGA 



This corresponds to the amino acid sequence <SEQ ED 528; ORF 

al36-pep 



136.a>: 
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1 METNASILTA TRLVFSAAAA RTGIVPACFF AFPADGLRLV DDRLPVAVDI 

51 RQCIRQLGFQ FRQ LAFCELQ TDSAVVLFW NTAQCHDGIK QLFKRFIIDG 

101 FKPIGRHNIQ TVK ISIAPCV KIAAAVFVFI QPQIGQFFIR HRGGCFHRHC 

151 QNQPFDFGTF GGGKLRFVAQ HFGQPVERCQ FVRPAQQRRH KTLNLVATHR 

201 VALFAFGIQQ FAQPPFGCFG KFSGIHHFPF PMGFAPYYSG LNLNQDKATK 

251 PQTVQIVRQG EATPYWFKFN PLYRRNAV* 

ml36/al36 98.3% identity in 238 aa overlap 

10 20 30 40 50 60 

ml 3 6 pep METNASILTATRLVFSAAAARTGIVPACFFAFPADGLRFVDDCLPVAVDIRQCIRQLGFQ 
| | | M I M I I I I I I M I I I ! I M I! M I M I I I I M M : I M 1 I I ! I I I I 11 M M M I 
a ! 3 6 METN AS I LT ATRLVFS AAAARTG I V P AC FFAFPADGLRLVDDRLPVAV DI RQC I RQLG FQ 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml36 pep FRQLAFCELQTDSAVFLFWNTAQCHDGIKQLFKRFIIDGFKPIGRHNIQTVKISIAPCV 

| M I I I I I I I I I I M I M I I 11 I M I I I M I I ! I ! I I I I I 1 I I . I I I I I I I ! 

al36 FRQLAFCELQTDSAWLFWNTAQCHDGIKQLFKRFIIDGFKPIGRHNIQTVKISIAPCV 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 3 6 pep KIAAAVFVFIQPQIGQFFIRHRGGCFHRHCQNQPFDFGTFGGGKLRFVAQHFGQPVERCQ 

Ml f | M 1 I I I I I I I I I M II I 1 I I I t I I M 1 I I I I I I M M I I 1 I 11 I 

al36 KIAAAVFVFIQPQIGQFFIRHRGGCFHRHCQNQPFDFGTFGGGKLRFVAQHFGQPVERCQ 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml 3 6 pep FVRPAQQRRHKTLNLVATHRVALFAFGIQQFAQPPFGCFGKFSGIHHFPFQMGFAPYYRR 

| | | | | I I I I I I I I I I I I 1 I I I M I I I I I I I I I I I I I I I I I M 1 1 i I M I I I I I 11 I I 
ai36 FVRPAQQRRHKTLNLVATHRVALFAFGIQQFAQPPFGCFGKFSGIHHFPFPMGFAPYYSG 

190 200 210 220 230 240 

ml 3 6. pep NAVX 

a 13 6 LNLNQDKATKPQTVQIVRQGEATPYWFKFNPLYRRNAVX 

250 260 270 



The following partial DNA sequence was identified in N, gonorrhoeae <SEQ ID 529>: 

gl37 . seq 

1 ATGATTATCC ATCACcaaTT CGATCCCGTC CTCATCAGTA TCGGCCCGCT 

51 TGCCGTCCGC TGGTATGCCT TAAGCTACAT CCTCGGATTT ATTCTTTTTA 

101 CCTTTCTCGG CAGAAGGCGC ATCGCGCAAG GCTTGTCCGT TTTTACCAAA 

151 GAATCGCTCG ACGACTTCCT G AC ATGGGG C ATTTTGGGCG TGATTTTGGG 

201 CGGACGCTTG GGCTATGTCC TGTTTTACAA ATTCTCCGAC TACCTCGCCC 

2 51 ATCCGCTTGA TATTTTCAAG GTATGGGAAG GCGGAATGTC GTTCCACGGC 

3 01 GGCTTTTTGG GTGTAGTTAT TGC CAT ATGG TTGTTCAGCC GCAAGCACGG 
3 51 CATCGGCTTC CTCAAACTGA TGGACACGGT CGCGCCGCTC GTTCCGCTGG 
401 GTCTCGCTTC GGGACGTATC GGCAACTT T A TCAACGGCGA ACTTTGGGGA 
451 CGCATTACCG ACATTAACGC ATT TTGGGCA ATGGGCTTCC CGCAAGCGCA 
501 TTACGAAGAT GCCGAAGCCG CCGCGCACAA TCCGCTTTGG GCAGAATGGC 
551 TGCAACAATA CGGTATGCTG CCGCGTCATC CCTCGCAGCT TT AT CAGTTT 
601 GCCCTTGAAG GCATCTGCCT GTTCGCCGTC GTTTGGCTGT TTTCCAAAAA 
651 ACCGCGCCCG ACCGGGCAGA CTGCCGCGCT TTTTCTCGGC GGCTACGGCG 
701 TGTTCCGCTT TATTGCCGAA TTTGCGCGCC AACCCGACGA CTATCTCGGG 
751 CTGCTGACCT TGGGGCTGTC GATGGGGCAA TGGTTGAGCG TCCCGATGAT 
801 TGTTTTGGGT ATCGTCGGCT TTGTCCGGTT CGGCATGAAA AAACAGCACT 
851 GA 

This corresponds to the amino acid sequence <SEQ ID 530; ORF 137.ng>: 

gl37 .pep 
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1 MIIHHQFDPV LISIGPLAVR WYALSYILGF ILFTFLG RRR IAQGLSVFTK 

51 ESLDDFLTWG ILGVILGGRL GYVLFYKFSD YLAHPLDIFK VWEGG MS FHG 

101 GFLGWIAIW LF SRKHGIGF LKLMDTVAPL VPLGLASGRI GNFINGELWG 

151 RITDINAFWA MGFPQAHYED AEAAAHNPLW AEWLQQYGML PRHPSQ LYQF 

2 01 ALEGI CLFAV VWL FSKKPRP TGQTAALFLG GYGVFRFIAE FARQPDDYLG 

2 51 LLTLGLSMGQ W LSVPMIVLG IVGFVRFG MK KQH* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 53 1>: 

ml37 . seq 

1 AT GAT T AC C C 

5 1 TGCCGTCCGC 

101 CCTTTCTCGG 

151 GAATCGCTCG 

201 CGGGCGTTTG 

251 ATCCGCTTGA 

301 GGCTTTTTGG 

3 51 CATCGGCTTC 

4 01 GTCTCGCTTC 
4 51 CGCGTTACCG 
501 TTACGAAGAT 
551 TGCAACAATA 
601 G C ACTTGAAG 
651 ACAGCGGTCG 

7 01 TATTCCGCTT 
751 CTGCTGACCT 

8 01 TGTTTTGGGT 
851 GA 

This corresponds to the amino acid sequence <SEQ ID 532; ORF 137>: 

ml3 7 -pep 

1 MITHPQFDPV LISIGPLAVR WYALSYILGF ILFTFLG RRR IAQGLSVFTK 

51 ESLDDFLTWG ILGVILGGRL GYVLFYKFSD YLAHPLDIFK VWEGG MS FHG 

101 GFLGWIAIR LF GRKHGIGF LKLMDTVAPL VPLGLASGRI GNFINGELWG 

151 RVTD I NAFWA MGFPQARYED AEAAAHNPLW AEWLQQYGML PRHPS QLYQF 

201 ALEGI CLFTV IWL FSKKQRS TGQVASLFLG GYGIFRFIAE FARQPDDYLG 

251 LLTLGLSMGQ W LSVPMIVLG IVGFVRFG MK KQH* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 137 shows 95.4% identity over a 283 aa overlap with a predicted ORF (ORF 137.ng) 
from N. gonorrhoeae: 

ml37/gl37 

10 20 30 40 50 60 

MITHPQFDPVLI S IGPLAVRWYALS YI LGF I LFTFLGRRRI AQGLSVFTKESLDDFLTWG 

1 1 I MM MM MMMMM MMMMMMMMM MMM MMMMMM 

MI I HHQFDPVL I SIGPLAVRWYALSY I LGF I LFTFLGRRRI AQGLSVFTKESLDDFLTWG 
10 20 30 40 50 60 

70 80 90 100 110 120 

ILGVILGGRLGYVLFYKFSDYLAHPLDIFKVWEGGMSFHGGFLGWIAIRLFGRKHGIGF 

1 1 1 1 1 1 M I 1 1 M 1 1 1 M 1 1 1 1 1 1 1 1 1 1 M I II 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 MMMMM 

ILGVILGGRLGYVLFYKFSDYLAHPLDIFKVWEGGMSFHGGFLGWIAIWLFSRKHGIGF 
70 80 90 100 110 120 

130 140 150 160 170 180 

LKLMDTVAPLVPLGLASGR IGNFI NGELWGRVTDI NAFWAMGFPQARYEDAEAAAHNPLW 

M M M MM MMI M M MM MMMMM U M MM M I I I I : I I I I 1 1 I I I I 1 1 I 

LKLMDTVAPLVPLGLASGR I GNF I NGELWGR I TD I NAFWAMGFPQAHYEDAEAAAHNPLW 
130 140 150 160 170 180 

190 200 210 220 230 240 

AEWLQQYGMLPRHPSQLYQFALEGICLFTVIWLFSKKQRSTGQVASLFLGGYGIFRFIAE 



ATCCCCAATT CGATCCCGTC 
TGGTATGCCC TAAGCTACAT 
CAGAAGGCGC ATCGCGCAAG 
ACGACTTCCT GACATGGGGC 
GGTTACGTCC TGTTTTACAA 
TATTTTCAAG GTATGGGAAG 
GTGTAGTTAT TGCCATACGG 
CTCAAACT G A TGGATACGGT 
GGGACGTATC GGCAACTTCA 
ACATCAACGC ATTTTGGGCA 
GCCGAAGCCG CCGCGCACAA 
CGGTATGCTG CCGCGTCATC 
GCATCTGCCT GTTCACCGTC 
AC CGG ACAAG TCGCCTCGCT 
CATTGCCGAA TTCGCACGCC 
TGGGGCTGTC GATGGGGCAA 
ATCGTCGGCT TTGTCCGGTT 



CTTATCAGTA TCGGCCCGCT 
CCTCGGATTT ATTCTTTTTA 
GCTTGTCCGT TTTT AC C AAA 
ATTTTGGGCG TAATTTTGGG 
GTTTTCCGAC TACCTCGCCC 
GCGGAATGTC GTTCCACGGC 
TTGTTCGGCC GCAAACACGG 
CGCACCGCTC GTTCCGCTGG 
TCAACGGCGA ACTTTGGGGA 
ATGGGCTTCC CGCAGGCGCG 
TCCGCTTTGG GCAGAATGGC 
CCTCGCAGCT TTATCAGTTT 
ATTTGGCTGT TCTCTAAAAA 
CTTCCTCGGC GGCTACGGCA 
AACCCGACGA CTATCTCGGG 
TGGTTGAGCG TCCCGATGAT 
CGGCATGAAA AAACAG C ACT 



ml3 7 .pep 
gl37 

ml 3 7 . pep 
gl37 

ml3 7 .pep 
gl37 

ml3 7 .pep 



WO 99/057280 



PCT/US99/09346 



389 



al37 . seq 








1 


ATGATTACCC 


ATCCCCAATT 


CGACCCCGTC 


51 


TGCCGTCCGC 


TGGTATGCCC 


TAAGCTACAT 


101 


CCTTTCTCGG 


CAGAAGGCGC 


ATCGCGCAAG 


151 


GAATCGCTCG 


ACGACTTCCT 


GACATGGGGC 


201 


CGGGCGTTTG 


GGTTACGTCC 


TGTTTTACAA 


251 


ATCCGCTTGA 


TATTTTCAAG 


GTATGGGAAG 


301 


GGCTTTTTGG 


GTGTAGTTAT 


TGCCATATGG 


351 


CATCGGCTTC 


CTCAAACTGA 


TGGACACGGT 


401 


GTCTCGCTTC 


GGGACGTATC 


GGCAACTTCA 


451 


CGCGTTACCG 


ACATCAACGC 


ATTTTGGGCA 


501 


TTACGAAGAC 


CTCGAAGCCG 


CCGCGCACAA 


551 


TGCAACAATA 


CGGTATGCTG 


CCGCGTCATC 


601 


GCACTTGAAG 


GCATCTGCCT 


GTTCGCCGTC 


651 


ACAGCGGCCG 


ACCGGACAAG 


TCGCCTCACT 


701 


TATTCCGCTT 


CATTGCCGAA 


TTTGCACGCC 


751 


CTGCTGACCT 


TGGGGCTGTC 


GATGGGGCAA 


801 


TGTTTTGGGT 


ATCGTCGGCT 


TTGTCCGGTT 


851 


GA 







1 1 1 I M 1 1 I 1 1 M M II 1 1 1 I II 1 1 1 I h h I M II I I I M : I • 1 1 I 1 1 1 1 : M II I I 

gl37 AEWLQQYGMLPRHPSQLYQFALEGICLFAWWLFSKKPRPTGQTAALFLGGYGVFRFIAE 

190 200 210 220 230 240. 

250 260 270 280 

ml37 pep FARQPDDYLGLLTLGLSMGQWLSVPMIVLGIVGFVRFGMKKQHX 

M I I II M 1 1 I I M I M I i I I I I M M I 1 1 I I 1 1 1 1 M I M I M 

gl37 FARQPDDYLGLLTLrGLSMGQWLSVPMIVLGIVGFVRFGMKKQKX 

250 260 270 280 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 533>: 

CTTATCAGTA TCGGCCCGCT 
CCTCGGATTT ATTCTTTTTA 
GCTTGTCCGT TTTTACCAAA 
ATTTTGGGCG TAATTTTGGG 
GTTTTCCGAC TACCTCGCCC 
GCGGAATGTC GTTCCACGGC 
TTGTTCGGTC GCAAACACGG 
CGCACCGCTC GTTCCACTGG 
TCAACGGCGA ACTTTGGGGA 
ATGGGCTTCC CGCAGGCGCG 
TCCGCTTTGG GCAGAATGGC 
CCTCGCAGCT TTATCAGTTT 
GTTTGGCTGT TCTCTAAAAA 
CTTCCTCGGC GGCTACGGCA 
AACCCGACGA CTATCTCGGG 
TGGTTGAGCG TCCCGATGAT 
CGGCATGAAA AAACAGCACT 

This corresponds to the amino acid sequence <SEQ ID 534; ORF 137.a>: 

al37 . pep 

1 MITHPQFDPV LISIGPLAVR WYALSYILGF ILFTFLG RRR IAQGLSVFTK 

51 ESLDDFLTWG ILGVILGGRL GYVLFYKFSD YLAHPLDIFK VWEGGMSFHG 

101 GFLGWIAIW LF GRKHGIGF LKLMDTVAPL VPLGLASGRI GNFINGELWG 

151 RVTDINAFWA MGFPQARYED LEAAAHNPLW AEWLQQYGML PRHPSQLYQF 

201 ALEGICLFAV VWL FSKKQRP TGQVASLFLG GYGIFRFIAE FARQPDDYLG 

251 LLTLGLSMGQ W LSVPMIVLG IVGFVRFG MK KQH* 

ml37/al37 98.2% identity in 283 aa overlap 

10 20 30 40 50 60 

ml37 pep MITHPQFDPVLISIGPLAVRWYALSYILGFILFTFLGRRRIAQGLSVFTKESLDDFLTWG 
I I I I I I I I I M M M I M II II II I M I I I I I I I I I I M I I I I I I M M M I I M I I I I I 
al37 MITHPQFDPVLISIGPLAVRWYALSYILGFILFTFLGRRRIAQGLSVFTKESLDDFLTWG 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml37 pep ILGVILGGRLGYVLFYKFSDYLAHPLDIFKVWEGGMSFHGGFLGVVIAIRLFGRKHGIGF 
} I I | | | M I I I 1 I I II I M I 1 I I I I I I I II I I 1 I M II I I I 1 M I I I I I I M I M II II 
a l37 iLGVILGGRLGYVLFYKFSDYLAHPLDIFKVWEGGMSFHGGFLGWIAIWLFGRKHGIGF 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml37 pep LKLMDTVAPLVPLGLASGRIGNFINGELWGRVTDINAFWAMGFPQARYEDAEAAAHNPLW 

I M M I I II I I I M I I I I I! I I I I I I I I I I I I I I I I I I M I I M I MINIMI 

a l37 LKLMDTVAPLVPLGLASGRIGNFINGELWGRVTDINAFWAMGFPQARYEDLEAAAHNPLW 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml37 .pep AEWLQQYGMLPRHPSQLYQFALEGICLFTVIWLFSKKQRSTGQVASLFLGGYGIFRFIAE 
M M I I I M II I I I 1 I M I I I I I I I I I I : I M I I I I I I I I I I I I I II fl I I I I M M I I 
a 137 AEWLQQYGMLPRHPSQLYQFALEGICLFAWWLFSKKQRPTGQVASLFLGGYGIFRFIAE 
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190 200 210 220 230 240 

250 260 270 280 

ml37 .pep FARQPDDYLGLLTLGLSMGQWLSVPMIVLGIVGFVRFGMKKQHX 
| I I I M I I I 11 M i I I I 11 I M I I I I I I I ! 1 M I ! I I I I M I I 1 
al3 7 FARQPDDYLGLLTLGLSMGQWLSVPMIVLGIVGFVRFGMKKQHX 

250 260 270 280 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 535>: 

gl38 . seq 

1 ATGGAGTTTG AAAAC ATT AT TTCCGCCGCc gaCAAGGCGC GTATCCTTGC 

51 CGAAGCACTG CCTTACAtCC gccgGTTTTC CGGTTCGGTC GCCGTCATCA 

101 AGTATGGCGG CAACGCGATG ACCGAACCTG CCTTGAAAGA AGGGTTTGCC 

151 CGCGATGTCG TGCTGCTGAA GCTGGTCGGC ATTCATCCCG TCATCGTTCA 

201 CGGCGGCGGG CCGCAGATCA ATGCGATGCT TGAAAAAGTC GGCAAAAAGG 

251 GCGAATTTGT CCAAGGAATG CGCGTTACCG ACAAAGAGAC GATGGAT AT T 

3 01 GTCGAAATGG TATTGGGCGG GCACGTCAAC AAGGAAATCG TGT CGATG AT 

3 51 TAACACATAT GGAGGGCACG CGGTCGGCGT GAGCGGGCGC GACGACCATT 

4 01 TCATTAAGGC GAAGAAACTT TTGGTCGATA CGCCCGAACA GAATAGCGTG 
4 51 GACATCGGAC AGGT CGGT AC GGTGGAAAGC AT CG AT AC CG GTTTGGTTAA 
501 AGGGCTGATA GAACGCGGCT GCATTCCCGT CGTCGCCCCC GTCGGCGTAG 
551 GTGAAAAAGG CGAAGCGTTC AACATCAACG CCGATTTGGT GGCAGGCAAA 
601 T TGG CGG AAG AATTGAACGC CGAAAAACTC TTGAT GATG A CGAAtatcgc 
651 cgGTGTGATG GACAAAACGG GCAATCTGCT GACCAAACTC acgCCGAAAC 
701 GGATTGATGG GCTGATTGCC GACGGCACGC TGTATGGCGG TATGCTGCCG 
751 AAAATCGCTT CTGCGGTCGA AGCcgccgtc aACGGTGTGA AAGCCACGCA 
801 CAT CAT CG AC GGCAGGTTGC CCAACGCGCT TTTGCTGGAA ATCTTTACCG 
851 ATGCCGGTAT CGGGT CGATG ATTTTAGGCA GAGGGGAAGA TGCCTGA 

This corresponds to the amino acid sequence <SEQ ID 536; ORF 138.ng>: 

gl38 . pep 

1 MEFENIISAA DKARILAEAL PYIRRFSGSV AVIKYGGNAM TEPALKEGFA 

51 RD WLLKLVG IHPVIVHGGG PQINAMLEKV GKKGEFVQGM RVTDKETMDI 

101 VEMVLGGHVN KEIVSMINTY GGHAVGVSGR DDHFIKAKKL LVDTPEQNSV 

151 DIGQVGTVES IDTGLVKGLI ERGCIPWAP VGVGEKGEAF NINADLVAGK 

201 LAEELNAEKL LMMTNIAGVM DKTGNLLTKL TPKRIDGLIA DGTLYGGMLP 

251 KI AS AVEAAV NGVKATHI ID GRLPN ALLLE I FTDAGIG5M IL GRGEDA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 537>: 

ml38 . seq 

1 ATGGAGTCTG AAAACATTAT TTCCGCCGCC GACAAGGCGC GTATCCTTGC 

51 CGAAGCGCTG CCTTACATCC GCCGGTTTTC CGGTTCGGTC GCCGTCATCA 

101 AATACGGCGG CAACGCGATG ACCGAACCTG CCTTGAAAGA AGGGTTTGCC 

151 CGCGATGTCG TGCTGCTGAA GCTGGTCGGC ATTCATCCCG TCATCGTTCA 

201 CGGCGGCGGG CCGCAGATCA ATGCGATGCT TGAAAAAGTC GGCAAAAAGG 

2 51 GTGAGTTTGT CCAAGGAATG CGCGTTACCG ACAAAGAGGC GATGGATATT 

3 01 GTCGAAATGG TGTTGGGCGG GCATGTCAAT AAAGAAATCG TGTCGATGAT 

3 51 TAACACATAT GGCGGACACG CGGTCGGCGT AAGCGGACGC GACGACCATT 
401 TCATTAAGGC GAAGAAACTT TTGATCGATA CGCCCGAACA GAATGGCGTG 

4 51 GACATCGGAC AGGTCGGTAC GGTGGAAAGC ATCGATACCG GTTTGGTTAA 

5 01 AGGGCTGATA GAACGTGGCT GCATTCCCGT CGTCGCCCCC GTCGGCGTAG 
551 GTGAAAAAGG CGAAGCGTTC AACATCAACG CCGATTTGGT AGCAGGCAAA 
601 TTGGCGGAAG AATTGAACGC CGAAAAACTC TTGATGATGA CGAATATCGC 
651 CGGTGTGATG GACAAAACGG GCAATCTGCT GACCAAACTC ACGCCGAAAC 
701 GGATTGATGA ACTGATTGCC GACGGCACGC TGTATGGCGG TATGCTGCCG 
751 AAAATCGCTT CTGCGGTCGA AGCCGCCGTC AACGGTGTGA AAGCCACGCA 
801 TATCATCGAC GGCAGGTTGC CCAACGCGCT TTTGCTGGAA ATCTTTACCG 
851 ATGCCGGTAT CGGTT CGATG ATTTTGGGCG GTGGGGAAGA TGCCTGA 

This corresponds to the amino acid sequence <SEQ ID 538; ORF 138>: 

ml3 8 -pep 

1 MESENIISAA DKARILAEAL PYIRRFSGSV AVIKYGGNAM TEPALKEGFA 
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51 RDWLLKLVG IHPVIVHGG G PQINAMLEKV GKKGEFVQGM RVTDKEAMDI 

101 VEMVLGGHVN KEIVSMINTY GGHAVGVSGR DDHFIKAKKL LIDTPEQNGV 

151 DIGQVGTVES IDTGLVKGLI ERGCIPWAP VGVGEKGEAF NINADLVAGK 

201 LAEELNAEKL LMMTNIAGVM DKTGNLLTKL TPKRIDELIA DGTLYGGMLP 

251 KI AS AVE AAV NGVKATHIID GRLPN ALLLE IFTDAGIGSM IL GGGEDA* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 138 shows 98.0% identity over a 298 aa overlap with a predicted ORF (ORF 138.ng) 
from N. gonorrhoeae: 

ml38/gl38 

10 20 30 40 50 60 

ml 3 8 . pep MESENI ISAADKARILAEALPYIRRFSGSVAVIKYGGNAMTEPALKEGFARDWLLKLVG 

1 1 1 1 1 1 1 1 1 i i 1 1 M I M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 11 1 1 1 1 1 1 1 1 1 M 1 1 M 1 1 1 1 1 1 1 1 1 

g 1 3 8 MEFENII SAADKARI LAEALP YI RRFSGS VAV I KYGGNAMTEPALKEGFARDWLLKLVG 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml3 8 . pep I HPV I VHGGGPQ I NAMLEKVGKKGEFVQGMRVTDKEAMD I VEMVLGGHVNKE I VSMINTY 

1 I M I M 11 1 1 1 1 1 1 1 1 1 1 ! M ! M 1 1 tt It 1 I M I ^ 11 1 1 1 1 1 M I M 1 1 11 1 1 1 M I I 

gl3 8 IHPVIVTiGGGPQINAMLEKVGKKGEFVQGMRVTDKETMDIVEMVLGGHVNKEIVSMIOTY 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml3 8 .pep GGHAVGVS GRDDH F I KAKKLL I DTPEQNGVDIGQVGTVES IDTGLVKGLI ERGCIPWAP 

I 1 1 I II 1 1 1 II I M I II I II h 1 1 1 1 1 h II I 1 1 M 1 1 1 1 M M 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 

g 1 3 8 GGHAVGVSGRDDHFI KAKKLLVDTPEQNSVDIGQVGTVES IDTGLVKGLI ERGCIPWAP 

130 140 150 160 170 180 



190 * 200 210 220 230 240 

ml 3 8 . pep VGVGE KG E AFN I N ADLV AG KLAE E LN AE KL LMMTN I AG VMD KTGNLLT KLT P KR I DE L I A 

IIIIIMIMMIilMIIIIMIMIIIIIMIMIIIillMMIIIIIIIM! Ml 

gl3 8 VGVGEKGEAFNINADLVAGKLAEELNAEKLLNIMTNIAGVMDKTGNLLTKLTPKRIDGLIA 

190 200 210 220 230 24 0 



250 260 270 280 290 299 

ml 3 8 . pep DGTLYGGMLPKIASAVEAAVNGVKATHI I DGRL PN ALL L E I F TD AG I G S M I LGGGE D AX 

NMIIMIIIIIIIINIMIIIIIIIIIMIIMINIIIINMIIIIII Mill 

gl3 8 DGTLYGGMLPKIASAVEAAVNGVKATHI IDGRLPNALLLE I FTDAGIGSMI LGRGEDAX 

250 260 270 280 290 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 539>: 

al38 . seq 

1 ATGGAGTCTG AAAACATTAT TTCCGCCGCC GACAAGGCGC GTATCCTTGC 

51 CGAAGCGCTG CCTTACATCC GCCGGTTTTC CGGTTCGGTC GCCGTCATCA 

101 AATACGGCGG CAACGCGATG ACCGAACCTG CCTTGAAAGA AGGGTTTGCC 

151 CGCGATGTCG TGCTGCTGAA GCTGGTCGGC ATTCATCCCG TCATCGTTCA 

201 CGGCGGCGGG CCGCAGATCA ATGCGATGCT TGAAAAAGTC GGCAAAAAGG 

251 GTGAGTTTGT CCAAGGAATG CGCGTTACCG ACAAAGAGGC GATGGATATT 

301 GTCGAAATGG TGTTGGGCGG GCATGTCAAT AAAGAAATCG TGTCGATGAT 

351 TAACACATAT GGCGGACACG CGGTCGGCGT AAGCGGACGC GACGACCATT 

4 01 TCATTAAGGC GAAGAAACTT TTGATCGATA CGCCCGAACA GAATGGCGTG 

4 51 GACATCGGAC AGGTCGGTAC GGTGGAAAGC ATCGATACCG GTTTGGTTAA 

501 AGGGCTGATA GAACGTGGCT GCATTCCCGT CGTCGCCCCC GTCGGCGTAG 

551 GTGAAAAAGG CGAAGCGTTC AACATCAACG CCGATTTGGT AGCAGGCAAA 

601 TTGGCGGAAG AATTGAACGC CGAAAAACTC TTGATGATGA CGAATATCGC 

651 CGGTGTGATG GACAAAACGG GCAATCTGCT GACCAAACTC ACGCCGAAAC 

701 GGATTGATGA ACTGATTGCC GACGGCACGC TGTATGGCGG TATGCTGCCG 

7 51 AAAATCGCTT CTGCGGTCGA AGCCGCCGTC AACGGCGTGA AAGCCACGCA 

801 TATCATCGAC GGCAGGGTGC CCAACGCGCT TTTGCTGGAA ATCTTTACCG 

851 ATGCCGGTAT CGGTTCGATG ATTTTGGGCG GTGGGGAAGA TGCCTGA 
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This corresponds to the amino acid sequence <SEQ ED 540; ORF 138.a>: 

al38.pep 

1 MESENIISAA DKARILAEAL PYIRRFSGSV AVIKYGGNAM TEPALKEGFA 

51 RD WLLKLVG IHPVIVHGG G PQINAMLEKV GKKGE FVQGM RVTDKEAMDI 

101 VEMVLGGHVN KEIVSMINTY GGHAVGVSGR DDHFIKAKKL LIDTPEQNGV 

151 DIGQVGTVES IDTGLVKGLI ERGCIPVVAP VGVGEKGEAF NINADLVAGK 

201 LAEELNAEKL LMMTN I AGVM DKTGN LLTKL TPKRIDEL1A DG TLYGGMLP 

251 KIASAVEAAV NGVKATHIID GRVPNALLLE I FT DAG I GSM ILGGGEDA* 

ml38/al38 99.7% identity in 298 aa overlap 

10 20 30 40 50 60 

ml38 . pep MESENI ISAADKARILAEALPYIRRFSGSVAVIKYGGNAMTEPALKEGFARDWLLKLVG 

I I I I I I I I I M I I M I I I I I II M I i I II I I I M M I I M M I I It I I I M M I I II i I I 
al38 MESENI ISAADKARILAEALPYIRRFSGSVAVIKYGGNAMTEPALKEGFARDWLLKLVG 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml38 .pep I HPV I VHGGGPQINAMLEKVGKKGEFVQGMRVTDKE AMD I VEMVLGGHVN KEIVSMINTY 

II II I I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I M I I I I I I I I M I I I I I 
al38 I HPV I VHGGGPQINAMLEKVGKKGEFVQGMRVTDKEAMDI VEMVLGGHVN KEIVSMINTY 

70 80 90 100 110 120 



130 140 150 160 170 180 

m!38 . pep GGHAVGVSGRDDHFIKAKKLLIDTPEQNGVDIGQVGTVESIDTGLVKGLIERGCIPWAP 
I I I I I II I I I I I I I I I M I I I I I I I I I I I 1 M I II I II I I M I I I I I I I I I I I I I I I I I I 
al38 GGHAVGVSGRDDHFIKAKKLLIDTPEQNGVDIGQVGTVES IDTGLVKGLI ERGCIPVVAP 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml38 . pep VGVGEKGEAFNINADLVAGKLAEELNAEKLLMMTNIAGVMDKTGNLLTKLTPKRI DELIA 

I! I I I I I M i M I I II I I I M I I II I I M M M 1 I M I I I I I I I I I I I M I I M I M I M 
al38 VGVGEKGEAFNINADLVAGKLAEELNAEKLLMMTNIAGVMDKTGNLLTKLTPKRI DELIA 

190 200 210 220 230 240 



250 260 270 280 290 299 

ml 38. pep DGTLYGGMLPKIASAVEAAVNGVKATHIIDGRLPNALLLEIFTDAGIGSMILGGGEDAX 
I t I I I I I I 1 I I I I I I II I II M I I I I I I I M I : It II I I I I I I I I I I I I I II I I I I I I I 
al38 DGTLYGGMLPKIASAVEAAVNGVKATHIIDGRVPNALLLEIFTDAGIGSMILGGGEDAX 

250 260 270 280 290 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 54l>: 

gl39 . seq 

1 ATGCGAACCA CCTCAACCTT CCCTACAAAA ACTTTCAAAC CGGCTGCCAT 

51 GGCGTTAGCT GTTGCAACAA CACTTTCTGC CTGCTTAggc ggcggcggag 

101 gcGGCACTTC TGCTCCCGAC TTTAATGCAG GCGGCACCGG TATCGGCAGC 

151 AACAGCAGGG CAACGATAGC GGAAT C AG C A GCAGTATCTT ACGCCGGTAT 

201 AAAAAACGAA ATGTGCAAAG ACAG AAG CAT GCTCTGTGCC GGTCGGGATG 

251 ACGTTGCGGT TACAGACAGG GATGCCAAAA TCAAAGCCCC CCGAATCTGC 

301 ATACCGGAGA CTTTTCAAAC CCAAATGACC AATATTAAGA ATATGATCAA 

3 51 CCTCAAACCT GCAATTGAAG CAGGCTATAC AGGACGCGGG GTAGAGGTAG 

401 GTATCGTCGA TACAGGCGAA TCCGTCGGCA GCATATCCTT TCCCGAACTG 

451 TATGGCAGAA AAGAACACGG CTATAACGAA AATTACAAAA ACAAAT T AC A 

501 AAAACTATAC GGCGTATATG CGGAAGGAAG CGCCTGA 

This corresponds to the amino acid sequence <SEQ ID 542; ORF 138.ng>: 

gl3 9 .pep 

1 MRTTSTFPTK TFKPAAMA LA VATTLSACLG GGGGGTSAPD FNAGGTGIGS 

51 NSRATIAESA AVSYAGIKNE MCKDRSMLCA GRDDVAVTDR DAKI KAPRIC 

101 I PETFQTQMT NIKNMINLKP A I EAGYTGRG VEVGIVDTGE SVGSISFPEL 

151 YGRKEHGYNE NYKNKLQKLY GVYAEGSA* 
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The following partial DNA sequence was identified in N, meningitidis <SEQ ID 543>: 

ml3 9 . seq 

1 ATGCGAACGA CCCCAACCTT. CCCTACAAAA ACTTTCAAAC CGACTGCCAT . 

51 GGCGTTAGCT GTTGCAACAA CACTTTCTGC CTGCTTAGGC GGCGGCGGAG 

101 GCGGCACTTC TGCGCCCGAC TTCAATGCAG GCGGTACCGG TATCGGCAGC 

151 AACAGCAGAG CAACAACAGC GAAATCAGCA GCAGTATCTT ACGCCGGTAT 

201 CAAGAACGAA ATGTGCAAAG ACAGAAGCAT GCTCTGTGCC GGTCGGGATG 

251 ACGTTGCGGT TACAGACAGG GATGCCAAAA TCAATGCCCC CCCCCGAATC 

3 01 TGCAT A CCGG AGACTTTCCA AACCCAAATG ACGCATtACA AGAATTTGAT 
351 CAACCTCAAA CCTGCAATTG AAGCAGGCTA TACAGGACGC GGGGTAGAGG 

4 01 TAGGTATCGT CGACACAGGC GAATCCGTCG GCAGCATATC CTTTCCCGAA 
451 CTGTATGGCA GAAAAGAACA CGGCTATAAC GAAAATTACG AAAAAC T AT A 
501 CGGCGTATAT GCGGAAGGAA GCGCCTGA 

This corresponds to the amino acid sequence <SEQ ID 544; ORF 138>: 

ml3 9 .pep 

1 MRTTPTFPTK TFKPTAMA LA VATTLSACLG GGGGGTSAPD FNAGGTGIGS 

51 NSRATTAKSA AVSYAGIKNE MCKDRSMLCA GRDDVAVTDR DAKINAPPRI 

101 CIPETFQTQM THYKNLINLK PAI EAGYTGR GVEVGIVDTG ESVGSISFPE 

151 LYGRKEHGYN ENYEKLYGVY AEGSA* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 138 shows 92.2% identity over a 179 aa overlap with a predicted ORF (ORF 138.ng) 
from N. gonorrhoeae: 

ml39/gl39 

10 20 30 40 50 60 

ml3 9 .pep 
g!39 



ml3 9 .pep 
gl39 



ml3 9 .pep 
gl39 

120 130 140 150 160 170 

The following partial DNA sequence was identified in N. meningitidis <SEQ ED 545>: 

al39 . seq 

1 ATGCGAACGA CCCCAACCTT CCCTACAAAA ACTTTCAAAC CGGCTGCCAT 

51 GGCGTTAGCT GTTGCAACAA CACTTTCTGC CTGCTTAGGC GGCGGCGGAG 

101 GCGGCACTTC TGCGCCCGAC TTCAATGCAG GCGGCACCGG TATCGGCAGC 

151 AACAGCAGGG CAACAACAGC GAAATCAGCA GCAATATCTT ACGCCGGTAT 

201 CAAGAACGAA ATGTGCAAAG ACAGAAGCAT GCTCTGTGCC GGTCGGGATG 

251 ACGTTGCGGT TACAGACAGG GATGCCAAAA TCAATGCCCC CCCCCGAATC 

301 TGCAT ACCGG AGACTTTACA AACCCAAATG ACGCAT . ACA AGAATTTGAT 

351 CAACCTCAAA CCTGCAATTG AAGCAGGCTA TACAGGACGC GGGGTAGAGG 

4 01 TAGGTATCGT CGACACAGGC GAATCCGTCG GCAGCATATC CTTTCCCGAA 

4 51 CTGTATGGCA GAAAAGAACA CGGCTATAAC GAAAATTAC. AAAAAC TATA 

501 CGGCGTATAT GCGGAAGGAA GCGCCTGA 

This corresponds to the amino acid sequence <SEQ ID 546; ORF 139.a>: 

al39 . pep 

1 MRTTPTFPTK TFKPAAMALA VATTLSACLG GGGGGTSAPD FNAGGTGIGS 



MRTTPTFPTKTFKPTAMALAVATTLSACLGGGGGGTSAPDFNAGGTGIGSNSRATTAKSA 

llll MIIIIMhIIMIMIIIIMIMimillMIIMMIIMMIM hll 

MRTTSTFPTKTFKPAAMALAVATTLSACLGGGGGGTSAPDFNAGGTGIGSNSRATIAESA 
10 20 30 40 50 60 

70 80 90 100 110 120 

AVSYAGIKNEMCKDRSMLCAGRDDVAVTDRDAKINAPPRICIPETFQTQMTHYKNLINLK 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 II M M I M 1 1 II h 1 1 llllllllllllh Ihllll 

AVS YAG I KNEMC KDRSMLCAGRDDVAVTDRDAKI KAP - RI CI PETFQTQMTNI KNMINLK 
70 80 90 100 110 

130 140 150 160 170 
PAI EAGYTGRGVEVGI VDTGESVGS I S FPELYGRKEHGYNENY E KLYGVYAEGS AX 

1 1 1 1 1 M I II 1 1 M M M 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 M 1 1 1 1 1 1 1 : 1 1 1 II II II 1 1 1 

PAI EAGYTGRGVEVGI VDTGESVGS I S FPELYGRKEHGYNENY KNKLQ KLYGVYAEGS AX 
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51 NSRATTAKSA AIS YAGIKNE MCKDRSMLCA GRDDVAVTDR DAKINAPPRI 
101 CIPETLQTQM THXKNLINLK PAIEAGYTGR GVEVGIVDTG ESVGSISFPE 
151 LYGRKEHGYN ENYXKLYGVY AEGSA* 

ml39/al39 97.1% identity in 175 aa overlap 

10 20 30 40 50 60 

ml 3 9 .pep MRTTPTFPTKTFKPTAMAIAVATTLSACLGGGGGGTSAPDFNAGGTGIGSNSRATTAKSA 
I I II I I I 1 I I I I I I : M I I I I M II I t I I I I I I i I i I I I II I M I I II I It I I It I II I I 
al3 9 MRTTPTFPTKT FKPAAMALAVATTLSACLGGGGGGTSAPDFNAGGTGIGSNSRATTAKSA 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml39 .pep AVSYAGIKNEMCKDRSMLCAGRDDVAVTDRDAKINAPPRICIPETFQTQMTHYKNLINLK 

I : I I I I I I I II I I I I II I I I II I I I I t I I 1 I I I I II I I I M I I II : I I I M I I i I I I I i 
al39 AISYAGIKNEMCKDRSMLCAGRDDVAVTDRDAKINAPPRICIPETLQTQMTHXKNLINLK 

70 80 90 100 110 120 

130 140 150 160 170 

ml39.pep PAIEAGYTGRGVEVGIVDTGESVGSISFPELYGRKEHGYNENYEKLYGVYAEGSAX 

II I I I I I I I I I I M M II I I I M I I I I I I I II I I It I I I I I I I I I I I I I II I I I I 
al3 9 PAIEAGYTGRGVEVGIVDTGESVGSISFPELYGRKEHGYNENYXKLYGVYAEGSAX 

130 140 150 160 170 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 547>: 

gl40 . seq 

1 Atgtcggcac gCGGCAAGGG GGCAGgctat ctcAACAGTA CCGGACGACa 

51 TGTTCCCTTC CTGAGTGCCG CCAAAATCGG GCAGGATTAT TCTTTCTTCA 

101 AAAATATCAA AACCGACGGC GGTCTGCTGG CTTCCCTCGA CAGCGTCGAA 

151 AAAACAGCGG GCAGTGAAGG CGACACGCCG TCCTATTATG TCCGTCGCGG 

2 01 CAATGCGGCA CGGACTGCTT CGGCAGCGGC ACATTCCGCG CCCGCCGGTC 
251 TGAAACACGC CGTAGAACAG GGCGGCAGCA AT CTGGAAAA CCTGATGGTC 

3 01 GAGCTGGATG CCTCCGAATC ATCCGCAACA CCCGAGACGG TTGAAACTGC 
3 51 GGTCGCCGAC CGCACAGATA TGCCGGGCAT CCGCCTACGG CGCACAACTT 
401 TCCGCACAGC GGCAGCCGTA CAGCATGCGA ATACCGCCGA CGGCGTACGc 
451 aTCTTcaaCA GTCTCGCCGC TAccgTCTAt GccgACAGTG CCGCCGCCCA 
501 TGccgATATG CAGGGACGCC GCCTGAAAGC CGTATCGGAC GGGTTGGACC 
551 ACAACGGTAC GGGTCTGCGC GTCATCGCGC AAACCCAACA GGACGGTGGA 
601 ACGTGGGAAC AGGGCGGTGT CGAAGGCAAA ATGCGCGGCA GTACCCAAAC 
651 TATCGGCATT GCCGCGAAAA CCGGCGAAAA TACGACAGCA GCCGCCACAC 
701 TGGGCATAGG ACGCAGCACA TGGAGCGAAA ACAGTGCAAA TGCAAAAACC 
751 G ACAG CAT T A GTCTGTTTGC AGGCATACGG CACGATGTGG GCGATATCGG 
8 01 CT AT CT C AAA GGCCTGTTCT CctaCGGACG CTACAAAAAC AGCATCAGCC 
851 GCAGCACCGG TGCGGATGAA TATGCGGAAG GCAGCGTCAA CGGCACGCTG 
901 ATGCAGCTGG GCGCACTGGG TGGTGTCAAC GTTCCGTTTG CCGCAACGGG 
951 AGATTTGACG GTTGAAGGCG GTCTGCGCCA CGACCTGCTC AAACAGGATG 

1001 CATTCGCCGA AAAAGGCagt GCTTTGGGCT GGAGCGGCAA CAGCCTCACT 

1051 GAAGGCACAC TGGTCGGACT CGCGGGT CTG AAACTGT CGC AACCCTTGAG 

1101 CGATAAAGCC GTCCTGTCTG CGACGGCGGG CGTGGAACGC GACCTGAACG 

1151 G ACG C G ACT A CGCGGTAACG GGCGGCTTTA CCGGCGCGGC TGCAGCAACC 

12 01 GGCAAGACGG GTGCACGCAA TATGCCGCAC ACCCGCCGGG TTGCCGGTCT 
1251 GGGGGTGGAT GTCGAATTCG GCAACGGCTG GAACGGCTTG GCACGTTACA 
1301 GCTACACCGG TTCCAAACAG TACGGCAACC ACAG CGGACA AAT CGGCGT A 

13 51 GGCTACCGGT TCTGA 

This corresponds to the amino acid sequence <SEQ ID 548; ORF 140.ng>: 

gl4 0 . pep 

1 MS ARG KG AG Y LNSTGRHVPF LSAAKIGQDY SFFKNIKTDG GLLASLDSVE 

51 KTAGSEGDTP SYYVRRGNAA RTASAAAHSA PAGLKHAVEQ GGSNLENItMV 

101 ELDASESSAT PETVETAVAD RTDMPGIRLR RTTFRTAAAV QHANTADGVR 

151 IFNSLAATVY ADSAAAHADM QGRRLKAVSD GLDHNGTGLR VIAQTQQDGG 

2 01 TWEQGGVEGK MRGSTQTIGI AAKTGENTTA AATLGIGRST WSENSANAKT 



WO 99/057280 



PCT/US99/09346 

395 



251 DSISLFAGIR HDVGDIGYLK GLFSYGRYKN SISRSTGADE YAEGSVNGTL 
301 MQLGALGGVN VPFAATGDLT VEGGLRHDLL KQDAFAEKGS ALGWSGNSLT 
351 EGTLVGLAGL KLSQPLSDKA VL S AT AG VE R DLNGRDYAVT GGFTGAAAAT 
401 GKTGARNMPH TRRVAGLGVD VEFGNGWNGL ARYSYTGSKQ YGNHSGQIGV 
451 GYRF* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 549>: 

ml40 . seq 

1 ATGTCGGCAC GCGGCAAGGG GGCAGGCTAT CTCAACAGTA CCGGACGACG 

51 TGTTCCCTTC CTGAGTGCCG CCAAAATCGG G C AGG ATT AT TCTTTCTTCA 

101 CAAACATCGA AACCGACGGC GGCCTGCTGG CTTCCCTCGA CAGCGTCGAA 

151 AAAACAGCGG GCAGTGAAGG CGACACGCTG TCCTATTATG TCCGTCGCGG 

2 01 CAATGCGGCA CGGACTGCTT CGGCAGCGGC ACATTCCGCG CCCGCCGGTC 

251 TGAAACACGC CGTAGAACAG GGCGGCAGCA ATCTGGAAAA CCTGATGGTC 

301 GAACTGGATG CCTCCGAATC AT C CGCAAC A CCCGAGACGG TTGAAACTGC 

351 GGCAGCCGAC CGCACAGATA TGCCGGGCAT CCGCCCCTAC GGCGCAACTT 

401 TCCGCGCAGC GGCAGCCGTA CAGCATGCGA ATGCCGCCGA CGGTGTACGC 

451 ATCTTCAACA GTCTCGCCGC TACCGTCTAT GCCGACAGTA CCGCCGCCCA 

501 TGCCGATATG CAGGGACGCC GCCTGAAAGC CGTATCGGAC GGGTTGG AC C 

551 ACAACGGCAC GGGTCTGCGC GTCATCGCGC AAACCCAACA GGACGGTGGA 

601 ACGTGGGAAC AGGGCGGTGT TGAAGGCAAA ATGCGCGGCA GTACCCAAAC 

651 CGTCGGCATT GCCGCGAAAA CCGGCGAAAA TACGACAGCA GCCGCCACAC 

701 TGGGCATGGG ACGCAGCACA TGGAGCGAAA ACAGTG CAAA TGCAAAAACC 

751 GACAGCATTA GTCTGTTTGC AGGCATACGG CACGATGCGG G CG AT AT C GG 

801 CT AT CT CAAA GGCCTGTTCT CCTACGGACG CTACAAAAAC AGCATCAGCC 

851 GCAGCACCGG TGCGGACGAA CATGCGGAAG GCAGCGTCAA CGGCACGCTG 

901 ATGCAGCTGG GCGCACTGGG CGGTGTCAAC GTTCCGTTTG CCGCAACGGG 

951 AGATTTGACG GTCGAAGGCG GTCTGCGCTA CGACCTGCTC AAACAGGATG 

1001 ' CATTCGCCGA AAAAGGCAGT GCTTTGGGCT GGAGCGGCAA CAGCCTCACT 

1051 GAAGGCACGC TGGTCGGACT CGCGGGTCTG AAGCTGTCGC AACCCTTGAG 

1101 CGATAAAGCC GTCCTGTTTG CAACGGCGGG CGTGGAACGC GACCTGAACG 

1151 GACGCGACTA CACGGTAACG GGCGGCTTTA CCGGCGCGAC TGCAGCAACC 

12 01 GGCAAGACGG GGGCACGCAA TATGCCGCAC ACCCGTCTGG TTGCCGGCCT 
1251 GGGCG CGG AT GTCGAATTCG GCAACGGCTG GAACGGCTTG GCACGTTACA 

13 01 GCTACGCCGG TTCCAAACAG TACGGCAACC ACAGCGGACG AGTCGGCGTA 
13 51 GGCTACCGGT TCTGA 

This corresponds to the amino acid sequence <SEQ ED 550; ORF 140>: 

ml4 0 -pep 

1 MSARGKGAGY LNSTGRRVPF LSAAKIGQDY SFFTNIETDG GLLASLDSVE 
51 KTAGSEGDTL SYYVRRGNAA RTASAAAHSA PAGLKHAVEQ GGSNLENLMV 
101 ELDASESSAT PETVETAAAD RTDMPGIRPY GAT FRAAAAV QHANAADGVR 
151 IFNSLAATVY ADSTAAHADM QGRRLKAVSD GLDHNGTGLR VIAQTQQDGG 
201 TWEQGGVEGK MRGSTQTVGI AAKTG ENTT A AATLGMGRST WSENSANAKT 
251 DSISLFAGIR HDAGDIGYLK GLFSYGRYKN SISRSTGADE HAEGSVNGTL 

3 01 MQLGALGGVN VPFAATGDLT VEGGLRYDLL KQDAFAEKGS ALGWSGNSLT 
351 EGTLVGLAGL KLSQPLSDKA VLFATAGVER DLNGRDYTVT GGFTGATAAT 

4 01 GKTGARNMPH TRLVAGLGAD VEFGNGWNGL ARYSYAGSKQ YGNHSGRVGV 
451 G YRF * 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from 7V. gonorrhoeae 

ORF 140 shows 94,5% identity over a 454 aa overlap with a predicted ORF (ORF 140.ng) 
from N. gonorrhoeae: 

ml40/gl40 

10 20 30 40 50 60 

MS ARG KGAG Y LN S TGRR V P FLS AAK I GQDYSFFTN I ETDGGLLASLDSVE KTAGSEGDTL 

i I I l: I I I M M I !iM I II' ; I Ml I I I M I I : 1 1 II 1 1 1 I M 11 I I I II II I I I 

MS ARGKGAG YLN STGRHVP FLS AAKI GQDY SFFKNI KTDGGLLAS LDS VE KTAGS EGDTP 
10 20 30 40 50 60 

70 80 90 100 110 120 



ml40 .pep 
gl40 
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ml4 0 . pep 
gl40 



SYWRRGNAARTASAAAHSAPAGLKHAVEQGGSNLENLMVELDASESSATPETVETAAAD 

1 M I I M I I M ! 1 1 1 1 I 1 1 M 1 I E t I I M 1 11 I I I I 1 I 1 1 i t I I I 1 1 M 1 I 1 1 1 M 1 = I I 

SYYVRRGNAARTASAAAHSAPAGLKHAVEQGGSNLENLMVELDASESSATPETVETAVAD 
70 80 90 100 110 120 



130 140 150 160 170 180 

ml4 0 pep R TDM PG I R P YG AT F RAAAAVQHANAADGVR I FN S LAATV Y AD S T AAHADMQG RRL KAV S D 

M M I II I :|||:|llllMhMMMMIIMIIIIIhllllMIII!llllll 

gl40 RTDM PG I RLRRTT FRT AAAVQHANT ADG VR I FNS LAATV Y ADS AAAHADMQGRRL KAVSD 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml4 0 pep GLDHNGTGLRV I AQTQQDGGTWEQGGVEGKMRGSTQTVG I AAKTGENTT AAATLGMGRST 

llllllllllllMIIIIIIMIIIIMMIIIIIIhllllllllllMllllhllll 

g l4 0 GLDHNGTGLRVIAQTQQDGGTWEQGGVEGKMRGSTQTIGIAAKTGENTTAAATLGIGRST 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml40 pep WSENSANAKTDSISLFAGIRHDAGDIGYLKGLFSYGRYKNSISRSTGADEHAEGSVNGTL 

MIIIMIIMIIIIIMMIhllllM Mlllllll 1 1 1 1 1 1 I = I i 1 1 M I 1 1 

g 1 4 o WSENS ANAKTDS I S LFAG I RHDVGDI GYLKGLFS YGRYKNS I S RSTGADE YAEGS VNGTL 

250 260 270- 280 290 300 



310 320 330 340 350 360 

ml4 0 pep MQLGALGGVNVPFAATGDLTVEGGLRYDLLKQDAFAEKGSALGWSGNSLTEGTLVGLAGL 

I M 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 M ! = f I M 1 1 1 1 1 1 1 1 1 1 1 I I 1 1 1 1 1 M M 1 1 M 1 1 I 

g ! 4 0 MQLGALGGVNVP FAATGDLTVEGGLRHDLLKQDAFAEKGS ALGWSGNS LTEGTLVGLAGL 

310 320 330 340 350 360 

3 70 380 3 90 400 410 420 

ml 4 0 pep KLSQPLSDKAVLFATAGVERDLNGRDYTVTGGFTGATAATGKTGARNMPHTRLVAGLGAD 

I M 1 1 1 1 1 1 I 1 1 I 1 1 1 1 1 1 1 1 M I I I = I 1 1 1 M 1 1 = I I 1 1 1 ! I I M 1 1 1 1 k 11111 = 1 

gl4 o KLSQPLSDKAVLSATAGVERDLNGRDYAVTGGFTGAAAATGKTGARNMPHTRRVAGLGVD 

370 380 390 400 410 420 



430 440 450 

ml 4 0 pep VE FGNG WNGLAR Y S Y AG S KQ YGNH S GRVGVG YR FX 

M IN I III MMhi UN MIM-MI'I I I 

g i4 0 VEFGNGWNGLARYSYTGSKQYGNHSGQIGVGYRFX 

430 440 450 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 55 1>: 

al40 . seq 

1 ATGTCGGCAG GCGGTAAGGG GGCAGGCTAT CTCAACCGTA CCGGACAACG 

51 TGTTCCCTTC CTGAGTGCCG CCAAAATCGG GCGGGATTAT TCTTTCTTCA 

101 CAAACATCGA AACCGACGGC GGTCTGCTGG CTTCCCTCGA CAGCGTCGAA 

151 AAAACAGCGG GTAGTGAAGG CGACACGCTG TCCTATTATG TCCGTCGCGG 

201 CAATGCGGCA CGGACTGCTT CGGCAGCGGC ACATTCCGCG CCCGCCGGTC 

251 TGAAACACGC CGTAGAACAG GGCGGCAGCA ATCTGGAAAA CCTGATGGTC 

301 GAACTGGATG CCTCCGAATC ATCCGCAACA CCCGAGACGG TTGAAACTGC 

351 GGCCGCCGAC CGCACAGATA TGCCGGGCAT CCGCCCCTAC GGCGCAACTT 

4 01 TCCGCGCAGC GGCAGCCGTA CAGCATGCGA ATGCCGCCGA CGGTGTACGC 

4 51 AT CTTC AAC A ATCTCGCCGC TACCGTCTAT GCCGACAGTA CCGCCGCCCA 

501 TGCCGATATG CAGGGACGCC GCCTGAAAGC CGTATCGGAC GGGTTGGACC 

551 ACAACGCTAC GGGTCTGCGC GTCATCGCGC AAACCCAACA GGACGGTGGA 

601 ACGTGGGAAC AGGGCGGTGT TGAAGGCAAA ATGCGCGGCA GTACCCAAAC 

651 CGTCGGCATT GCCGCGAAAA CCGGCGAAAA TACGACAGCA GCCGCCACAC 

7 01 TGGGCATGGG ACACAGCACA TGGAGCGAAA ACAGTGCAAA TGCAAAAACC 

7 51 GACAGCATTA GTCTGTTTGC AGGCATACGG CACGATGCGG GCGATATCGG 
801 CT AT CT C AAA GGCCTGTTCT CCTACGGACG CTACAAAAAC AGCATCAGCC 

8 51 GCAGCACCGG TGCGGACGAA CATGCGGAAG GCAGCGTCAA CGGCACGCTG 
901 ATGCAGCTGG GCGCACTGGG CGGTGTCAAC GTTCCGTTTG CCGCAACGGG 
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951 AGATTTGACG GTCGAAGGCG 

1001 CATTCGCCGA AAAAGGCAGT 

1051 GAAGGCACAC TGGTCGGACT 

1101 CGATAAAGCC GTCCTGTTTG 

1151 GACGCGACTA CACGGTAACG 

1201 GGCAAGACGG GGGCACGCAA 

1251 GGGCGCGGAT GTCGAATTCG 

1301 GCTACGCCGG TTCCAAACAG 

1351 GGCTACCGGT TCTGA 



GTCTGCGCTA CGACCTGCTC AAACAGGATG 
GCTTTGGGCT GGAGCGGCAA CAGCATCACT 
CGCGGGTCTG AAGCTGTCGC AACCCTTGAG 
CAACGGCGGG CGTGGAACGC GACCTGAACG 
GGCGGCTTTA CCGGCGCGAC TGCAGCAACC 
TATGCCGCAC ACCCGCCTGG TTGCCGGTCT 
GCAACGGCTG GAACGGCTTG GCACGTTACA 
TACGGCAACC ACAGCGGACG AGTCGGCGTA 



This corresponds to the amino acid sequence <SEQ ID 552; ORF 140. a>: 

al40 . pep 

1 MSAGGKGAGY LNRTGQRVPF LSAAKIGRDY SFFTNIETDG GLLASLDSVE 

51 KTAGSEGDTL SYYVRRGNAA RTASAAAHSA PAGLKHAVEQ GGSNLENLMV 

101 ELDASESSAT PETVETAAAD RTDMPGIRPY GAT FRAAAAV QRANAADGVR 

151 IFNNLAATVY ADSTAAHADM QGRRLKAVSD GLDHNATGLR VIAQTQQDGG 

201 TWEQGGVEGK MRGSTQTVGI AAKTGENTTA AATLGMGHST WSENSANAKT 

251 DSISLFAGIR HDAGDIGYLK GLFSYGRYKN SISRSTGADE HAEGSVNGTL 

301 MQLGALGGVN VPFAATGDLT VEGGLRYDLL KQDAFAEKGS ALGWSGNSIT 

351 EGTLVGLAGL KLSQPLSDKA VLFATAGVER DLNGRDYTVT GGFTGATAAT 

4 01 GKTGARNMPH TRLVAGLGAD VEFGNGWNGL ARYSYAGSKQ YGNHSGRVGV 

4 51 GYRF* 

ml40/al 40 98.2% identity in 454 aa overlap 

10 20 30 40 50 60 

ml 4 0 . pep MSARGKGAGYLNSTGRRVPFLSAAKIGQDYSFFTNIETDGGLLASLDSVEKTAGSEGDTL 
111 I 11 I M I I II : I I I I I I I i I I I : I I I I I M I 1 I II I I M II I I I I I I II I 1 1 I I I 
al40 MSAGGKGAGYLNRTGQRVPFLSAAKIGRDYSFFTNIETDGGLLASLDSVEKTAGSEGDTL 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml 40 .pep S Y YVRRGN AART AS AAAHSAPAGLKHAVEQGGSNLENLMVELDASES SAT PETVETAAAD 

M M I I [ I M I I I I I! f i I I I I t I I I M II i I I I M I II I I I I I II I I I I I M I I I I 1 I f 
al4 0 SYYVRRGNAARTASAAAHSAPAGLKHAVEQGGSNLENLMVELDASES SAT PETVETAAAD 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml 4 0 . pep RTDMPGIRPYGATFRAAAAVQHANAADGVRIFNSLAATVYADSTAAHADMQGRRLKAVSD 
1 I I I I I I I I I I I I I I I I I I M I I I I I M I I I I I : I I I I I I I I I I I I I I I I I I I M M I I I 
al4 0 RTDMPGIRPYGATFRAAAAVQHANAADGVR IFNNLAATVY AD ST AAHADMQGRRLKAVSD 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml 4 0 . pep GLDHNGTGLRVI AQTOQDGGTWEQGGVEGKMRGSTQTVGIAAKTGENTTAAATLGMGRST 

I I I I I : 1 I I I I I I I I I I ! I 11 I! I I I I I I I I I I II I I I I I I I I I I I I II I I I I I I i I : II 
al4 0 GLDHNATGLRVIAQTQQDGGTWEQGGVEGKMRGSTQTVGIAAKTGENTTAAATLGMGHST 

190 200 210 220 230 240 



250 260 270 280 290 300 

ml40.pep WSENSANAKTDSISLFAGIRHDAGDIGYLKGLFSYGRYKNSISRSTGADEHAEGSVNGTL 
I I I I I I I 1 I I I I I I I I 1 I I I I I M I I I I I I 1 I M I I 1 I i I I I I I I It M I I I I I M M I I 
al4 0 WSENSANAKTDSISLFAGIRHDAGDIGYLKGLFSYGRYKNSISRSTGADEHAEGSVNGTL 
250 260 270 280 290 300 



310 320 330 340 350 360 

ml40 . pep MQLGALGGVNVPFAATGDLTVEGGLRYDLLKQDAFAEKGSALGWSGNSLTEGTLVGLAGL 
I 1 I I I I I II I II I 11 I I I I I I M I M I I I I I I I I I II I I I I I I I! I i I : I I I I I M I I I I 
a!4 0 MQLGALGGVNVPFAATGDLTVEGGLRYDLLKQDAFAEKGSALGWSGNSITEGTLVGLAGL 
310 320 330 340 350 360 



370 380 390 400 410 420 

ml 4 0 . pep KLSQPLSDKAVLFATAGVERDLNGRDYTVTGGFTGATAATGKTGARNMPHTRLVAGLGAD 
I I I I I I M I I I I M I I I I I I II I I I 11 I I I I I I I I I I I 1 I I I I I I I I I I I I It I I M 1 I I 
al4 0 KLSQPLSDKAVLFATAGVERDLNGRDYTVTGGFTGATAATGKTGARNMPHTRLVAGLGAD 
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370 380 390 400 410 420 

430 440 450 

ml 4 0 . pep VEFGNGWNGLARYSYAGSKQYGNHSGRVGVGYRFX 
II M I 11 I I M M M H { M M M I I M M I M If 
al4 0 VEFGNGWNGLARYSYAGSKQYGNHSGRVGVGYRFX 

430 440 450 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 553>: 

gl4l . seq 

1 atgagcttca aAAccgATGC CGAAACCGCC CAATCCTCCA CCATGCGCCC 

51 GATTGGCGAA ATTGCCGCCA AGCTGGGTTT GAACGTTGAC AACATTGAGC 

101 CTTACGGTCA TTACAAAGCC AAAATCAATC CTGCCGAAGC GTTCAAGCTG 

151 CCGCAAAAAC AAGGCAGGCT GATTTTGGTT ACCGCCATCA ACCCGACTCC 

2 01 GGCGGGCGAA GGCAAAACCA CCGTAACCAT CGGTTTGGCG GACGCATTGC 
251 G C CAT AT CGG CAAAG AC T CT GTGATTGCTT TGCGCGAGCC TTCTTTGGGT 
301 CCGGTGTTCG GCGTGAAAGG CGGCGCGGCA GGCGGCGGCT ACGCGCAAGT 

3 51 TTTGCCGATG GAAGACATCA ACCTGCACTT CACCGGCGAC TTCCACGCCA 

4 01 TCGGTGCGGC GAATAACCTC CTCGCCGCCA TGCTCGACAA C C AT AT CTAC 

4 51 CAAGGTAACG AGTTGAACAT CGACCCCAAA CGCGTGCTGT GGCGGCGCGT 

5 01 GGTCGATATG AACGACCGCC AGTTGCGCAA CAT CAT C GAC GGTATGGGCA 
5 51 AGCCTGTtga cggCGTGATG CGtCCcGACG GCTTCGACAT CACCGTCGCC 
601 TCCGAAGTGa tggcgGTATT CTGCCTTGCC AAAGACATCA GCGATTTGAA 
651 AGAGCGTTtt gGCAATATTC TCGTCGCCTA CGCCAAAGAC GGCAGCCCCG 
701 TTTACGCCAA AGATTTGAAG GCACACGGCG CGATGGCGGC ATTGCTAAAA 

7 51 GATGCGATTA AGCCCAATTT GGTGCAAACC AT CGAAGGCA CTCCGGCCTT 

8 01 TGTACACGGC GGCCCGTTCG CCAACATCGC CCACGGCTGC AACTCCGTTA 
851 CCGCAACCCG TCTGGCGAAA CACCTTGCCG ATTACGCCGT AACCGAAGCA 
901 GGCTTCGGCG CGGACTTGGG TGCGGAAAAA TTCTGCGACA TCAAATGCCG 
951 CCTTGCCGGT TTGAAACCTG ATGCGGCAGT CGTCGTGGCG ACTGTCCGCG 

10 01 CCCTGAAATA CAACGGCGGC GTGGAACGCG CCAACCTTGG TGAAGAAAAC 

1051 CTCGAAGCCT TGGCAAAAGG TTTGCCCAAC CTGTTGAAAC A CAT TT C CAA 

1101 CCTGAAAAAC GTATT CGGAC TGCCCGTCGT CGTTGCGCTC AACCGCTTCG 

1151 TGTCCGACTC CGATGCCG AG TTGGCGATGA TTGAAAAAGC CTGTGCCGAA 

1201 CACGGCGTTG AAGTTTCCCT GACCGAAGTG TGGGGCAAAG GCGGCGCGGG 

1251 CGG CGCGGAT TTGGCGCGCA AAGTCGTCAA TGCCATCGAC AACCAACCTA 

1301 ATAACTTCGG TTTCGCCTAC GATGTCGAGT TGGGCATCAA AGACAAAATC 

13 51 CGTGCGATTG CCCAAAAAGT GTACGGCGCG GAAGATGTCG ATTTCAGCGC 

14 01 GGAAGCGTCT GCCGAAATCG CCTCGCTGGA AAAACTGGGC TTGGACAAAA 
14 51 TGCCGATCTG CAT GGCG AAA ACCCAATATT CATTGAGCGA CAACGCCAAA 
1501 CTCTTGGGCT GCCCCGAAGG CTTCCGCATC GCCGTACGCG GTATCACTGT 
1551 TTCCGCCGGC GCGGGCTTCA TCGTTGCGTT GTGCGGCAAT ATGATGAAAA 
1601 TGCCGGGCCT GCCGAAAGTT CCGGCTGCCG AGAAAATCGA TGTGGACGAA 
1651 CACGGCGTGA TTCACGGCTT GTTCTGA 

This corresponds to the amino acid sequence <SEQ ID 554; ORF 141. ng>: 

gl41 .pep 

1 MS F KTDAE T A QSSTMRPIGE IAAKLGLNVD NIEPYGHYKA KINPAEAFKL 

51 PQKQGRLILV TAINPTPAGE GKTTVTIGLA DALRHIGKDS VIALREPSLG 

101 PVFGVKGGAA GGGYAQVLPM EDINLHFTGD FHAIGAANNL LAAMLDNHIY 

151 QGNELNIDPK RVLWRRWDM NDRQLRNIID GMGKPVDGVM RPDGFDITVA 

201 SEVMAVFCLA KDISDLKERF GNILVAYAKD GSPVYAKDLK AHGAMAALLK 

251 DAIKPNLVQT IEGTPAFVHG GPFANIAHGC NSVTATRLAK HLADYAVTEA 

3 01 GFGADLGAEK FCDIKCRLAG LKPDAAVWA TVRALKYNGG VERANLGEEN 

3 51 LEALAKGLPN LLKHISNLKN VFGLPVWAL NRFVSDSDAE LAMIEKACAE 
401 HGVEVSLTEV WGKGGAGGAD LARKWNAID NQPNNFGFAY DVELGIKDKI 

4 51 RAIAQKVYGA EDVDFSAEAS AEIASLEKLG LDKMPICMAK TQYSLSDNAK 
501 LLGCPEGFR I AVRG I TVS AG AGFIVA LCGN MMKMPGLPKV PAAEKIDVDE 
551 HGVIHGLF* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 555>: 
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ml 4 1 . seq 

1 ATGAGCTTCA AAACCGATGC CGAAATCGCC CAATCCTCCA CCATGCGCCC 

51 GATTGGCGAA ATTGCCGCCA AGCTTGGTCT GAATGCCGAC AACATTG AG C 

101 CTTACGGTCA TTACAAGGCG AAAATCAATC CTGCCGAAGC GTTCAAACTG 

151 CCGCAAAAAC AGGGCAGGCT GATTTTGGTT ACCGCCATCA ACCCGACTCC 

201 GGCGGGCGAA GGCAAAACCA CCGTAACCAT CGGTTTGGCG GACGCGTTGC 

251 GCCACATCGG CAAAGATGCC GTGATTGCCC TGCGCGAACC TTCTCTGGGG 

3 01 CCGGTGTTCG GCGTGAAAGG CGGCGCGGCA GGCGGCGGCT ATGCCCAAGT 

3 51 TTTGCCGATG G AAG A CAT CA ACCTGCACTT CACCGGAGAT TTTCACGCCA 

4 01 TCGGTGCGGC AAATAATCTG CTTGCCGCGA TGCTCGACAA CCATATCTAC 

4 51 CAAGGCAACG AGTTGAACAT CGACCCCAAA CGCGTGCTGT GGCGGCGCGT 

5 01 GGTCGATATG AACGACCGCC AGTTGCGCAA CATCATCGAC GGCATGGGTA 
551 AACCCGTTGA CGGCGTGATG CGTCCTGACG GTTT CGATAT TACCGTTGCT 
601 TCCGAAGTGA TGGCGGTATT CTGTCTTGCC AAAGACATCA GCGATTTGAA 

6 51 AGAGCGTTTG GGCAACATCC TTGTCGCCTA CGCCAAAGAC GGCAGCCCCG 
701 TTTACGCCAA AGATTTGAAA GCGAATGGCG CGATGGCGGC ATTGCTTAAA 
751 GATGCGATTA AGCCCAACTT GGTGCAAACC ATCGAAGGCA CGCCCGCCTT 
801 CGTACACGGC GGCCCGTTCG CCAACATCGC CCACGGCTGC AACTCCGTAA 
851 CCGCAACCCG TCTGGCGAAA CACCTTGCCG ATTACGCCGT AACCGAAGCA 
901 GGCTTCGGCG CGGACTTGGG CGCGGAAAAA TTCTGCGACA TCAAATGCCG 
951 CCTTGCCGGT TTGAAACCTG ATGCGGCTGT TGTCGTGGCG ACTGTCCGCG 

1001 CGTTGAAATA TAACGGCGGC GTGGAACGCG CCAACCTCGG CGAAGAAAAT 

1051 TTAGACGCTT TGGAAAAAGG TTTGCCCAAC CTGCTGAAAC ACATTTCCAA 

1101 CCTGAAAAAC GTATTCGGAC TGCCCGTCGT CGTTGCGCTC AACCGCTTCG 

1151 TGTCCGACGC CGATGCCGAG TTGGCGATGA TTGAAAAAGC CTGTGCCGAA 

1201 CACGGCGTTG AAGTTTCCCT GACCGAAGTG TGGGGCAAAG GTGGTGCGGG 

12 51 CGGCGCGGAT TTGGCGCGCA AAGTCGTCAA CGCCATTGAA AGTCAAACCA 

1301 ATAACTT CGG TTTCGCCTAC GATGTCGAGT TGGGCATCAA AGAC AAAAT C 

1351 CGTGCGATTG CCCAAAAAGT GTACGGCGCG GAAGATGTTG ATTTCAGCGC 

14 01 GGAAGCGTCT GCCGAAATCG CTTCACTGGA AAAACTGGGC TTGGACAAAA 

14 51 TGCCGATCTG CATGGCGAAA ACCCAATACT CTTTGAGCGA CAACGCCAAA 

1501 CTGTTGGGCT GCCCCGAAGA CTTCCGCATC GCCGTGCGCG GCATCACCGT 

1551 TTCCGCAGGC GCAGGTTTCA TCGTCGCCCT GTGCGGCAAC ATGATGAAAA 

1601 TGCCCGGCCT GCCCAAAGTT CCGGCTGCCG AGAAAATCGA TGTGGACGCA 

1651 GAAGGCGTGA TTCACGGCTT GTTCTGA 

This corresponds to the amino acid sequence <SEQ ID 556; ORF 141>: 
ml41 .pep 

1 MSFKTDAE I A QSSTMRPIGE I AAKLGLNAD NIEPYGHYKA KINPAEAFKL 

51 PQKQGRLILV TAINPTPAGE GKTTVTIGLA DALRHIGKDA VIALREPSLG 

101 PVFGVKGGAA GGGYAQVLPM EDINLHFTGD FHAIGAANNL LAAMLDNHIY 

151 QGNELNIDPK RVLWRRWDM NDRQLRNIID GMGKPVDGVM RPD GFD I TVA 

201 SEVMAVFCLA KDISDLKERL GN I LVAYAKD GSPVYAKDLK ANGAMAALLK 

251 DAI KPNLVQT IEGTPAFVHG GPFANIAHGC NSVTATRLAK HLADYAVTEA 

301 GFGADLGAEK FCDIKCRLAG LKPDAAVWA TVRALKYNGG VERANLGEEN 

351 LDALEKGLPN LLKHISNLKN VFGLPVWAL NRFVSDADAE LAMIEKACAE 

401 HGVEVSLTEV WGKGGAGGAD LARKWNAIE SQTNNFGFAY DVELGIKDKI 

4 51 RAIAQKVYGA EDVDFSAEAS AEIASLEKLG LDKMPICMAK TQYSLSDNAK 

501 LLGCPEDF RI AVRGITVSAG AGFIVA LCGN MMKMPGLPKV PAAEKIDVDA 

551 EGVIHGLF* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 141 shows 97.5% identity over a 558 aa overlap with a predicted ORF (ORF 141.ng) 
from N. gonorrhoeae: 

ml41/gl41 

10 20 30 40 50 60 

ml41 .pep MSFKTDAEIAQSSTMRPIGEIAAKLGLNADNIEPYGHYKAKINPAEAFKLPQKQGRLILV 

I I 1 1 1 1 1 1 Ml Ml I! II !l I I ! Ml |: ! || ! II I!! MMi 1 !! M! HI ! M II ! I! 

gl41 MSFKTDAETAQSSTl^PIGEIAAKLGLNVDNIEPYGHYKAKINPAEAFKLPQKQGRLILV 

10 20 30 40 50 60 
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70 80 90 100 110 120 

ml 4 1 . pep T A I N P T PAG EG KTTVT I G LADALRH I G KD A V I ALR EPS LG P V FG V KGG AAGGG Y AQ VL PM 

I I I II I II M 1 1 1 1 1 1 II II I 1 1 1 M 1 1 1 : 1 M I M M 1 1 II M II 1 1 I I'M I I 1 1 1 M I 

gl41 TAINPTPAGEGKTTVTIGLADALRHIGKDSVIALREPSLGPVFGVKGGAAGGGYAQVLPM 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml4l .pep EDINLHFTGDFHAI GAANNLLAAMLDNHI YQGNELNI DPKRVLWRRVVDMNDRQLRNI ID 

IIIIIMIIIIIMIIIMIIMIIMIIIIMIIIMIIMIIIIINIMIMIIMI 

gl41 ED I NLHFTGDFHAI GAANNLLAAMLDNHI YQGNELNI DP KRVIjWRRVVDMN^ 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml41 .pep GMGKP VDG VMR PDG FD I TV AS E VMAV F CLAKD I S D L KE RLGN I L V AY AKDG S P VY AKD L K 

MIIIIIIIMIMMIIMIIIIIIMIIIIIMIllhMMIMMIII IMIIIII 

g 1 4 1 GMGKPVDGVMRPDGFD I TV AS E VMAV F CLAKD I SDLKERFGN I LV AYAKDGS P VYAKDLK 

190 200 210 220 230 240 



250 260 270 280 290 300 

ml 4 1 . pep ANGAMAALLKDAI KPNLVQTI EGTPAFVHGGPFANI AHGCNS VTATRLAKHLADYAVTEA 

hlllllllllllllMIIMIIIMIMIIIMIIIIIMMIIIMIIIIIIIIIIM 

gl41 AHGAMAALLKDAI KPNLVQTI EGTPAFVHGGPFANI AHGCNS VTATRLAKHLADYAVTEA 

250 260 270 280 290 300 



310 320 330 340 350 360 

ml41 .pep GFGADLGAEKFCDIKCRLAGLKPDAAVWATVRALKYNGGVERANLGEENLDALEKGLPN 

M M 1 1 M 1 1 1 1 1 1 1 1 1 1 1 i i 1 1 1 1 11 M 1 1 1 1 1 11 1 1 M I II M ! I M M ^ I I 1IIII 

g 1 4 1 GFGADLGAEKFCDI KCRLAGLKPDAAWVATVRALKYNGGVERANLGEENLEALAKGLPN 

310 320 330 340 350 360 



370 380 390 400 410 420 

ml4 1 . pep LLKHISNLKNVFGLPVWALNRFVSDADAELAMIEKACAEHGVEVSLTEWGKGGAGGAD 

M M M M M 1 1 M M II M i I M 1 1 • 1 1 1 M I N 1 1 1 II I M M 1 1 1 1 1 1 1 M 1 1 M II 

gl4 1 LLKHI SNLKNVFGLPVWALNRFVSDSDAELiAMI EKACAEHGVEVS LTEVWGKGGAGGAD 

370 380 390 400 410 420 



430 440 450 460 470 480 

ml41 .pep LARKWNA I E S QTNN FG F A YDVE LG I KDKI RAI AQ KVYGAED VD F S AE AS AE I AS L E KLG 

MllilllhM IIIIMIIIIIMIIMMIIIMIIMIIIIIMIIIIIMIIIII 

gl41 LARK\A^AIDNQPNNFGFAYDVELGIKDKIRAIAQKVYGAEDVDFSAEASAEIASLEKLG 

430 440 450 460 470 480 



490 500 510 520 530 540 

ml4 1 . pep LD KMP I CMAKTQ YS LS DN AKLLG CPEDFR I A VRG I TVS AG AG F I VALCGNMMKMPGLPKV 

I I I Mill Ml III Mill I I I MM IIMM II III I M Ml Mill MM il Mill 

gl41 LD K>1P I CMAKTQYSLSDNAKLLGCPEGFR I AVRG I TVS AGAGF I VALCGNMMKMPGLPKV 

490 500 510 520 530 540 



550 559 
ml41 .pep P AAE K I D VD AE GV I HGLFX 

IMIIIII! MINIMI 

g 1 4 1 PAAEKI DVDEHGVIHGLFX 

550 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 557>: 

al41 . seq 

1 ATGAGTTTCA AAACCGATGC CGAAATCGCC CAATCCTCCA CCATGCGCCC 

51 GATTGGCGAA ATTGCCGCCA AGCTGGGTTT GAACGTTGAC AACATTGAGC 

101 CTTACGGTCA TTACAAAGCC AAAATCAATC CTGCCGAAGC GTTCAAACTG 

151 CCGCAAAAAC AGGGCAGGCT GATTTTGGTT ACCGCCATCA ACCCGACTCC 
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2 01 GGCGGGCGAA GGTAAAACCA CCGTAACCAT CGGTTTGGCG GACGCATTGC 

251 GCCATATCGG CAAAGACTCT GTGATTGCTT TGCGCGAGCC TTCTTTGGGT 

301 CCGGTGTTCG GCGTGAAAGG CGGCGCGGCA GGCGGCGGCT ATGCCCAAGT 

351 TTTGCCGATG GAAGACATCA ACCTGCACTT CACCGGAGAT TTTCACGCCA 

4 01 TCGGTGCGGC AAATAATCTG CTTGCCGCGA TGCTCGACAA CCATATCTAC 

4 51 CAAGGCAACG AGTTGAACAT CGACCCCAAA CGCGTGCTGT GGCGGCGCGT 

501 GGTCGATATG AACGACCGCC AGTTGCGCAA CATCATCGAC GGCATGGGCA 

551 AGCCTGTTGA CGGCGTGhTG CGTCCTGACG GTTTCGATAT TACCGTTGCT 

601 TCCGAAGTGA TGGCGGTATT CTGTCTTGCC AAAG AC AT C A GCGATTTGAA 

651 AGAGCGTTTG GGCAACATCC TTGTCGCCTA CGCCAAAGAC GGCAGCCCCG 

7 01 TTTACGCCAA AGATTTGAAA GCGAATGGCG CGATGGCGGC ATTGCTTAAA 

7 51 GATGCGATTA AGCCCAACTT GGTGCAAACC ATCGAAGGCA CGCCCGCCTT 
801 CGTACACGGC GGCCCGTTCG CCAACATCGC CCACGGCTGC AACTCCGTAA 

8 51 CCGCAACCCG TCTGGCGAAA CACCTTGCCG ATTACGCCGT AACCGAAGCA 
901 GGCTTCGGCG CGGACTTGGG CGCGGAAAAA TTCTGCGACA TCAAATGCCG 
951 CCTTGCCGGT TTGAAACCTG ATGCGGCTGT TGTCGTGGCG ACTGTCCGCG 

1001 CGTTGAAATA TAACGGCGGC GTGGAACGCG CCAACCTCGG CGAAGAAAAT 

1051 TTAGACGCTT TGGAAAAAGG TTTGCCCAAC CTGCTGAAAC ACATTTCCAA 

1101 CCTGAAAAAC GTATTCGGAC TGCCCGTCGT CGTTGCGCTC AACCGCTTCG 

1151 TGTCCGACTC CGATGCCGAG TTGGCGATGA TTGAAAAAGC CTGTGCCGAA 

1201 CACGGCGTTG AAGTTTCCCT GACCGAAGTG TGGGGCAAAG GTGGTGCGGG 

1251 CGGCGCGGAT TTGGCGCGCA AAGTCGTCAA CGCCATTGAA AGTCAAACCA 

1301 ATAACTTCGG TTTCGCCTAC GATGTCGAGT TGGGCATCAA AGACAAAATC 

13 51 CGTGCGATTG CCCAAAAAGT GTACGGCGCG GAAGATGTTG ATTTCAGCGC 

14 01 GGAAGCGTCT GCCGAAATCG CTTCACTGGA AAAACTGGGC TTGGACAAAA 
14 51 TGCCGATCTG CATGGCGAAA ACCCAATACT CTTTGAGCGA CAACGCCAAA 
1501 CTGTTGGGCT GCCCCGAAGA CTTCCGCATC GCCGTGCGCG GCATCACCGT 
1551 TTCCGCAGGC GCAGGTTTCA TCGTCGCCCT GTGCGGCAAC ATGATGAAAA 
1601 TGCCCGGCCT GCCCAAAGTT CCGGCTGCCG AGAAAATCGA TGTGGACGCA 
1651 GAAGGCGTGA TTCACGGCTT GTTCTGA 

lis corresponds to the amino acid sequence <SEQ ID 558; ORF 141.a>: 

al41 , pep 

1 MSFKTDAEIA QSSTMRPIGE IAAKLGLNVD NIEPYGHYKA KINPAEAFKL 

51 PQKQGRLILV TAINPTPAGE GKTTVTIGLA DALRHIGKDS VIALREPSLG 

101 PVFGVKGGAA GGGYAQVLPM EDINLHFTGD FHAIGAANNL LAAMLDNHIY 

151 QGNELNIDPK RVLWRRWDM NDRQLRNIID GMGKPVDGVM RPD GFDITVA 

201 SEVMAVFCLA KDISDLKERL GNILVAYAKD GSPVYAKDLK ANGAMAALLK 

251 DAIKPNLVQT IEGTPAFVHG GPFANIAHGC NSVTATRLAK HLADYAVTEA 

301 GFGADLGAEK FCDIKCRLAG LKPDAAVWA TVRALKYNGG VERANLGEEN 

351 LDALEKGLPN LLKHISNLKN VFGLPWVAL NRFVSDSDAE LAMIEKACAE 

401 HGVEVSLTEV WGKGGAGGAD LARKWNAIE SQTONFGFAY DVELGIKDKI 

4 51 RAIAQKVYGA EDVDFSAEAS AEIASLEKLG LDKMPICMAK TQYSLSDNAK 

501 LLGCPEDFR I AVRGITVSAG AGFIVA LCGN MMKMPGLPKV PAAEKIDVDA 

551 EGVIHGLF* 

141/al41 99.5% identity in 558 aa overlap 

10 20 30 40 50 60 

ml 41. pep MSFKT DAEIAQSSTMRPIGEIAAKLGLNADNIEPYGHYKAKINPAEAFKLPQKQGRLILV 

I I I I M I I I I! I I I I I i f I I I I I i I i i I : I I I I i I I I i i I I i i I I I I i ! M t I I I I I I I 1 
al41 MSFKTDAEIAQSSTMRPIGEIAAKLGLNVDNIEPYGHYKAKINPAEAFKLPQKQGRLILV 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 4 1 . pep TA IN PTPAGEGKTTVT I GLADALRHIGKDAVIALRE PS LGPVFGVKGGAAGGGYAQVLPM 

M I I I I II M I i I I I I M I I i I I M I I I 1 : I I I I M ! 1 I ! I I I I I 1 M I t I ! I I I I I I I I 
al41 T AI N PTPAGEGKTTVT I GLADALRH I GKDSV I ALRE PS LGPVFGVKGGAAGGGYAQVLPM 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 41 .pep EDINLHFTGDFHAIGAANNLLAAMLDNHIYQGNELNIDPKRVLWRRWDMNDRQLRNIID 
M I I I i I I I I I I I I I I I I I I It I I I I 1 I I I I I I 11 I 11 I I I I I I I I I M I I I I I I I I I I I 
a 141 EDINLHFTGDFHAIGAANNLLAAMLDNHIYQGNELNIDPKRVLWRRWDMNDRQLRNIID 

130 140 150 160 170 180 
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190 200 210 220 230 240 

ml 41 . pep GMGKPVDG VMRPDGFDIT VASE VMAVFCLAKDISDLKERLGNILVA YAK DGSPVYAKDLK 

I I I I M I I I I I I M I I II I I I I I I I I I I i 1 M I I M II II M I I i I I I Ml I I II I I I ! I 
al41 GMGKPVDGVMRPDGFDITVASEVMAVFCLAKDISDLKERLGNILVAYAKDGSPVYAKDLK 

190 200 210 220 230 240 



250 260 270 280 290 300 

ml 41 .pep ANGAMAALLKDAIKPNLVQTIEGTPAFVHGGPFANIAHGCNSVTATRLAKHLADYAVTEA 
I I I I I I M t I I I I I I I I I I I I I I I I I I I I I I I I I M II II I I I M I I I M M I I I I I II I 
al41 ANGAMAALLKDAIKPNLVQTIEGTPAFVHGGPFANIAHGCNSVTATRLAKHLADYAVTEA 

250 260 270 280 290 300 



310 320 330 340 350 360 

ml 4 1 . pep GFGADLGAEKFCDIKCRLAGLKPDAA\AA?ATVRALKYNGGVERANLGEENLDALEKGLPN 
II I I I M I I I I M I M I I I I I I I I I I I I I I I I I I II I I I i I 1 I I M 1 I I I I I I I I I I I I I 
al41 GFGADLGAEKFCDIKCRLAGLKPDAAVWATVRALKYNGGVERANLGEENLDALEKGLPN 

310 320 330 340 350 360 



370 380 390 400 410 420 

ml 41 .pep LLKHISNLKNVFGLPVWALNRFVSDADAELAMIEKACAEHGVEVSLTEVWGKGGAGGAD 
I I I !! I I I I I I I 1 ! I I I I M I ! I M I I I I I I I I M I I I ! I I I I I 1 I I I I I I ! I ) I ) I I 
al41 LLKHISNLKNVFGLPVWALNRFVSDSDAELAMIEKACAEHGVEVSLTEVWGKGGAGGAD 

370 380 390 400 410 420 



430 440 450 460 470 480 

ml4 1 . pep LARKVVNAIESQTNNFGFAYDVELGIKDKIRAIAQICVYGAEDVDFSAEASAEIASLEKLG 
II I M M I I I I II II I M 1 I I I I I I I II I I II M I II I M I II I I I I I II I I I M I M I I 
al41 LARKWN AI E S QTNN FG FAY D VE LG I KDKI RAI AQKV YG AEDV D FS AE AS AE IAS LEKLG 

430 440 450 460 470 480 



490 500 510 520 530 540 

ml 41 .pep LDKMPICMAKTQYSLSDNAKLLGCPEDFRIAVRGITVSAGAGFIVALCGNMMKMPGLPKV 
I M I f I M M I II M M I I II M f II I I I M I I I 1 I M I I I M I I! M II I I M I f I II I 
al41 LDKMPICMAKTQYSLSDNAKLLGCPEDFRIAVRGITVSAGAGFIVALCGNMMKMPGLPKV 

490 500 510 520 530 540 



550 559 
ml 4 1 . pep PAAEKI DVDAEGVIHGLFX 

I I I I I I I I I I I I I I I II I I 
a 1 4 1 PAAEKI DVDAEGVIHGLFX 

550 



The following partial DNA sequence was identified in N ^gonorrhoeae <SEQ ID 559>: 

gl42 . seq 

1 ATGCGTGCCG ATTTCATGTT TGCCGACAAT ATGCCCGTGC AGGTGCGCCA 

51 ACGCGCCTTC TATTTCAAGT TGTCCCGTTT TGCCGCGATG CCAAATATGG 

101 TAGGCAAACC GCTCTTCGGG CGACAGGCCG GTCAGCCCGG CAAAATGTTC 

151 GGCAACATCC TGATGTTCGT CCGCCAGCAT ATTGATGCAG AGgCTGCCGT 

201 TTTCCGACAG GATcggaATG AttcgCGCAC TCCGGTTTAT G C AC AG CAT C 

251 ACGGTCGGCG GCTCGTCGGT AACCGGCG C A ACCGCCGTCA TTGTAATGCC 

301 GTAACGCCCT GCCGCACCGT CTGTCGTGAT GACATGAACG CCTGCCGCAC 

351 AGGATGCCAT CGCATCACGG AACGAAGTTT GAAAAGTTTT CTGCAAATCC 

4 01 GCCATTTTTC CCCTTTAAAC CGTCCCCTAT ATAAGAATGC TGCACACAAG 

4 51 GCATCCCCCC ATGTGCAGCA GTTCTGA 

This corresponds to the amino acid sequence <SEQ ID 560; ORF 142.ng>: 

gl42 .pep 

1 MRADFMFADN MPVQVRQRAF YFK1»SRFAAM PNMVGKPLFG RQAGQPGKMF 

51 GNILMFVRQH IDAEAAVFRQ DRNDSRTPVY AQHHGRRLVG NRRNRRHCNA 

101 VTPCRTVCRD DMNACRTGCH RITERSLKSF LQIRHFSPLN RPLYKNAAHK 

151 ASPHVQQF* 



WO 99/057280 



403 



PCT/DS99/09346 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 561 : 



ml42 . seq 








1 


ATGCGTGCCG 


ATTTCATGTT 


TGC CGACAAT 


51 


ACGCGCCCTC 


TATTTCAAGT 


TGTCCCGTTT 


101 


TAGG CAAACC 


GCTCTTCGGG 


CGACAGG C CG 


151 


GGCAACATCC 


TGATGTTCGT 


CCGCCAGCGT 


201 


TTTCCGACAG 


GATCGGAATG 


ATTCGCGCAC 


251 


ACGGTCGGCG 


GCTCGTCGGT 


AACCGGCGCG 


301 


GTAACGCCCT 


GCCGCACCGT 


CTGTCGTGAT 


351 


AAGATGCCAT 


CGCATCACGG 


AACGAAGTTT 


401 


GCGATTTTTC 


CCCTTTAAAC 


TGTCCCCTAT 


451 


GCATCCCCcC 


ATGTGCAGCA 


GTTTTGA 



This corresponds to the amino acid sequence <SEQ ID 562; ORF 142>: 

ml4 2 .pep 

1 MRADFMFADN MPVQVRQRAL YFKLSRFAAM PDWGKPLFG RQAGQPGKMF 
51 GNILMFVRQR IDAEAAVFRQ DRNDSRTPVD AQHHGRRLVG NRRDRRHCNA 
101 VTPCRTVCRD DMNACRARCH RITERSLKIF LQIRHFSPLN CPLYKNAAHK 
151 ASPHVQQF* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 142 shows 93.7% identity over a 158 aa overlap with a predicted ORF (ORF 142.ng) 
from N. gonorrhoeae: 

ml42/gl42 

10 20 30 40 50 60 

ml42 . pep MRADFMFADNMPVQVRQRALYFKLSRFAAMPDVVGKPLFGRQAGQPGKMFGNILMFVRQR 

1 1 1 1 M 1 1 1 1 M 1 1 M M I • M I M M N I h : M 1 1 II M I M M M M I M I M I M - 

gl42 MRADFMFADNMPVQVRQRAFYFKLSRFAAMP^^WGKPLFGRQAGQPGK>IFGNILMF T VRQH 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml4 2 . pep I DAE AAV FRQDRND S RT P VD AQHHGRRL VGNRRDRRH CN AVT P C RTVCRDDMN ACRARCH 

IIIIII1I1MIIIIIIII II Mill IIM I hill II I I MMMI II 1 : I I I h M 

gl4 2 I D AEAAV FRQDRNDS RT P VY AQHHGRRLVGNRRNRRH CN AVT P C RTVCRDDMN ACRT GCH 

70 80 90 100 110 120 

130 140 150 159 

ml4 2 .pep RITERSLKI FLQIRHFSPLNCPLYKNAAHKASPHVQQFX 

IIIIIIM MIMMIIM MIUMIIIMIMIII 

g 1 4 2 R I TERSLKS FLQ I RHFS PLNRPL YKNAAHKAS PHVQQFX 

130 140 150 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 563>: 

al42 . seq 

1 ATGCGTGCCG ATTTCATGTT TGC CGACAAT ATGCCCGTGC AGGTGCGCCA 

51 ACGCGCCCTC TATTTCAAGT TGTCCCGTTT TGCCGCGATG CCAGATGTGG 

101 TAGGCAAACC GCTCTTCGGG CGACAGGCCG GTCAGCCCGG CAAAATGTTC 

151 GGCAACATCC TGATGTTCGT CCGCCAGCGT ATTGATGCAG AGGCTGCCGT 

201 TTTCCGACAG GATCGGAATG ATTCGCGCAC TCCGGTTGAT GCACAGCATC 

251 ACGGTCGGCG GCTCGTCCGT AACCGGCGCA ACCGCCGTCA TTGTAATGCC 

301 GTAACGCCCT GCCGCACCGT CTGTCGTGAT GACATGAACG CCTGCCGCAC 

351 AGGATGCCAT CGCATCACGG AACGAAGTTT GAAAAGTTTT CTGCAAATCC 

4 01 GCCATTTTTC CCCTTTAAAC TGTCCCCTAT ATAAGAATGC TGCACACAAG 

4 51 GCACCCCCCA TGTGCAGCAG TTCTGATTCA AAAAGCCGTC GGTCGGACAT 

501 TTCCGCGCGT TACGGCGTAT TACGAGTTCA ACGCATCCTC GATTTTGGCA 

551 AGTTCTGCCA ACAGGTCTTT AAGCAGCAGC ATTTTCTCGC GGCCCAGCAC 

601 TTCCTCGATA GCGTCGTAAC GCTCGTCCAC TTCTTCGCCG ATTTCCTCAT 

651 ACAGCTTCTC GCCCTCGGCA GTCAGCTTCA GAAAAACACG TCGTTGGTCG 

7 01 TTGGAAGGTT TCAGGCGGAC AACCAAACCC GCTTTTTCAA GGCGGGTCAG 

7 51 GATACCGGTC AGGCTGGGGC GCAAAATGCA CGCCTGATTC GCCAAATCTT 
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801 GAAAGTCCAG CGTGCCGTTT TCCGCCAAAA GACGGATAAT CCGCCATTGC 

851 TGATCGGTAA TATTCGCCTG ATTCAGAATA GGCCTGAATT GGGTCATCAG 

901 GGCTTCCCTT GCCTGTATCA GACCGATATT GATAGACGCA TGTTTTGA 

This corresponds to the amino acid sequence <SEQ ID 564; ORF 142.a>: 

al4 2 . pep 

1 MRADFMFADN MPVQVRQRAL YFKLSRFAAM PDWGKPLFG RQAGQPGKMF 

51 GNILMFVRQR I DAE AAV FRQ DRNDSRT PVD AQHHGRRLVR NRRNRRHCNA 

101 VTPCRTVCRD DMNACRTGCH RITERSLKSF LQIRHFSPLN CPLYKNAAHK 

151 APPMCSSSDS KSRRSDISAR YGVLRVQRIL DFGKFCQQVF KQQHFLAAQH 

201 FLDS VVTLVH FFADFLIQLL A LGSQLQKNT SLVVGRFQAD NQTRFFKAGQ 

2 51 DTGQAGAQNA RLIRQILKVQ RAVFRQKTDN PPLLIGNIRL IQNRPELGHQ 

301 GFPCLYQTDI DRRMF* 



ml42/al42 96.1% identity in 153 aa overlap 

10 20 



30 



40 



50 



60 



ml 4 2 . pep MRADFMFADNMPVQVRQRALYFKLSRFAAMPDVVGKPLFGRQAGQPGKMFGNILMFVRQR 
I M I M I I ! I 1 I I I I I 1 I I I I I I! M I M I I I 11 I I I I I ! I I I I I I I I I M I I II I II I I 
a!42 MRADFMFADNMPVQVRQRALYFKLSRFAAMPDWGKPLFGRQAGQPGKMFGNILMFVRQR 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml4 2 . pep IDAEAAVFRQDRNDSRTPVDAQHHGRRLVGNRRDRRHCNAVTPCRTVCRDDMNACRARCH 
I I I I I M I I It ) I I I I I I I I I I I 1 I I I I 1 I I I : I 1 M I I I I I I I I I M M II I I I : II 
al4 2 IDAEAAVFRQDRNDSRTPVDAQHHGRRLVRNRRNRRHCNAVTPCRTVCRDDMNACRTGCH 

70 80 90 100 110 120 

130 140 150 159 

ml42 .pep RITERSLKIFLQIRHFSPLNCPLYKNAAHKASPHVQQFX 

MINIM I M 1 I II M I I M I I M M M I I 
al4 2 RITERSLKSFLQIRHFSPLNCPLYKNAAHKAPPMCSSSDSKSRRSDISARYGVLRVQRIL 

130 140 150 160 170 180 



al42 



DFGKFCQQVFKQQHFLAAQHFLDSWTLVHFFADFLIQLLALGSQLQKNTSLVVGRFQAD 
190 200 210 220 230 240 



The following partial DNA sequence 



gl43 . seq 






1 


ATGTTGAGCT 


TCGGCTATCT 


51 


CTCGCAGATG 


AGCCGCATTT 


101 


TGGGCTGGTT 


TTTCATCCTG 


151 


ATAGTGgGCT 


ACTACTCAGA 


201 


CCGCCTGCCG 


TATCTGCTTT 


251 


TTTTGATGCC 


GAACTCGGGC 


301 


GCCTTGTCGT 


TCGGCGCGCT 


351 


TATGGCGATG 


CAGCCGTTTA 


401 


AGCAGAAAAG 


CTACGCCTAC 


451 


GCGGTTGTGG 


CAGCGATTCT 


501 


GAACACTGCC 


GAGAAAGGCG 


551 


ATGTGGGTGC 


GGCGTTACTG 


601 


GTCAAAGAAT 


ACGACCCGGA 


651 


CGCCGCGAAT 


CAGGAAAAAG 


701 


CTAAAGTGTT 


TTGGACGGTT 


751 


TTCCGGTATA 


TGTGGACTTA 


801 


GCACACTACC 


GATGCGTCTT 


851 


ACGGCGTTTT 


QGCGGCGGTG 


901 


ATTCTGGCAA 


AAGTACCGAA 


951 


TTTGGCTTTG 


GGCGCGCTCG 


1001 


AATACGCACT 


CATCCTGTCT 


1051 


ATTATCACTT 


ATCCGCTGAC 


1101 


CATGGATACT 


TATTTGGGCC 


1151 


tcgTcgctTC 


GctgttgAGT 



identified in N. gonorrhoeae <SEQ ID 565>: 

CGGCGTTCAG ACGGCCTTTA CCCTGCAAAG 
TTCAAACGCT AGGCGCAGAC CCGCACAATT 
CCGCCGCTGG CGGGGATGCT GGTTCAGCCG 
CCGCACTTGG AAGCCGCGCT TGGGCGGCCG 
ACGGCACGCT GATTGCGGTC ATCGTGATGA 
AGCTTCGGTT TCGGCTATGC GTCGCTGGCG 
GATGATTGCG CTGTTGGACG TGTCGTCGAA 
AGATGATGGT CGGCGATATG GTCAACGAGG 
GGGATTCAAA GTTTCTTAGC GAATACGGAC 
GCCGTTTGTG TTcgcgtata TCGGTTTGGC 
TTGTGCCACA AACCGTGGTC GTAGCATTCT 
ATTATTACCA GTGCGTTCAC AATCTCCAAA 
AACCTACGCC CGTTACCACG GCATCGATGT 
CCAACTGGTT CGAACTCTTA AAAACCGCGC 
ACTCCGGTAC AGTTTTTCTG CTGGTTCGCC 
CTCGGCAGGC GCGATTGCAG AAAACGTCTG 
CCGTAGGCCA TCAGGAGGCG GGCAACCGGT 
TAGTCGGTTG CGGCGGTGAT TTGTTCGTTT 
TAAATACCAT AAGGCGGGTT ATTTCGGCTG 
GTTTCTTCTC TATCTTCTTC ATCTACAATC 
T AT AT CT T AA TCGGCATCGC TTGGGCGGGC 
GATTGTGGCC AACGCTTTGT CGGGCAAACA 
TGTttaacgg ctctgtCTGT ATGCcgcaaa 
TTCGTGCTTT TCCCGATGCT GGGCGGCCAT 
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1201 CAGGCAACCA TGTTCTTGGT TGCAGGCGCA GTCTTGCTGC TGGGAGCCTT 

1251 CTCAGTCTGT CTGATTAAAG AGATCCACGG CGGGGTTTGA 

This corresponds to the amino acid sequence <SEQ ID 566; ORF 143.ng>: 

gl4 3 . pep 

1 MLSFGYLGVQ TAFTLQSSQM SRIFQTLGAD PHNLGW FFIL PPLAGMLVQP 

51 IVGYYSDRTW KPRLGGRR LP YLLYGTLIAV IVMIL MPNSG SFGFGYASLA 

101 ALSFGALMIA LLDV SSNMAM QPFKMMVGDM VNEEQKSYAY GIQSFLANTD 

151 AWAAILPFV FAYIGLA NTA EKGVVPQT W VAFYVGAALL IITSA FTISK 

201 VKEYDPETYA RYHGIDVAAN QEKANWFELL KTAPKVFWTV TPVQFFCW FA 

251 FRYMWTYSAG AIAENVWHTT DASSVGHQEA GNRYG VLAAV *SVAAVICSF 

301 ILAKVPNKYH KAG YFGCLAL GALGFFSIFF IYN QYALILS YILIGIAWAG 

351 IITYPLTIVA NALSGKHMDT YLGLFN GSVC MPQIVASLLS FVL FPMLGGH 

4 01 QATM FLVAGA VLLLGAFSVC L IKEIHGGV* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 567>: 

ml 4 3 . seq 

1 ATGCTCAGTT TCGGCTTTCT CGGCGTTCAG ACGGCCTTTA CCCTGCAAAG 

51 CTCGCAAATG AGCCGCATTT TTCAAACGCT AGGCGCAGAC CCGCACAATT 

101 TGGGCTGGTT TTTCATCCTG CCGCCGCTGG CGGGGATGCT GGTGCAGCCG 

151 ATTGTCGGCC ATTACTCCGA CCGCACTTGG AAGCCGCGTT TGGGCGGCCG 

201 CCGTCTGCCG TATCTGCTTT ATGGCACGCT GATTGCGGTT ATTGTGATGA 

251 TTTTGATGCC GAACTCGGGC AGCTTCGGTT TCGGCTATGC GTCGCTGGCG 

301 GCTTTGTCGT TCGGCGCGCT GATGATTGCG CTGTTAGACG TGTCGTCAAA 

351 TATGGCGATG CAGCCGTTTA AGATGATGGT CGGCGACATG GTCAACGAGG 

4 01 AGCAGAAAGG CTACGCCTAC GGGATTCAAA GTTTCTTAGC AAATACGGGC 

4 51 GCGGTCGTGG CGGCGATTCT GCCGTTTGTG TTTGCGTATA TCGGTTTGGC 

501 GAACACCGCC GAGAAAGGCG TTGTGCCGCA GACCGTGGTC GTGGCGTTTT 

551 ATGTGGGTGC GGCGTTGCTG GTGATTACCA GCGCGTTCAC GATTTTCAAA 

601 GTGAAGGAAT ACGATCCGGA AACCTACGCC CGTTACCACG GCATCGATGT 

651 CGCCGCGAAT CAGGAAAAAG CCAACTGGAT CGAACTCTTG AAAACCGCGC 

7 01 CTAAGGCGT^ TTGGACGGTT ACTTTGGTGC AATTCTTCTG CTGGTTCGCC 

7 51 TTCCAATATA TGTGGACTTA CTCGGCAGGC GCGATTGCGG AAAACGTCTG 

801 GCACACCACC GATGCGTCTT CCGTAGGTTA TCAGGAGGCG GGTAACTGGT 

851 ACGGCGTTTT GGCGGCGGTG CAGTCGGTTG CGGCGGTGAT TTGTTCGTTT 

901 GTATTGGCGA AAGTGCCGAA TAAATACCAT AAGGCGGGTT ATTTCGGCTG 

951 TTTGGCTTTG GGCGCGCTCG GCTTTTTCTC CGTTTTCTTC ATCGGCAACC 

1001 AATACGCGCT GGTGTTGTCT TATACCTTAA TCGGCATCGC TTGGGCGGGC 

1051 ATTATCACTT ATCCGCTGAC GATTGTGACC AACGCCTTGT CGGGCAAGCA 

1101 TATGGGCACT TACTTGGGCT TGTTTAACGG CTCTATCTGT ATGCCTCAAA 

1151 TCGTCGCTTC GCTGTTGAGT TTCGTGCTTT TCCCTATGCT GGGCGGCTTG 

1201 CAGGCCACTA TGTTCTTGGT AGGGGGCGTC GTCCTGCTGC TGGGCGCGTT 

1251 TTCCGTGTTC CTGATTAAAG AAACACACGG CGGGGTTTGA 

This corresponds to the amino acid sequence <SEQ ID 568; ORF 143>: 

ml 4 3 . pep 

1 MLSFGFLGVQ TAFTLQSSQM SRIFQTLGAD PHNLGW FFIL PPLAGMLVQP 

51 IVG HYSPRTW KPRLGGRR LP YLLYGTLIAV IVMIL MPNSG SFGFGYASLA 

101 ALSFGALMIA LLDV SSNMAM QPFKMMVGDM VNEEQKGYAY GIQSFLANTG 

151 AWAAILPFV FAYIGLA NTA EKGWPQT W VAFYVGAALL VITSA FTIFK 

201 VKEYDPETYA RYHGIDVAAN QEKANWIELL KTAPKAFWTV TLVQFFCWFA 

251 FQYMWTYSAG AIAENVWHTT DASSVGYQEA GNWYG VLAAV QSVAAVICSF 

301 VLAKVPNKYH KAG YFGCLAL GALGFFSVFF IGNQ YALVLS YTLIGIAWAG 

351 IITYPLTIVT NALSGKHMGT YLGLFN GSIC MPQIVASLLS FVL FPMLGGL 

4 01 QATM FLVGGV VLLLGAFSVF LIKETHGGV* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 



ml43 / gl43 93.9% identity in 429 aa overlap 

10 20 30 40 50 60 
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ml 4 3 pep MLSFGFLGVQTAFTLQSSQMSRIFQTLGADPHNLGWFFILPPLAGMLVQPIVGHYSDRTW 

I | | | | : | | | | | M I II I t I I I I I I M I I I I t I I I I I I I I I I I I I M I M II I I : I I I I I I 
ql43 MLSFGYLGVQTAFTLQSSQMSRIFQTLGADPHNLGWFFILPPLAGMLVQPIVGYYSDRTW 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml4 3 pep K PRLGGRRLPYLLYGTLIAVIVMILMPNSGSFGFGYASLAALSFGALMIALLDVSSNMAM 
M I I I I ! I I ! M I M I M I M I I I I I I I I M I I I I I I I I I M I M I I I I I M I I! I I I I I 
ql43 KPRLGGRRLPYLLYGTLIAVIVMILMPNSGSFGFGYASLAALSFGALMIALLDVSSNMAM 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 4 3 pep QPFKMMVGDMVNEEQKGYAYGIQSFLANTGAVVAAILPFVFAYIGLANTAEKGWPQTVV 

| | M | | | | M M I I I I : M I I I I I I I I I I I I I M I I I I I I I II I I I I I I I I I I I I I I M 
ql43 QPFKMMVGDMVNEEQKSYAYGIQSFLANTDAWAAILPFVFAYIGLANTAEKGWPQTW 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml 4 3 pep VAFYVGAALLV I T S AFT I FKVKE YDPET YARYHG I DVAANQEKANW I E LLKTAPKAFWT V 

|| : | | | | I I I I I I I I I I I I I I I II I M I I I I I I M I I : I I I I I I I I : I I I I 

ql43 VAFYVGAALLIITSAFT I SKVKE YDPET YARYHG I DVAANQEKANWFELLKTAPKVFWTV 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml 4 3 pep TLVQFFCWFAFQYMWTYSAGAIAENVWHTTDASSVGYQEAGNWYGVLAAVQSVAAVICSF 
I M M | | M I : I I I I I I I I I I M 1 1 M I I I I I I I I : I I I I I 11 I I I I I II I I I I I I I 
143 TpvQFFCWFAFRYMWTYSAGAIAENVWHTTDASSVGHQEAGNRYGVLAAVXSVAAVICSF 
250 260 270 280 290 300 

310 320 330 340 350 360 

ml43 pep VLAKVPNKYHKAGYFGCLALGALGFFSVFFIGNQYALVLSYTLIGIAWAGIITYPLTIVT 

: | | | | U | | | | | | II I I M I I I I I I 1 I : I M 11111:111 I I I I I I I I M I 11 M M : 
Q l 43 ILAKVPNKYHKAGYFGCLALGALGFFSIFFIYNQYALILSYILIGIAWAGIITYPLTIVA 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml 4 3 pep NALSGKHMGTYLGLFNGSICMPQIVASLLSFVLFPMLGGLQATMFLVGGWLLLGAFSVF 

Mllllll II I I 1 I I I I : I I I I I I M I I II I M I I II I I I I t M I : I : MM 

g 14 3 NALSGKHMDTYLGLFNGSVCMPQIVASLLSFVLFPMLGGHQATMFLVAGAVLLLGAFSVC 

370 380 390 400 410 420 



430 

ml 4 3 . pep LIKETHGGVX 
I 11 I 1 I I I I 
gl4 3 LIKEIHGGVX 

430 

The following partial DNA sequence was identified inN. meningitidis <SEQ ID 569>: 

al43 . seq 

1 ATGCTCAGTT TCGGCTTTCT CGGCGTTCAG ACGGCCTTTA CCCTGCAAAG 

51 CTCGCAGATG AGCCGCATCT TCCAGACGCT CGGTGCCGAT CCGCACAGCC 

101 TCGGCTGGTT CTTTATCCTG CCGCCGCTGG CGGGGATGCT GGTGCAGCCG 

151 ATTGTCGGCC ATTACTCCGA CCGCACTTGG AAGCCGCGTT TGGGCGGCCG 

201 CCGTCTGCCG TATCTGCTTT ATGGCACGCT GATTGCGGTT ATTGTGATGA 

2 51 TTTTGATGCC GAACTCGGGC AGCTTCGGTT TCGGCTATGC GTCGCTGGCG 

301 GCTTTGTCGT TCGGCGCGCT GATGATTGCG CTGTTAGACG TGTCGTCAAA 

351 TATGGCGATG CAGCCGTTTA AGATGATGGT CGGCGACATG GTCAACGAGG 

4 01 AGCAGAAAGG CTACGCCTAC GGGATTCAAA GTTTCTTAGC GAATACGGGC 

4 51 GCGGTCGTGG CGGCGATTCT GCCGTTTGTG TTTGCGTATA TCGGTTTGGC 

501 GAACACCGCC GAGAAAGGCG TTGTGCCGCA GACCGTGGTC GTGGCGTTTT 

551 ATGTGGGTGC GGCGTTGCTG GTGATTACCA GCGCGTTCAC GATTTTCAAA 

601 GTGAAGGAAT ACAATCCGGA AACCTACGCC CGTTACCACG GCATCGATGT 

651 CGCCGCGAAT CAGGAAAAAG CCAACTGGAT CGAACTCTTG AAAACCGCGC 

7 01 CTAAGGCGTT TTGGACGGTT ACTTTGGTGC AATTCTTCTG CTGGTTCGCC 
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7 51 TTCCAATATA TGTGGACTTA CTCGGCAGGC GCGATTGCGG AAAACGTCTG 

801 GCACACCACC GATGCGTCTT CCGTAGGTTA TCAGGAGGCG GGTAACTGGT 

851 ACGGCGTTTT GGCGGCGGTG CAGTCGGTTG CGGCGGTGAT TTGTTCGTTT 

901 GTATTGGCGA AAGTGCCGAA' TAAATACCAT AAGGCGGGTT ATTTCGGCTG 

951 TTTGGCTTTG GGCGCGCTCG GCTTTTTCTC CGTTTTCTTC ATCGGCAACC 

1001 AATACGCGCT GGTGTTGTCT TATACCTTAA TCGGCATCGC TTGGGCGGGC 

1051 AT TAT C ACT T ATCCGCTGAC GATTGTGACC AACGCCTTGT CGGGCAAGCA 

1101 TATGGGCACT TACTTGGGCC TGTTTAACGG CTCTATCTGT ATGCCGCAAA 

1151 TCGTCGCTTC GCTGTTGAGT TTCGTGCTTT TCCCTATGCT GGGCGGCTTG 

1201 CAGGCCACTA TGTTCTTGGT AGGGGGCGTC GTCCTGCTGC TGGGCGCGTT 

12 51 TTCCGTGTTC CTGATTAAAG AAACACACGG CGGGGTTTGA 

This corresponds to the amino acid sequence <SEQ ID 570; ORF 143. a>: 

al4 3 .pep 

1 MLSFGFLGVQ TAFTLQSSQM SRIFQTLGAD PHSLGW FFIL FPLAGMLVQP 

51 IVG HY5DRTW KPRLGGRR LP YLLYGTLIAV IVMIL MPNSG SFG FGYASLA 

101 ALSFGALMIA LLDV SSNMAM QPFKMMVGDM VNEEQKGYAY GIQSFLANTG 

151 AWAAILPFV FAYIGLA NTA EKGVVPQT W VAFYVGAALL VITSA FTIFK 

201 VKEYNPETYA RYHGIDVAAN QEKANWIELL KTAPKAFWTV TLVQFFCWFA 

251 FQYMWTYSAG AIAENVWHTT DASSVGYQEA GNWYG VLAAV QSVAAVICSF 

301 VLAKVPNKYH KAGY FGCLAL GALGFFSVFF I GNQY ALVLS YTLIGIAWAG 

351 IITYPLTIVT NALSGKHMGT YLGLFNGSIC MPQ IVASLLS FVLFPMLGGL 

4 01 QATMF LVGGV VLLLGAJTSVF LI KETHGGV* 

ml43/al43 99.5% identity in 429 aa overlap 

10 20 30 40 50 60 

ml 4 3 .pep MLSFGFLGVQTAFTLQSSQMSRIFQTLGADPHNLGWFFILPPLAGMLVQPIVGHYSDRTW 
M I M I I I I I II I I I I I I I 1 M I I I I I 1 I M I : I I 1 1 1 It I I I I I I I I II I I ! II M M I 
a 14 3 MLSFGFLGVQTAFTLQSSQMSRIFQTLGADPHSLGWFFILPPLAGMLVQPIVGHYSDRTW 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml43 .pep KPRLGGRRLPYLLYGTLIAVIVMILMPNSGSFGFGYASLAALSFGALMIALLDVSSNMAM 
I I I I I 1 I I It I I I I I I I 1 I I I I I I I M I M I I I M I I I I I M I I I I I I I I I I II I I I I i I 
al4 3 KPRLGGRRLPYLLYGTLIAVIVMILMPNSGSFGFGYASLAALSFGALMIALLDVSSNMAM 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 4 3. pep QPFKMMVGDMVNEEQKGYAYGIQSFLANTGAVVAAILPFVFAY1GLANTAEKGVVPQTVV 
M I I M N 1 II I 1 II I I I I I 1 I I I i I I M I I N I I I M I i II I M I I I I I I I I I I I I I I I 
al4 3 QPFKMMVGDMWEEQKGYAYGIQSFLANTGAVVAAILPFVFAYIGLANTAEKGWPQTVV 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml4 3 .pep VAFYVGAALLVITSAFTIFKVKEYDPETYARYHGIDVAANQEKANWIELLKTAPKAFWTV 
M I M M I I I ! M I II M I I I I II : I ! ! I ! I M I I M I I I I I I I I ! I I M I I ! I I M I 11 
al4 3 VAFYVGAALLVIT SAFTI FKVKE YN PET YARYHGI DVAANQEKANWI ELLKTAPKAFWTV 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml 4 3 .pep TLVQFFCWFAFQYMWTYSAGAIAENVWHTTDASSVGYQEAGNWYGVLAAVQSVAAVICSF 
I I I II 1 I I i I M I I I I II ! I I I II i I I M M I I II II \ I I I I I I I I II I 11 I 1 II i I I M 
a 14 3 TLVQFFCWFA FQYMWTYSAG AIAENVWHTT DAS S VGYQEAGNWYGVLAAVQSVAAV I CS F 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml 4 3 . pep VLAKVPNKYHKAGYFGCLALGALGFFSVFFIGNQYALVLSYTLIGIAWAGIITYPLTIVT 
I I I I I I I I I I I I 1 I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
al 4 3 VLAKVPNKYHKAGYFGCLALGALGFFSVFFIGNQYALVLSYTLIGIAWAG IITYPLTIVT 

310 320 330 340 350 360 



ml 4 3 .pep 



370 380 390 400 410 420 

NALSGKHMGT YLGLFNGS I CMPQIVASLLSFVLFPMLGGLQATMFLVGGWLLLGAFSVF 
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408 



al43 



] I M I I I I i I I I I M I I U I I M i I I I I M I I M I M I M M I I I I I I I I I f M I I I I I I 
NALSGKHMGTYLGLFNGSICMPQIVASLLSFVLFPMLGGLQATOFLVGGVVLLLGAFSVF 
370 380 390 400 410 420 



ml 4 3 .pep 



430 

LIKETHGGVX 



al43 




430 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 57 1>: 



1 ATGAGCGATA CCCCCGCTAC CCGCGATTTC GGCCTGATCG ACGGGCGGGC 

51 CGTAACCGGC TATGTGCTGT CCAACCGGCG TGGTACGTGC GTCTTCGTGC 

101 TGGACTTGGG CGGGATTGTG CAGGAATTTT CCGTTTTGGC AGACGGCGTG 

151 CGCGAAAACC CCGTGGTGTC GTTCGACGAT GCGGCTTCCT ATGCGGACAA 

201 TCCGTTTCAG ATTAACAAGC AGATAGGGCG CGTGGCCGGA CGCATCCGCG 

2 51 GTGCGGCGTT CGACATCAAC GGTAGGACTT ACCGCGTGGA GGCCAACGAA 

301 GGCAGGAACG CGCTGCACGG CGGTTCGCAC GGGCTGGCCG TTACCcgtTT 

351 CAACGCGGTG GCGGCAGACG GccgacggTt atCCCAACGA TTTGGatatT 

4 01 TCctaccgCT TGGACGAGGA CGGCCGGCTT ACCGTtaccT ATCGCGCChC 

4 51 CGCgctCGGC GACACGGTGT TCGACCCGAC GCTGCACATT TACTGGCGGC 

501 TGGACGCGGG CCTGCACGAT GCGGTTCTGC ATATTCCGCA GGGCGGACAT 

551 ATTCCGGCCG AT G C CGAAAA ACTGCCCGTC TTAACGGTTT CAGACGGCCT 

601 CG AAGTATTT GA 



This corresponds to the amino acid sequence <SEQ ID 572; ORF 144.ng>: 



1 MSDTPATRDF GLIDGRAVTG YVLSNRRGT C VFVLDLGGIV QEFSVL ADGV 

51 RENPVVSFDD AASYADNPFQ INKQIGRVAG RIRGAAFDIN GRTYRVEANE 

101 GRNALHGGSH GLAVTRFNAV AADGRRLSQR FGYFLPLGRG RPAYRYLSRH 

151 RARRHGVRPD AAHLLAAGRG PARCGSAYSA GRTYSGRCRK TARLNGFRRP 

201 RSI* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ED 573>: 



1 ATGAGCGATA CCCCCGCTAC CCGCGATTTC GGTCTGATCG ACGGGCGTGC 

51 CGTAACCGGC TATGTGCTGT CCAACCGGCG TGGTACGCGT GTCTGCGTGC 

101 TGGACTTGGG CGGGATTGTG CAGGAATTTT CCGTTTTGGC AGACGGCGTG 

151 CGCGAAAACC TCGTGGTGTC GTTCGATGAT GCGGCTTCCT ATGCGGACAA 

201 TCCGTTTCAG ATTAACAAAC AGATAGGGCG CGTGGCCGGA CGCATCCGCG 

251 GTGCGGCGTT CGACATCAAC GGCAGGACTT ACCGCGTGGA GGCCAACGAA 

301 GGCAGGAACG CGCTGCACGG CGGTTCGCAC GGGCTGGCCG TTACCCGTTT 

351 CAACGCGGTG GCGGCAGACG GCCGTTCGGT GGTGCTGCGC AGCCGCCTGg 

4 01 CAACAGTCGG CCGACGGTTA TCCCAACGAT TTGGATTTGG ATATTTCCTA 

4 51 CCGCTTGGAC GAGGACGACC GGCTTACCGT TAcCTATCGC GCCACCGCGC 

501 TCGGCGACAC GGTGTTCGAC CCGACGCTGC ACATTTACTG GCGGCTGGAC 

551 GCGGGCCTGC ACGATGCGGT TCTGCATATT CCGCAGGGCG GACATATGCC 

601 GGCCGATGCC GAAAAACTGC CCGTCTCAAC GGTTTCAGAC GACCTCGAAG 

651 TATTTGA 



This corresponds to the amino acid sequence <SEQ ED 574; ORF 144>: 



1 MSDTPATRDF GLIDGRAVTG YVLSNRRGT R VCVLDLGGIV QEFSVL ADGV 

51 RENLWSFDD AASYADNPFQ INKQIGRVAG RIRGAAFDIN GRTYRVEANE 

101 GRNALHGGSH GLAVTRFNAV AADGRSWLR SRLATVGRRL SQRFGFGYFL 

151 PLGRGRPAYR YLSRHRARRH GVRPDAAHLL AAGRGPARCG SAYSAGRTYA 

201 GRCRKTARLN GFRRPRSI* 



gl44 . seq 



gl4 4 .pep 



m!44 . seq 



ml 4 4 . pep 



Computer analysis of this amino acid sequence gave the following results: 
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Homology with a predicted ORF from N. gonorrhoeae 

ml44 / gl44 91.3% identity in 218 aa overlap 

10 20 30 40 50 60 

ml 4 4 . pep MSDTPATRDFGLIDGRAVTGYVLSNRRGTRVCVLDLGGI VQEFSVLADGVRENLWSFDD 

I I I ! II I I II I I II I I M I ! I I I I I 1 I II I I I I I I I I I I I I M M 1 I I I I I I I I I I I 
g 14 4 MSDTPATRDFGLIDGRAVTGYVLSNRRGTCVFVLDLGGIVQEFSVLADGVRENPWSFDD 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 4 4 . pep AASYADNPFQINKQIGRVAGRIRGAAFDINGRTYRVEANEGRNALHGGSHGLAVTRFNAV 
I I I 1 I I I 1 I I I I I I I I I I I I I I I I I I 1 1 I I I I I M I I I II i I I M I I I M M I M I M I I 
gl4 4 AASYADNPFQINKQIGRVAGRIRGAAFDINGRTYRVEANEGRNALHGGSHGLAVTRFNAV 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 4 4 . pep AADGRSWLRSRLATVGRRLSQRFGFGYFLPLGRGRPAYRYLSRHRARRHGVRPDAAHLL 
Mi I 1 I I I I I I I II I I I I M I I I I I II I I M I I M I I I II M I II 

gl4 4 AAD GRRLSQRFG — YFLPLGRGRPAYRYLSRHRARRHGVRPDAAHLL 

130 140 150 160 

190 200 210 219 

ml 4 4 . pep AAGRGPARCGSAYSAGRTYAGRCRKTARLNGFRRPRSIX 
I I II I I M I I I I I 1 I I M I : I I I I I II I I I M I M I I I 1 
gl4 4 AAGRGPARCGSAYSAGRTYSGRCRKTARLNGFRRPRSIX 
170 180 190 200 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 575>: 

al4 4 . seq 

1 ATGAGCGATA CCCCCGCTAC CCGCGATTTC GGCCTGATCG ACGGGCGTGC 

51 CGTAACCGGC TATGTGCTGT CCAACCGGCG TGGTACGCGT GTCTGCGTGC 

101 TGGACTTGGG CGGGATTGTG CAGGAATTTT CCGTTTTGGC AGACGGCGTG 

151 CGCGAAAACC TCGTGGTGTC GTTCGACGAT GCGGCTTCCT ATGCGGACAA 

201 TCCGTTTCAG ATTAACAAGC AGATAGGGCG CGTGGCCGGA CGCATCCGCG 

2 51 GTGCGGCGTT CGACATCAAC GGCAGGACTT ACCGCGTGGA GGCCAACGAA 

301 GGCAGGAACG CGCTGCACGG CGGTTCGCAC GGGCTGGCCG TTACCCGTTT 

351 CAACGCGGTG GCGGCAGACG GCCGTTCGGT GGTGCTGCGC AGCCGCCTG . 

4 01 CAACAGTCGG CCGACGGTTA TCCCAACGAT TTGGATTTGG ATATTTCCTA 

4 51 CCGCTTGGAC GAGGACGACC GGCTTACCGT TACCTATCGC GCCACCGCGC 

501 TCGGCGACAC GGTGTTCGAC CCGACGCTGC ACATTTACTG GCGGCTGGAC 

551 GCGGGCCTGC ACGATGCGGT TCTGCATATT CCGCAGGGCG GACATATTCC 

601 GGCCGATGCC GAAAAACTGC CCGTCTCAAC GGTTTCAGAC GACCTCGAAG 

651 TATTTGA 

This corresponds to the amino acid sequence <SEQ ID 576; ORF 144.a>: 

al4 4 .pep 

1 MSDTPATRDF GLIDGRAVTG YVLSNRRGTR VCVLDLGGIV QEFSVLA DGV 

51 RENLVVSFDD AASYADNPFQ INKQIGRVAG RIRGAAFDIN GRTYRVEANE 

101 GRNALHGGSH GLAVTRFNAV AADGRSWLR SRLXTVGRRL SQRFGFGYFL 

151 PLGRGRPAYR YLSRHRARRH GVRPDAAHLL AAGRGPARCG SAYSAGRTYS 

201 GRCRKTARLN GFRRPRSI* 

ml44/al44 99.1% identity in 218 aa overlap 

10 20 30 40 50 60 

ml 4 4 .pep MSDTPATRDFGLIDGRAVTGYVLSNRRGTRVCVLDLGGIVQEFSVLADGVRENLWSFDD 
I I I I I I I I II I I I I I I I I I H I I I I I I I I I t I I I I I I I I I I I I I I I I I I I I I I I I I I II I 
al4 4 MSDTPATRDFGLIDGRAVTGYVLSMRRGTRVCVLDLGGIVQEFSVLADGVRENLWSFDD 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml 4 4 .pep AASYADNPFQINKQIGRVAGRIRGAAFDINGRTYRVEANEGRNALHGGSHGLAVTRFNAV 
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I | | | | | M I I I I I I II I 11 I I I I I I ! M I M I M I I 1 I i I I M I I I II I I I I II II M M 
AASYADNPFQINKQIGRVAGRIRGAAFDINGRTYRVEANEGRNALHGGSHGLAVTRFNAV 

70 80 90 100 110 120 

130 140 150 160 170 180 

AADGRSWLRSRLATVGRRLSQRFGFGYFLPLGRGRPAYRYLSRHRARRHGVRPDAAHLL 

I |t M II I M I I I I II I I M I I M I I I I I II I I I I I I I I I M I II I I I I I I M M M ! 1 
AADGRSWLRSRLXTVGRRLSQRFGFGYFLPLGRGRPAYRYLSRHRARRHGVRPDAAHLL 

130 140 150 160 170 180 

190 200 210 219 

AAGRGPARCGSAYSAGRTYAGRCRKTARLNGFRRPRSIX 

I | M I I I I I I I M I I I I I I : I I I I I I I M I I I I I I M I I 
AAGRG PARCGSAYSAGRTYSGRCRKTARLNGFRRPRSIX 
190 200 210 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 577>: 

gl4 6 . seq 

1 ATGAAGCAAA TCCCCCTCCG CCTTCTCCAG GTCGTCATTG ACCACGACAA 

51 AGTCGAACAA TACGGACTGT TCGATTTCAT GCCTTGCCTT CGACAGCCTC 

101 CTTTGGATAa ctTCCCGACT GTCCGTCCCG CGCcctTTGA GGCGCGCGGC 

151 AAGCACGTCG AAAGAAGGCG G C AGG AT AAA GATACCGACA GCTTCCGGCA 

201 GCGCGTTGCG AACCTGCGCC GCGCCCTGAA CGTCGATTTC CAAAATCACG 

2 51 TCATAGCCTG CCGCCGCCAA CGCATTCACG CCCTCCGTGC TTGTGCCGTA 

301 ATAGTTGCCG AATACGTCTG CGTATTCCAA AAAAGCCTCC TGCGCGATAA 

351 GCGATTCAAA CTCTTCTTTG GAAACAAAGT GATAATGTAC GCCGTTTGCT 

401 TCGCCTTCAC GCGGCGGGCG CGTCGTATGC GACACGGAAA CGCGCAAACC 

4 51 GTTATGGTTT GCCAACAGCC GCGACACCAG CGTGGTTTTG CCCGTGCCGG 

501 AAGCGGCCGA AATGATAAAG ATGTTGCCTT TTCGATAAGC GGACATATTT 

551 TTTACCTGTA TATTTTCCAA CCGATTGTAT CACAACGGAC ACCCTATTTC 

601 ATATTTGCCG ATGCCCATAT TTTGCCGCTA TTGTTTTGA 

This corresponds to the amino acid sequence <SEQ ID 578; ORF 146.ng>: 

gl46 -pep 

1 MKQIPLRLLQ VVIDHDKVEQ YGLFDFMPCL RQPPLDNFPT VRPAPFEARG 

51 KHVERRRQDK DTDSFRQRVA NLRRALNVDF QNHVIACRRQ R IHALRACAV 

101 IVAEYVCVFQ KSLLRDKRFK LFFGNKVIMY AVCFAFTRRA RRMRHGNAQT 

151 VMVCQQPRHQ RGFARAGSGR NDKDVAFSIS GHIFYLYIFQ PIVSQRTPYF 

201 IFADAHILPL LF* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 579>: 

ml 4 6 . seq 

1 ATGGCGCAAA TCCTCCTCCG CTCGCGCCAA GTCGTCATTG ACCACGACAA 

51 AGT CAAACAA TACGGACTGC TCGATTTCAT GCCTTGCCTT CGACAGCCTC 

101 CTTTGGATAA CTTCCCGACT GTCCGTCCCG CGTCCGTTGA GGCGCGCGGC 

151 AAGTACGTCG AAAGAAGGCG GCAGGATAAA GATGCCGACG GCTTCGGGCA 

201 GCGCGTCGCG AACCTGCGCC GCGCCCTGAA CGTCGATTTC CAAAATCACG 

251 TCATAGCCTG CCGCCGCCAA CGCATTCACA CCCTCCGCGC CTGTGCCGTA 

301 ATAGTTGCCA AATACGTCGG CGTATTCCAA AAAAGCTTCC TGCGCGATAA 

351 GCGACTCAAA CTCTTCTTTG GAAACAAAGT GATAATGTAC GCCGTTTGCT 

4 01 TCGCCTTCAC GCGGCGGGCG CGTCGTGTGC GACACGGAAA CGCGCAAACC 

4 51 GTTATGGTTT GCCAACAGCC GCGACACCAG CGTGGTTTTG CCCGTGCCGG 

501 AAGCGGCCGA AATGATAAAG ATGTTGCCTT TTCGATAAGC GGACATATTT 

551 TTTACCTGTA TATTTTCCAG CCGATTGTAT CACAATGGAC ACCCAGTTTC 

601 CTATTTGCCG ATGCCCATAT TTTGCCGCTA TTGTTTTGA 

This corresponds to the amino acid sequence <SEQ ED 580; ORF 146>: 

ml 4 6 . pep 

1 MAQILLRSRQ WIDHDKVKQ YGLLDFMPCL RQPPLDNFPT VR PASVEARG 

51 KYVERRRQDK DADGFGQRVA NLRRALNVDF QNHVIACRRQ RIHTLRACAV 

101 IVAKYVGV FQ KSFLRDKRLK LFFGNKVIMY AVCFAFTRRA RRVRHGNAQT 

151 VMVCQQPRHQ RGFARAGSGR NDKDVAFSIS GHIFYLYIFQ PIVSQWTPSF 

201 LFADAHILPL LF* 



ml 4 4 . pep 

al44 

ml 4 4 . pep 
al44 
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Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ml46 / gl46 90.1% identity in 212 aa overlap 

10 20 30 40 50 60 

mi4 6 pep maqillrsrqvvidhdkvkqyglldfmpclrqppldnfptvrpasveargkyverrrqdk 

I || || I 1 I I I I M I : I M I : I I II It It II I M I I M ! I I 11111:11111111 
ql46 MKQIPLRLLQVVIDHDKVEQYGLFDFMPCLRQPPLDNFPTVRPAPFEARGKHVERRRQDK 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 4 6 pep DADGFGQRVANLRRALNVDFQNHVIACRRQRIHTLRACAVIVAKYVGVFQKSFLRDKRLK 
|:|:| | | | M M I I 1 I 1 M M I I I I I I I I I 1 I : II M I I I I I : I t I I I M : I I M I : I 
ql46 DTDSFRQRVANLRRALNVDFQNHVIACRRQRIHALRACAVIVAEYVCVFQKSLLRDKRFK 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 4 6 pep LFFGNKVIMYAVCFAFTRRARRVRHGNAQTVMVCQQPRHQRGFARAGSGRNDKDVAFSIS 

| | I | ! | | | | M I I I I I I I I I I I : I I It I I I I II I M I I t I I ! I I I I M I I I I I I I I I M I 
ql46 LFFGNKVIMYAVCFAFTRRARRMRHGNAQTVMVCQQPRHQRGFARAGSGRNDKDVAFSIS 

130 140 150 160 170 180 

190 200 210 

ml46.pep GHIFYLYIFQPIVSQWTPSFLFADAHILPLLFX 
I I I I II II I I 1 I M I M I : M I I I II I I I I I 
g 14 6 GHI FYLYI FQPIVSQRT PYFI FADAHILPLLFX 

190 200 210 

The following partial DNA sequence was identified in//, meningitidis <SEQ ID 58 1>: 

al4 6 . seq 

1 ATGGCGCAAA TCCTCCTCCG CCCGCGCCAA GTCATCATTG AC C AC G AC AA 

51 AATCGAACAA TACGGACTGT TCGATTTCAT GCCTTGCCTT CGACAGCCTC 

101 CTTTGGATAA CTTCCCGACT GTCCGTCCCG CGTCCGTTGA GACGCGCAGC 

151 AAGCACATCG AAAGACGGCG GCAGGATAAA GATGCCGACG GCTTCGGGCA 

2 01 GCGCATCTCG AACCTGAGCC GCGCCCTGAA CGTCGATTTC CAAAATCACG 

2 51 TCATAACCTG CCGCCGCCAA CGCATTCACA CCCTCCGCGC TTGTGCCGTA 

301 ATAGTTGCCG AACACGTCCG CGTATTCCAA AAAAGCCTCC TGCGCGATAA 

351 GCGACTCAAA CTCTTCTTTG GAAACAAAGT GATAATGTAC GCCGTTTGCT 

4 01 TCGCCTTCAC GCGGCGGACG CGTCGTGTGC GACACGGAAA CGCGCAAACC 

4 51 GTTATGGTTT GCCAACAGCC GCGACACCAG CGTGGTTTTG CCCGTGCCGG 

501 AAGCGGCCGA AATGATAAAG ATGTTGCCTT TTCGATAAGC GGACATATTT 

551 TTTACCTGTA TATTTTCCAG CCGATTGTAT CACAACGGAC ACCCGGTTTC 

601 CTATTTGCCG ATGCCCATAT TTTGCCGCTA TTGTTTTGA 

This corresponds to the amino acid sequence <SEQ ID 582; ORF 146.a>: 

a!4 6 . pep 

1 MAQILLRPRQ VIIDHDKIEQ YGLFDFMPCL RQPPLDNFPT VRPASVETRS 

51 KHIERRRQDK DADGFGQRIS NLSRALNVDF QNHVITCRRQ RIHTLRACAV 

101 IVAEHVRV FQ KSLLRDKRLK LFFGNKVIMY AVCFAFTRRT RRVRHGNAQT 

151 VMVCQQPRHQ RGFARAGSGR NDKDVAFSIS GHI FYLYI FQ PIVSQRTPGF 

2 01 LFADAHILPL LF* 

ml46/al46 90.6% identity in 212 aa overlap 

10 20 30 40 50 60 

ml 4 6 pep MAQILLRSRQWIDHDKVKQYGLLDFMPCLRQPPLDNFPTVRPASVEARGKYVERRRQDK 
I | | | M | | I I : I I I I I :: I I I I : I I I I I I I M I I I I I I I I I I I I I I : I : I - I I I I t I 1 
al46 MAQILLRPRQVIIDHDKIEQYGLFDFMPCLRQPPLDNFPTVRPASVETRSKHIERRRQDK 

10 20 30 40 50 60 

70 80 90 1 100 110 120 
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^ 412 ^ 

ml 4 6 pep dadgfgqrvanlrralnvdfqnhviacrrqrihtlracavivakyvgvfqksflrdkrlk 

I I I I I I I I : : M I I II M M I I M : M I I I II I I I I M I M I : : I M I I I : I I I I M I 
al46 DADGFGQRISNLSRALNVDFQNHVITCRRQRIHTLRACAVIVAEHVRVFQKSLLRDKRLK 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml4 6 pep LFFGNKVIMYAVCFAFTRRARRVRHGNAQTVMVCQQPRHQRGFARAGSGRNDKDVAFSIS 
| | M l | J | M || I II I II I : I II I I I M I 1 I I II I ! II I I I I ! ! I ! I I M I ! I I i I M I ! 
al 46 LFFGNKVIMYAVCFAFTRRTRRVRHGNAQTVMVCQQPRHQRG FARAGSGRNDKDVAFS IS 

130 140 150 160 170 180 

190 200 210 

ml4 6 . pep GHIFYLYIFQPIVSQWTPSFLFADAHILPLLFX 
I M I I I I I I I I II I I I I : I II M I I I I I I I I I 
a!4 6 GHIFYLYIFQPIVSQRTPGFLFADAHILPLLFX 

190 200 210 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 583>: 

gl47 . seq (partial) 

1 . .ATGCGACGAG AAGCCAAAAT GGCACAAATC ACACTCAAAC CCATTGTTTT 

51 ATCAATTCTT TTAATCAACA CACCCCTCCT CGCCCAAGCG CATGAAACTG 

101 AGCAATCGGT GGGCTTGGAA ACGGTCAGCG TCGTCGGCAA AAGCCGTCCG 

151 CGCGCGACTT CGGGGCTGCT GCACACTTCG ACCGCCTCCG ACAAAATCAT 

201 CTCCGGCGAT ACTTTGCGCC AAAAAGCCGT CAACTTGGGC GACGCTTTGG 

251 ACGGCGTACC GGGCATCCAC GCTTCGCAAT ACGGCGGCGG CGCATCCGCT 

301 CCCGTTATTC GCGGTCAAAC GGGCAGACGG ATTAAAGTAT TGAACCATCA 

351 CGGCGAAACG GGCGATATGG CGGACTTTTC TCCCGATCAC GCCATTATGG 

4 01 TAGATACCGC CTTGTCGCAA CAGGTTGAAA TCCTGCGCGG GCCGGTTACG 

4 51 CTCTTGTACA GCTCGGgcaa tgtggccgGG GCTGGtcaat gttgccgatg 

501 gAAAAAtccc ccaaaaAAtg cc . 

This corresponds to the amino acid sequence <SEQ ID 584; ORF 147.ng>: 

gl47.pep (partial) 

1 . . MRREAKMAQI TLKPIVLSIL LINTPLLAQA HETEQSVGLE TVSWGKSRP 

51 RATSGLLHTS TASDKIISGD TLRQKAVNLG DALDGVPGIH ASQYGGGASA 

101 PVIRGQTGRR IKVLNHHGET GDMADFSPDH AIMVDTALSQ QVEILRGPVT 

151 LLYSSGNVAG AGQCCRWKNP PKNA. . 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 585>: 

ml47.seq (partial) 

1 . . CCGCATAAAA CTGAGCAATC GGTGGATTTG GAAACGGTCA GCGTCGTCGG 

51 CAAAAGCCGT CCGCGCGCCA CGTCGGGGCT GTTGCACACT TCGACCGCCT 

101 CCGACAAAAT CATCTCCGGC GATACCTTGC GCCAAAAAGC CGTCAACTTG 

151 GGCGACGCTT TAGACGGCGT ACCGGGCATC CACGCTTCGC AATACGGCGG 

2 01 CGGCGCGTCT GCTCCCGTCA TTCGCGGTCA AACAGGCAGG CGGATTAAAG 

251 TGTTGAACCA TCACGGCGAA ACAGGCGATA TGGCGGATTT TTCGCCCGAT 

301 CACGCCATTA TGGTAGATAC CGCCTTGTCG CAACAGGTCG AAATCCTGCG 

351 CGGGCCGGTT ACGCTCTTGT ACAGCTCGGG CAATGTGGCG GGGCTGGTCG 

4 01 ATGTTGCCGA TGGCAAAATC CCCGAAAAAA TGCCTGAAAA CGGCGTATCG 

4 51 GGCGAACTCG GATTGCGTTT GAGCAGCGGC AATCTGGAAA AACTCACGTC 

501 CGGCGGCATC AATATCGGTT TGGGCAAAAA CTTTGTATTG CACACGGAAG 

551 GGCTGTACCG CAAATCGGGG GATTACGCCG TACCGCGTTA CCGCAATCTG 

601 AAACGCCTGC CCGACAGCCA CGCCGATTCG CAAACGGGCA GCATCGGGCT 

651 GTCTTGGGTT GGCGAAAAAG GTTTTATCGG CGTAGCGTAC AGCGACCGTC 

7 01 GCGACCAATA TGGTCTGCCT GCCCACAGCC ACGAATACGA TGATTGCCAC 

7 51 GCCGACATCA TCTGGCAAAA GAGCTTGATT AACAAACGCT ATTTACAGCT 

801 TTATCCGCAC CTGTTGACCG AAGAAGACAT CGATTACGAC AATCCGGGCT 

851 TGAGCTGCGG CTTCCACGAC GACGATAATG CACACGCACA CACCCACAGC 

901 GGCAGACCGT GGATAGACCT GCGCAACAAA CGCTACGAAC TCCGTGCCGA 

951 ATGGAAGCAA CCGTTCCCCG GTTTTGAAGC CCTGCGCGTA CACCTGAACC 

1001 GCAACGACTA CCGCCACGAC GAAAAAGCAG GCGATGCAGT CGAAAACTTT 

1051 TTTAACAACC AAACGCAAAA CGCCCGCATC GAGTTGCGCC ACCAACCCAT 
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1101 AGGTCGTCTG AAAGGCAGCT GGGGCGTGCA ATATTTACAA CAAAAATCCA 

1151 GTGCTTTATC TGCCATATCC GAAGCGGTTA AACAACCGAT GCTGCTTGAC 

12 01 AACAAAGTGC AACATTACAG CTTTTTCGGT GTAGAACAGG CAAACTGGGA 

12 51 CAACTTCACG CTTGAAGGAG GCGTACGCGT GGAAAAACAA AAAGCCTCCA 

1301 TTCAGTACGA CAAAGCATTG ATTGATCGGG AAAACTACTA CAACCACCCC 

1351 CTGCCCGACC TCGGCGCGCA CCGCCAAACC GCCCGCTCAT TCGCACTTTC 

14 01 GGGCAACTGG TATTTCACGC CACAACACAA ACTCAGCCTG ACCGCCTCCC 

14 51 ATCAGGAACG CCTGCCGTCA ACGCAAGAGC TGTACGCACA CGGCAAACAC 

1501 GTCGCCACCA ACACCTTTGA AGTCGGCAAC AAACACCTCA ACAAAGAGCG 

1551 TTCCAACAAT ATCGAACTCG CGCTGGGCTA CGAAGGCGAC CGCTGGCAAT 

1601 ACAATCTGGC ACTCTACCGC AACCGCTTCG GTAACTACAT TTACGCCCAA 

1651 ACCTTAAACG ACGGACGCGG CCCCAAATCC ATCGAAGACG ACAGCGAAAT 

17 01 GAAGCTCGTG CGCTACAACC AATCCGGCGC CGACTTCTAC GGCGCGGAAG 

17 51 GCGAAATCTA CTTCAAACCG ACACCGCGCT ACCGCATCGG CGTTTCCGGC 

18 01 GACTATGTAC GAGGCCGTCT GAAAAACCTG CCTTCCCTAC CCGGCAGAGA 
1851 AGATGCCTAC GGCAACCGTC CTTTCATCGC ACAGGACGAC CAAAATGCCC 
1901 CCCGTGTTCC GGCTGCGCGC CTCGGCTTCC ACCTGAAAGC CTCGCTGACC 
1951 GACCGTATCG ATGCCAATTT GGACTACTAC CGCGTGTTCG CCCAAAACAA 
2 001 ACTCGCCCGC TACGAAACGC GCACGCCCGG ACACCATATG CTCAACCTCG 
2051 GCGCAAACTA CCGCCGCAAT ACGCGCTATG GCGAGTGGAA TTGGTACGTC 
2101 AAAGCCGACA ACCTGCTCAA CCAATCCGTT TACGCCCACA GCAGCTTTCT 
2151 CTCTGATACG CCGCAAATGG GCCGCAGCTT TACCGGCGGC GTGAACGTGA 
2201 AGTTTTAA 

This corresponds to the amino acid sequence <SEQ ID 586; ORF 147>: 

ml 4 7. pep (partial) 

1 . . PHKTEQSVDL ETVSVVGKSR PRATSGLLHT STASDKIISG DTLRQKAVNL 

51 GDALDGVPGI HASQYGGGAS APVIRGQTGR RIKVLNHHGE TGDMADFSPD 

101 HAIMVDTALS QQVEILRGPV TLLYSSGNVA GLVDVADGKI PEKMPENGVS 

151 GELGLRLSSG NLEKLTSGGI NIGLGKNFVL HTEGLYRKSG DYAVPRYRNL 

201 KRLPDSHADS QTGSIGLSWV GEKGFIGVAY SDRRDQYGLP AHSHEYDDCH 

2 51 ADIIWQKSLI NKRYLQLYPH LLTEEDIDYD NPGLSCGFHD DDNAHAHTHS 
301 GRPWIDLRNK RYELRAEWKQ PFPGFEALRV HLNRNDYRHD EKAGDAVENF 

3 51 FNNQTQNARI ELRHQPIGRL KGSWGVQYLQ QKSSALSAIS EAVKQPMLLD 

4 01 NKVQHYSFFG VEQANWDNFT LEGGVRVEKQ KASIQYDKAL IDRENYYNHP 
4 51 LPDLGAHRQT ARSFALSGNW YFTPQHKLSL TASHQERLPS TQELYAHGKH 
501 VATNTFEVGN KHLNKERSNN IELALGYEGD RWQYNLALYR NRFGNYIYAQ 
551 TLNDGRGPKS IEDDSEMKLV RYNQSGADFY GAEGEIYFKP TPRYRIGVSG 
601 DYVRGRLKNL PSLPGREDAY GNRPFIAQDD QNAPRVPAAR LGFHLKASLT 
651 DRIDANLDYY RVFAQNKLAR YETRTPGHHM LNLGANYRRN TRYGEWNWYV 
701 KADNLLNQSV YAHSSFLSDT PQMGRSFTGG VNVKF* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ml47 / gl47 92.3% identity in 142 aa overlap 

10 20 30 

ml47 . pep PHKTEQSVDLETVSWGKSRPRATSGLLHTS 

I : I ! I I I I I I I I I M I I I 1 I I I I I i It I I 
gl 4 7 MRREAKMAQITLKPIVLSILLINTPLLAQAHETEQSVGLETVS WGKSRPRATSGLLHTS 

10 20 30 40 50 60 

40 50 60 70 80 90 

ml47 .pep TASDKIISGDTLRQKAVNLGDALDGVPGIHASQYGGGASAPVIRGQTGRRIKVLNHHGET 
I I I I I I I I I I I M I I I I I I I I I I I I I I I M I I I I I I I If I I I I I I I I t M I I I I I I I I I I 
gl4 7 TASDKI ISGDTLRQKAVNLGDALDGVPGIHASQYGGGASAPVIRGQTGRRIKVLNHHGET 

70 80 90 100 110 120 

100 110 120 130 140 150 

ml47 . pep GDMADFSPDHAIMVDTALSQQVEILRG PVTLLYSSGNVAGLVDVADGKI PEKMPENGVSG 

I 1 M I I I I I M I I I I I I I I | | | | | | | M | | | | | | | | 1 | | | : | | | 

g!4 7 GDMADFSPDHAIMVDTALSQQVEILRG PVTLLYSSGNVAGAGQCCRWKNPPKNA 
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The following partial DNA sequence was identified in N. meningitidis <SEQ ID 587>: 

al4 7 . seq 

1 ATGCGACGAG AAGCCAAAAT GGCACAAACT ACACTCAAAC CCATTGTTTT 

51 ATCAATTCTT TTAATCAACA CACCCCTCCT CTCCCAAGCG CATGGAACTG 

101 AGCAATCAGT GGGCTTGGAA ACGGTCAGCG TCGTCGGCAA AAGCCGTCCG 

151 CGCGCCACTT CGGGGCTGCT GCACACTTCT ACCGCCTCCG ACAAAATCAT 

201 CAGCGGCGAC ACCTTGCGAC AAAAAGCCGT CAACTTGGGT GATGCTTTAG 

2 51 ACGGCGTACC GGGCATTCAT GCCTCGCAAT ACGGCGGCGG CGCATCCGCT 

301 CCCGTTATTC GCGGTCAAAC AGGCAGACGG ATTAAAGTGT TGAACCATCA 

351 CGGCGAAACG GGCGACATGG CGGACTTCTC TCCAGACCAT GCAATCATGG 

4 01 TGGACAGCGC CTTGTCGCAA CAGGTCGAAA TCCTGCGCGG TCCGGTTACG 

4 51 CTCTTGTACA GCTCGGGCAA TGTGGCGGGG CTGGTCGATG TTGCCGATGG 

501 CAAAATCCCC GAAAAAATGC CTGAAAACGG CGTATCGGGC GAACTCGGAT 

551 TGCGTTTGAG CAGCGGCAAT CTGGAAAAAC TCACGTCCGG CGGCATCAAT 

601 ATCGGTTTGG GCAAAAACTT TGTATTGCAC ACGGAAGGGC TGTACCGCAA 

651 ATCGGGGGAT TACGCCGTAC CGCGTTACCG CAATCTGAAA CGCCTGCCCG 

7 01 ACAGCCACGC CGATTCGCAA ACGGGCAGCA TCGGGCTGTC TTGGGTTGGC 

7 51 GAAAAAGGCT TTATCGGCGC AGCATACAGC GACCGTCGCG ACCAATATGG 

8 01 TCTGCCTGCC CACAGCCACG AAT AC G AT G A TTGCCACGCC GACATCATCT 
851 GGCAAAAGAG TTTGATTAAC AAACGCTATT TGCAGCTTTA TCCGCACCTG 
901 TTGACCGAAG AAGACATCGA TTACGACAAT CCGGGCTTGA GCTGCGGCTT 
951 TCACGACGAC GATGATGCAC ACGCCCATGC CCACAACGGC AAACCTTGGA 

1001 TAGACCTGCG CAACAAACGC TACGAACTCC GCGCCGAATG GAAGCAACCG 

1051 TTCCCCGGTT TTGAAGCCCT GCGCGTACAC CTGAACCGCA ACGACTACCG 

1101 CCACGACGAA AAAGCAGGCG ATGCAGTAGA AAACTTTTTT AACAACCAAA 

1151 CGCAAAACGC CCGTATCGAG TTGCGCCACC AACCCATAGG CCGTCTGAAA 

12 01 GGCAGCTGGG GCGTGCAATA TTTGGGACAA AAATCCAGTG CTTTATCTGC 

12 51 CACATCCGAA GCGGTCAAAC AACCGATGCT GCTTGACAAT AAAGTGCAAC 
1301 ATTACAGCTT TTTCGGTGTA GAACAGGCAA ACTGGGACAA CTTCACGCTT 

13 51 GAAGGCGGCG TACGCGTGGA AAAACAAAAA GCCTCCATCC GCTACGACAA 
1401 AG C ATT GAT T GATCGGGAAA ACTACTACAA CCATCCCCTG CCCGACCTCG 

14 51 GCGCGCACCG CCAAACCGCC CGCTCATTCG CACTTTCGGG CAACTGGTAT 
1501 TTCACGCCAC AACACAAACT CAGCCTGACC GCCTCCCATC AGGAACGCCT 
1551 GCCGTCAACG CAAGAGCTGT ACGCACACGG CAAACACGTC GCCACCAACA 
1601 CCTTTGAAGT CGGCAACAAA CACCTCAACA AAGAGCGTTC C AAC AAT AT C 
1651 GAACTCGCGC TGGGCTACGA AGGCGACCGC TGGCAATACA ATCTGGCACT 
1701 CTACCGCAAC CGCTTCGGCA ACTACATTTA CGCCCAAACC TTAAACGACG 
1751 GACGCGGCCC CAAATCCATC G AAG AC G AC A GCGAAATGAA GCTCGTGCGC 
1801 T AC AAC C AAT CCGGTGCGGA CTTCTACGGC GCGGAAGGCG AAATCTACTT 
1851 CAAACCGACA CCGCGCTACC GCATCGGCGT TTCCGGCGAC TATGTACGAG 
1901 GCCGTCTGAA AAACCTGCCT TCCCTACCCG GCAGGGAAGA CGCCTACGGC 
1951 AACCGCCCAC TCATTGCCCA AGCCGACCAA AACGCCCCTC GCGTTCCGGC 
2001 TGCGCGCCTC GGCGTCCACC TGAAAGCCTC GCTGACCGAC CGCATCGATG 
2051 CCAATTTGGA CTACTACCGC GTGTTCGCCC AAAACAAACT CGCCCGCTAC 
2101 GAAACGCGCA CGCCCGGACA CCATATGCTC AACCTCGGCG CAAACTACCG 
2151 CCGCAATACG CGCTATGGCG AGTGGAATTG GTACGTCAAA GCCGACAACC 
2201 TGCTCAACCA ATCCGTTTAC GCCCACAGCA GCTTCCTCTC TGATACGCCG 
2251 CAAATGGGCC GCAGCTTTAC CGGCGGCGTG AACGTGAAGT TTTAA 

This corresponds to the amino acid sequence <SEQ ID 588; ORF 147. a>: 

al4 7.pep 

1 MRREAKMAQt TLKPIVLSIL LINTPLLSQA HG TEQSVGLE TVSWGKSRP 

51 RATSGLLHTS TASDKIISGD TLRQKAVNLG DALDGVPGIH ASQYGGGASA 

101 PVIRGQTGRR IKVLNHHGET GDMADFSPDH AIMVDSALSQ QVEILRGPVT 

151 LLYSSGNVAG LVDVADGKIP EKMPENGVSG ELGLRLSSGN LEKLTSGGIN 

201 IGLGKNFVLH TEGLYRKSGD YAVPRYRNLK RLPDSHADSQ TGSIGLSWVG 

251 EKGFIGAAYS DRRDQYGLPA HSHEYDDCHA DIIWQKSLIN KRYLQLYPHL 

301 LTEEDIDYDN PGLSCGFHDD DDAHAHAHNG KPWI DLRNKR YELRAEWKQP 

351 FPGFEALRVH LNRNDYRHDE KAGDAVENFF NNQTQNARIE LRHQPIGRLK 

401 GSWGVQYLGQ KSSALSATSE AVKQPMLLDN KVQHYSFFGV EQANWDNFTL 

4 51 EGGVRVEKQK ASIRYDKALI DRENYYNHPL PDLGAHRQTA RSFALSGNWY 

501 FTPQHKLSLT ASHQERLPST QELYAHGKHV ATNTFEVGNK HLNKERSNNI 

551 ELALGYEGDR WQYNLALYRN RFGNYIYAQT LNDGRGPKSI EDDSEMKLVR 
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601 YNQSGADFYG AEGEI YFKPT PRYRIGVSGD YVRGRLKNLP SLPGREDAYG 

651 NRPLIAQADQ NAPRVPAARL GVHLKASLTD RIDANLDYYR VFAQNKLARY 

701 ETRTPGHHML NLGANYRRNT RYGEWNWYVK ADNLLNQSVY AHSS FLSDTP 

7 51 QMGRSFTGGV NVKF* 

ml47/al47 98.1% identity in 734 aa overlap 

10 20 30 

ml47 .pep PHKTEQSVDLETVSWGKSRPRATSGLLHTS 

I Mill I I I I I I f I I I 1 M I I I I t M I I 
a l4 7 MRREAKMAQTTLKPIVLSILLINTPLLSQAHGTEQSVGLETVSWGKSRPRATSGLLHTS 

10 20 30 40 50 60 

40 50 60 70 80 90 

ml47 .pep TASDKIISGDTLRQKAVNLGDALDGVPGIHASQYGGGASAPVIRGQTGRRIKVLNHHGET 
I I t I I I I ! I I I I I I I i I I I I I I M I I I I I I I I M I I I I I I i I I I I I I I I M M I I I I I I I 
a 1 4 7 TASDKI I SGDTLRQKAVNLGDALDGVPGIHASQYGGGASAPVIRGQTGRRIKVLNHHGET 

70 80 90 100 110 120 

100 110 120 130 140 150 

ml47 .pep GDMADFSPDHAIMVDTALSQQVEILRGPVTLLYSSGNVAGLVDVADGKIPEKMPENGVSG 
I I I I ! I I I I I I I I I I : I M I I I I M I I I I I I II I M I I I I 1 i I I I I I I M M I I I I I I ! I 
a 14 7 GDMADFSPDHAIMVDSALSQQVEILRGPVTLLYSSGNVAGLVDVADGKIPEKMPENGVSG 

130 140 150 160 170 180 

160 170 180 190 200 210 

ml 4 7 .pep ELGLRLSSGNLEKLTSGGINIGLGKNFVLHTEGLYRKSGDYAVPRYRNLKRLPDSHADSQ 
I I I I I I I I I M 11 I I I I t I I i I I I ! I M I I I 11 I I I I I I I I I II II M I I I I I I M I I i I 
a 14 7 ELGLRLSSGNLEKLTSGGINIGLGKNFVLHTEGLYRKSGDYAVPRYRNLKRLPDSHADSQ 

190 200 210 220 230 240 

220 230 240 250 260 270 

ml47 .pep TGSIGLSWVGEKGFIGVAYSDRRDQYGLPAHSHEYDDCHADIIWQKSLINKRYLQLYPHL 
I I I I M I I I I I II 1 I I : t I I I I M I I I I I I 11 I I I I I I I I M I I I M I I I I I I I 1 I I M I 
al4 7 TGSIGLSWVGEKGFIGAAYSDRRDQYGLPAHSHEYDDCHADIIWQKSLINKRYLQLYPHL 

250 260 270 280 290 300 

280 290 300 310 320 330 

ml 4 7 .pep LTEEDIDYDNPGLSCGFHDDDNAHAHTHSGRPWIDLRNKRYELRAEWKQPFPGFEALRVH 
I I I I II I I I M I I I I M I I I I : I I I I : I : I : I I I I I I I I t I I t I I I I I II I II I I I I I E I 
al4 7 LTEEDIDYDNPGLSCGFHDDDDAHAHAHNGKPWIDLRNKRYELRAEWKQPFPGFEALRVH 

310 320 330 340 350 360 

340 350 360 370 380 390 

ml 4 7 . pep LNRNDYRHDEKAGDAVENFFNNQTQNARIELRHQPIGRLKGSWGVQYLQQKSSALSAISE 
I I M I I M I I t M I I I I I I I I II I I I I I I I I M I I I I I I I I M I I I I I I I I I I I I I II 
a 14 7 LNRNDYRHDEKAGDAVENFFNNQTQNARIELRHQPIGRLKGSWGVQYLGQKSSALSATSE 

370 380 390 400 410 420 

400 410 420 430 440 450 

ml47 . pep AVKQPMLL DN KVQH Y S FFGVEQANW DN FT LEGGVRVEKQKAS I QY DKAL I DREN Y YNH PL 

I I I I II I I I I I I I I I I I I I I M I M I I I I I II I 1 I I I I I I I I I : I I I I I It I I I I I I I I I 
a 14 7 AVKQPMLLDNKVQH Y S FFGVEQANWDN FT LEGGVRVEKQKAS I RYDKAL I DREN Y YNH PL 

430 440 450 460 470 480 

460 470 480 490 500 510 

ml 4 7 . pep PDLGAHRQTARSFALSGNWYFTPQHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNK 
I I I I I I II I I I II I I I I I M I I I I I I II I I M I I I I I I I I M I I M I I I I I I I I I I I I I I 
al4 7 PDLGAHRQTARSFALSGNWYFTPQHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNK 

490 500 510 520 530 540 

520 530 540 550 560 570 

ml 47 .pep HLNKERSNNIELALGYEGDRWQYNLALYRNRFGNYIYAQTLNDGRGPKSIEDDSEMKLVR 
I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I M II I I II I I I I I I I I M I I I I I I I I 
al4 7 HLNKERSNNIELALGYEGDRWQYNLALYRNRFGNYIYAQTLNDGRGPKSIEDDSEMKLVB 
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550 560 570 580 590 600 

580 590 600 610 620 630 

ml4 7 .pep YNQSGADFYGAEGEIYFKPTPRYRIGVSGDYVRGRLKNLPSLPGREDAYGNRPFIAQDDQ 
| | | | | | M | I | I I ! I I I ! I I M M M I I I I I I I I I I II I I I I M I I I I 11 I I I : I II II 
a 14 7 YNQSGADFYGAEGEIYFKPTPRYRIGVSGDYVRGRLKNLPSLPGREDAYGNRPLIAQADQ 

610 620 630 640 650 660 

640 650 660 670 680 690 

ml 4 7 . pep NAPRVPAARLGFHLKASLTDRIDANLDYYRVFAQNKLARYETRTPGHHMLNLGANYRRNT 
I I I I I I I I I I I I I I I I M I I I I I II I II I I I I I II I I I I I I I I 1 I I I I i I M I I I I I M 
a 14 7 NAPRVPAARLGVHLKASLTDRIDANLDYYRVFAQNKLARYETRTPGHHMLNLGANYRRNT 

670 680 690 700 710 720 

700 710 720 730 

ml 4 7 . pep R YGEWNWYVKADNLLNQS V YAH SSFLSDT PQMGR S FTGGVN VKFX 

I I i I I I I I I I I I I I M I M I I II I I I 1 I II I I I I 1 I I I I I I II If 
al4 7 RYGEWNWYVKADNLLNQSVYAHSSFLSDTPQMGRSFTGGVNVKFX 

730 740 750 760 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 589>: 

gl48 . seq 

1 ATGGCGTTAA AAAC AT C AAA CTTGGAACAC GCAATGctgg ttcaTCCCGA 

51 AgctATgagt gtcggcgCGC TTGccgAcaa AATCCGCAAA AtcgaAAact 

101 gGCCGCAAAA AGgcaTCTTA TTCCACGACA TCACGCCCGT CCTGCAAAGT 

151 GCGGAATACT TCCGCCTTTT GGTCGATTTG CTGGTTTACC GCTATATGGA 

201 TCAGAAAATC GACATCGTTG CCGGCTTGGA CGCGCGCGGC TTCATTATCG 

251 GCGCGGCACT CGCCTACCAG CTCAaCGtcg gctTCGTCCC CATCCGCAAA 

301 AAAGGCAAGC TGCCTTTTGA AACCGTATCG CAAAGCTAcg cgcTCGAATA 

351 CGGGGAAGCT GCGGTGGAAA TCCACACCGa tgccgTCAAA CCCGGTTCGC 

4 01 GCGTCCTGCT GGTCGATGAT TTGGTTGCCA CGGGCGGCAC AATGCTTGCC 

4 51 GGGCTGGAAC TGATCCGCAA ACTCGGCGGG GAAATTGTCG AAgccgccgC 

501 CATTTTGGAA TTTACCGACC TTCAAGGCGG CAAGAATATC CGCGCAAGTG 

551 GCGCGCCCTT ATTTACCCTG CTTCAAAACG AAGGCTGCAT GAAAGGCTGA 

This corresponds to the amino acid sequence <SEQ ID 590; ORF 148.ng>: 

gl4 8 .pep 

1 MALKTSNLEH AMLVHPEAMS VGALADKIRK IENWPQKGIL FHDITPVLQS 

51 AEYFRLLVDL LVYRYMDQKI D IVAGLDARG FIIGAALA YQ LNVGFVPIRK 

101 KGKLPFETVS QSYALEYGEA AVEIHTDAVK PGSRVLLVDD LVATGGTMLA 

151 GLELIRKLGG EIVEAAAILE FTDLQGGKNI RASGAPLFTL LQNEGCMKG* 

The following partial DNA sequence was identified in TV. meningitidis <SEQ ID 591>: 

ml48 . seq 

1 ATGGCGTTAA AAACATCAAA CTTGGAACAC GCAATGCTGG TTCATCCCGA 

51 AGCTATGAGT GTCGGCGCGC TTGCCGACAA AATCCGCAAA ATCGAAAACT 

101 GGCCGCAAAA AGGCATCTTA TTCCACGACA TCACGCCCGT CCTTCAAAGC 

151 GCGGAATACT TCCGCCTTTT GGTTGATTTA TTGGTTTACC GCTATATGGA 

201 TCAGAAAATC GACATCGTTG CCGGTTTGGA CGCGCGCGGC TTCATTATCG 

251 GCGCGGCACT CGCCTACCAG CTCAACGTCG GTTTCGTCCC CATCCGCAAA 

301 AAAGGCAAGC TGCCTTTTGA AACCGTATCG CAAAGCTACG CGCTCGAATA 

351 CGGGGAAGCT GCGGTGGAAA TCCACACCGA TGCCGTCAAA CTCGGTTCGC 

4 01 GCGTGCTGCT GGTCGATGAT TTGATTGCCA CGGGCGGCAC GATGCTTGCC 

4 51 GGACTGGAAC TGATCCGCAA ACTCGGCGGA GAAATTGTCG AAGCCGCCGC 

501 CATTTTGGAA TTTACCGACC TTCAAGGCGG CAAGAATATC CGTGCAAGCG 

551 GCGCGCCCTT ATTTACCCTG CTTCAAAACG AAGGCTGTAT GAAGGGCTGA 

This corresponds to the amino acid sequence <SEQ ID 592; ORF 148>: 

ml 48 . pep 

1 MALKTSNLEH AMLVHPEAMS VGALADKIRK IENWPQKGIL FHDITPVLQS 

51 AEYFRLLVDL LVYRYMDQKI DIVAGLDARG FIIGAALAYQ LNVGFVPIRK 
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101 KGKLPFETVS QSYALEYGEA AVEIHTDAVK LGSRVLLVDD LIATGGTMLA 
151 GLELIRKLGG EIVEAAAILE FTDLQGGKNI RASGAPLFTL LQNEGCMKG* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

m!48 / gl48 99.0% identity in 199 aa overlap 

10 20 30 40 50 60 

ml 4 8 .pep MALKTSNLEHAMLVHPEAMS VGALADKIRK IENWPQKGIL FHDITPVLQS AEYFRLLVDL 

| M I I I I I I I I II I I I I M I I I I I I I I I I i 1 I M I I U I I I I ! I I I I I M M M I I I I I I 
g 14B MALKTSNLEHAMLVHPEAMSVGALADKIRKIENWPQKGILFHDITPVLQSAEYFRLLVDL 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 4 8 pep LVYRYMDQKIDI VAGLDARGFI I G AALA YQ LN VG FV P I RKKGKLPFETVS QSYALEYGEA 

M | I I I I I I I I M I I M I I I I I I I I II I I I I I I I I I I I I II I i I I I I M I I I I I I I II I I 
g 14 8 LVYRYMDQKIDI VAGLDARGFI I GAALAYQLN VG FV P I RKKGKLPFETVS QSYALEYGEA 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 4 8 . pep AVEIHTDAVKLGSRVLLVDDLIATGGTMLAGLELIRKLGGEIVEAAAILEFTDLQGGKNI 

M I I M I I I I Ml | : I M I M I I I I I II I 1 I I I I I 1 I I I I t I I M I I I I II I I 

g 14 8 AVE IHTDAVKPGSRVLLVDDLVATGGTMLAGLELIRKLGGE I VEAAAILE FTDLQGGKNI 

130 140 150 160 170 180 

190 200 
ml 4 8 . pep RASGAPLFTLLQNEGCMKGX 
I I I I I I I I I I I I I I I I I I I I 
gl 4 8 RASGAPLFTLLQNEGCMKGX 

190 200 

The following partial DNA sequence was identified in TV. meningitidis <SEQ ID 593>: 

al48 . seq 

1 ATGGCGTTAA AAAC AT C AAA CTTGGAACAC GCAATGCTGG TTCATCCCGA 

51 AGCTATGAGT GTCGGTGCGC TTGCCGACAA AATCCGCAAA ATCGAAAACT 

101 GGCCGCAAAA AGGCATCTTA TTCCACGACA TCACGCCCGT CCTGCAAAGC 

151 GCGGAATACT TCCGACTTTT GGTTGATTTA TTGGTTTACC G C T AT AT GG A 

201 TCAGAAAATC GACATCGTTG CCGGTTTGGA CGCGCGCGGC TTCATTATCG 

251 GCGCGGCACT CGCCTACCAG CTCAACGTCG GTTTCGTCCC CATCCGCAAA 

301 AAAGGCAAGC TGCCTTTTGA AACCGTATCG CAAAGCTACG CGCTCGAATA 

351 CGGGGAAGCT GCGGTGGAAA TCCACACCGA TGCCGTCAAA CTCGGTTCGC 

4 01 GCGTGCTGCT GGTCGATGAT TTGGTTGCCA CGGGCGGCAC GATGCTTGCC 

4 51 GGACTGGAGC TGATCCGCAA ACTCGGCGGG GAAATTGTCG AAGCCGCCGC 

501 CATTTTGGAA TTTACCGACC TTCAAGGCGG CAAGAATATC CGTGCAAGCG 

551 GCGCGCCCTT ATTTACCCTG CTTCAAAACG AAGGCTGTAT GAAGGGCTGA 

This corresponds to the amino acid sequence <SEQ ID 594; ORF 148.a>: 

al4 8 .pep 

1 MALKTSNLEH AMLVHPEAMS VGALADKIRK IENWPQKGIL FHDITPVLQS 

51 AEYFRLLVDL LVYRYMDQKI D IVAGLDARG FIIGAALA YQ LNVGFVPIRK 

101 KGKLPFETVS QSYALEYGEA AVEIHTDAVK LGSRVLLVDD LVATGGTMLA 

151 GLELIRKLGG EIVEAAAILE FTDLQGGKNI RASGAPLFTL LQNEGCMKG* 

ml48/al48 99.5% identity in 199 aa overlap 

10 20 30 40 50 60 

ml 4 8 .pep MALKTSNLEHAMLVHPEAMSVGALADKIRKIENWPQKG I LFHDITPVLQS AEYFRLLVDL 

I I I I I I I I I I I I I I I I I I I t I I I I I I I I I I I 1 I I I I I I I I i I I I I I I M I I I I I I I I I I I 
al4 8 MALKTSNLEHAMLVHPEAMS VGALADKIRK IENWPQKGIL FHDITPVLQS AEYFRLLVDL 

10 20 30 40 50 60 

70 80 90 100 110 120 
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LVYRYMDQKIDIVAGLDARGFIIGAALAYQLNVGFVPIRKKGKLPFETVSQSYALEYGEA 

| | | | | 1 | | | | M I I I I I I I I M I I M I I I II I I I I I I t II II I I I I I I I 1 1 I 1 I I I I I I I 
LVYRYMDQKIDIVAGLD7VRGFIIGAALAYQLNVGFVPIRKKGKLPFETVSQSYALEYGEA 

70 80 90 100 110 120 

130 140 150 160 170 180 

AVEIHTDAVKLGSRVLLVDDLIATGGTMLAGLELIRKLGGEIVEAAAILEFTDLQGGKNI 

I | | M I li I I I M I I : II I I I I M I I I II I I I I I I I I I M I I I II I I I I M I I I 

AVEIHTDAVKLGSRVLLVDDLVATGGTMLAGLELIRKLGGEIVEAAAILEFTDLQGGKNI 
130 140 150 160 170 180 

190 200 
RASGAPLFTLLQNEGCMKGX 

I II I I I I I I I I I I II I I I I I 
RASGAPLFTLLQNEGCMKGX 
190 200 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 595>: 

gl49 . seq 

1 ATGTTGATTG ACAACAATGT CCGCCATTAC AGCTTTTTCG GTGTAGAACA 

51 GGCAAATTGG GACAACTTCA CGCTTGAAGG CGGCGTACGC GTGGAAAAAC 

101 AAAAAGCCTC CATCCGGTAC GACAAAGCAT TGATTGATCG AG AAAAC TAG 

151 TACAACCAGC CCCTGCCCGA CCTCGGCGCG CACCGCCAAA CCGCCCGCTC 

201 GTTCGCACTT TCGGGCAACT GGTATTTCAC GCCACACCAC AAACTCAGCC 

251 TGACCGCCTC CCATCAGGAa cgCCTGCCGT CAACGCaagA actGtACgca 

301 cacggcAAGC ACGtcgccac CAACACCTTT GAagtcggca acaaACACCT 

3 51 CAACAAAGaG CgttccaacA atatcgaACT CGCGCTGGgc tAcaaaggcg 

4 01 accGCTGGCA AT ACAAT CTG GCAGCCTACC GCAACCGAtT CGGCAACTAC 
4 51 ATTTACGCCC AAACCTTAaa cgacggacgC GGCCCCAAAT CCATCgaaga 
501 cgacagcgaA ATGaagcTCG TGCGCTACAA CCAATCCGGT GCCGACTTCT 
551 ACGgcgcggA aggcgaaatc tACTTcaaaC CGAcACCGCG CTACCGCATC 
601 GGTGTTTCCG GCGACTatgt acgaggccgT C TGAAAAAC C TGCCGTCCCT 
651 ACCCGGCAGG gaagatccCT AcggcAAACG TCccttcaTC GCACAAGCCG 
701 AC C AAAACG C CCCCCGCATT ccggctGCGC GCCTCGGCTT C C AC CTGAAA 
751 ACCTCGCTAA CCGACCGTAT CGATGCCAAT TTGGACTACT ACCGCGTGTT 
801 CGCCCAAAAC AAACTCGCCC GCTACGAAAC GCGTACGCCC GGAC AC CAT A 
851 TGCTCAACCT CGGTGCAAAC TACCGCCGCA ATACGCGCTA TGGCGAGTGG 
901 AATTGGTACG TCAAAGCCGA CAACCTGCtC aACcaatCcg tTTACGCCCa 
951 cAGCAGCTTC CTCTCTGATA CGCCGCAAAt gGGCCGCAGC TTtgccgGCg 

1001 gcgtaAACGT GaAGTTttaA 

This corresponds to the amino acid sequence <SEQ ID 596; ORF 149.ng>: 

gl4 9 -pep 

1 MLIDNNVRHY SFFGVEQANW DNFTLEGGVR VEKQKASIRY DKALIDRENY 

51 YNQPLPDLGA HRQTARSFAL SGNWYFTPHH KLSLTASHQE RLPSTQELYA 

101 HGKHVATNTF EVGNKHLNKE RSNNIELALG YKGDRWQYNL AAYRNRFGNY 

151 IYAQTLNDGR GPKSIEDDSE MKLVRYNQSG ADFYGAEGEI YFKPTPRYRI 

201 GVSGDYVRGR LKNLPSLPGR EDPYGKRPFI AQADQNAPRI PAARLGFHLK 

2 51 TSLTDRIDAN LDYYRVFAQN KLARYETRTP GHHMLNLGAN YRRNTRYGEW 

3 01 NWYVKADNLL NQSVYAHSSF LSDTPQMGRS FAGGVNVKF* 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 597>: 

ml4 9 . seq 

1 ATGCTGCTTG ACAACAAAGT G CAACATT AC AGCTTTTTCG GTGTAGAACA 

51 GGCAAACTGG GACAACTTCA CGCTTGAAGG AGGCGTACGC GTGGAAAAAC 

101 AAAAAGCCTC CATTCAGTAC GACAAAGCAT TGATTGATCG GGAAAACTAC 

151 TACAACCACC CCCTGCCCGA CCTCGGCGCG CACCGCCAAA CCGCCCGCTC 

201 ATTCGCACTT TCGGGCAACT GGTATTTCAC GCCACAACAC AAACTCAGCC 

251 TGACCGCCTC CCATCAGGAA CGCCTGCCGT CAACGCAAGA GCTGTACGCA 

3 01 CACGGCAAAC ACGTCGCCAC CAACACCTTT GAAGTCGGCA ACAAACACCT 
351 CAACAAAGAG CGTTCCAACA ATATCGAACT CGCGCTGGGC TACGAAGGCG 

4 01 ACCGCTGGCA ATACAATCTG GCACTCTACC GCAACCGCTT CGGTAACTAC 



ml 4 8 . pep 
al48 

ml 4 8 . pep 
al48 

ml48 . pep 
al48 
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4 51 


J~i, i. X X riv VJ ^— v*- 


DUX 


a c a r^rr? a a 


551 


ACGGCGCGGA 


601 


GGCGTTTCCG 


651 


ACCCGGCAGA 


701 


ACCAAAATGC 


751 


GCCTCGCTGA 


801 


CGCCCAAAAC 


851 


TGCTCAACCT 


901 


AATTGGTACG 


951 


CAGCAGCTTT 


1001 


GCGTGAACGT 



\AAGCCGA CAACCTGCTC AACCAATCCG TTTACGCCCA 
DTCTGATA CGCCGCAAAT GGGCCGCAGC TTTACCGGCG 
^GTTTTAA 

This corresponds to the amino acid sequence <SEQ ED 598; ORF 149>: 

ml4 9 -pep 

1 MLLDNKVQHY SFFGVEQANW DNFTLEGGVR VEKQKASIQY DKALIDRENY 

51 YNHPLPDLGA HRQTARS FAL SGNWYFTPQH KLSLTASHQE RLPSTQELYA 

101 HGKHVATNTF EVGNKHLNKE RSNNIELALG YEGDRWQYNL ALYRNRFGNY 

151 IYAQTLNDGR GPKSIEDDSE MKLVRYNQSG ADFYGAEGE I YFKPTPRYRI 

2 01 GVS GDYVRGR LKNLPSLPGR EDAYGNRPFI AQDDQNAPRV PAARLGFHLK 
251 AS LTDR I DAN LDYYRVFAQN KLARYETRTP GHHMLNLGAN YRRNTRYGEW 

3 01 NWYVKADNLL NQSVYAHSSF LSDTPQMGRS FTGGVNVKF* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N.zonorrhoeae 

ORF 149 shows 95.9% identity over a 339 aa overlap with a predicted ORF (ORF 149.ng) 
from N. gonorrhoeae: 

ml49/gl49 

10 20 30 40 50 60 

ml4 9 pep MLLDNKVQHYSFFGVEQANWDNFTLEGGVRVEKQKASIQYDKALIDRENYYNHPLPDLGA 

| I :|i:|:|Mli' I II Mil i ;, Mi II Mill Ml h Mill I ;i i! i I hi Mill i 

g X 4 9 ML I DNNVRHYS FFGVEQANWDNFTLEGGVRVEKQKAS I R YD KALI DREN Y YNQP LPDLGA 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml4 9 pep HRQTARSFALSGNWYFTPQHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNKHLNKE 

Ml : IMMMMI MMM :MMM i I MM I I I Mill I i IMM I Ml MM 

g l4 9 HRQTARS FALSGNWY FT PHHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNKHLNKE 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml4 9 pep RSNNIELAIXSYEGDRWQYNlJUjYRNRFGNYIYAQTLNDGRGPKSIEDDSEMKLVRYNQSG 

t | I I M M t I I = 1 1 I I 1 1 t 1 1 II MIM M Ml IMM I IMM M Ml II Ml M MM 

g 1 4 9 RSNN I ELALG YKGDRWQYNLAAYRNRFGN YI YAQTLNDGRGPKS I EDDS EMKLVRYNQSG 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml4 9 pep ADFYGAEGEI YFKPTPRYRIGVSGDYVRGRLKNLPSLPGREDAYGNRPFI AQDDQNAPRV 

MMMI MM M Ml MM MM MM Mill MMM IMM IhllllM MIMM 

gl4 9 ADFYGAEGE I YFKPTPRYRI GVSGDYVRGRLKNLPSLPGREDPYGKRPF I AQADQNAPRI 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml4 9 pep PAARLGFHLKASLTDRIDANLDYYRVFAQNKLARYETRTPGHHMLNLGANYRRNTRYGEW 

M M MM I M I I i I I I M I I MM I II I M I I I I I I Ml IM M I M I I I I I I I Ml 

gX 4 9 P AARLG FHLKT S L TD R I D ANLD Y YRV F AQN KLAR YE T RT PGHHMLN LG ANYRRNTR YGEW 

250 260 270 280 290 300 

310 320 330 340 

ml4 9 pep NW YVKADNLLNQS VY AHS S FLSDTPQMGRS FTGGVNVKFX 

I II! II Mill MM I II III IMMMMI MMI MM 



WO 99/057280 



420 



PCT/US99/09346 



g i4 9 NWYVKADNLLNQSVYAHSS FLSDTPQMGRS FAGGVNVKFX 

310 320 330 340 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 599>: 

al4 9 . seq 

1 ATGCTGCTTG ACAATAAAGT GCAACATTAC AGCTTTTTCG GTGTAGAACA 

51 GGCAAACTGG GACAACTTCA CGCTTGAAGG CGGCGTACGC GTGGAAAAAC 

101 AAAAAGCCTC CATCCGCTAC GACAAAGCAT TGATTGATCG GGAAAACTAC 

151 TACAACCATC CCCTGCCCGA CCTCGGCGCG CACCGCCAAA CCGCCCGCTC 

2 01 ATTCGCACTT TCGGGCAACT GGTATTTCAC GCCACAACAC AAACTCAGCC 

251 TGACCGCCTC CCATCAGGAA CGCCTGCCGT CAACGCAAGA GCTGTACGCA 

301 CACGGCAAAC ACGTCGCCAC CAACACCTTT GAAGTCGGCA ACAAACACCT 

351 CAACAAAGAG CGTTCCAACA ATATCGAACT CGCGCTGGGC TACGAAGGCG 

4 01 ACCGCTGGCA ATACAATCTG GCACTCTACC GCAACCGCTT CGGCAACTAC 

4 51 ATTTACGCCC AAACCTTAAA CGACGGACGC GGCCCCAAAT CCATCGAAGA 

501 CGACAGCGAA ATGAAGCTCG TGCGCTACAA CCAATCCGGT GCGGACTTCT 

551 ACGGCGCGGA AGGCGAAATC TACTTCAAAC CGACACCGCG CTACCGCATC 

601 GGCGTTTCCG GCGACTATGT ACGAGGCCGT CTGAAAAACC TGCCTTCCCT 

651 ACCCGGCAGG GAAGACGCCT ACGGCAACCG CCCACTCATT GCCCAAGCCG 

701 ACCAAAACGC CCCTCGCGTT CCGGCTGCGC GCCTCGGCGT CCACCTGAAA 

7 51 GCCTCGCTGA CCGACCGCAT CGATGCCAAT TTGGACTACT ACCGCGTGTT 

801 CGCCCAAAAC AAACTCGCCC GCTACGAAAC GCGCACGCCC GG AC AC CAT A 

851 TGCTCAACCT CGGCGCAAAC TACCGCCGCA ATACGCGCTA TGGCGAGTGG 

901 AATTGGTACG TCAAAGCCGA CAACCTGCTC AACCAATCCG TTTACGCCCA 

9 51 CAGCAGCTTC CTCTCTGATA CGCCGCAAAT GGGCCGCAGC TTTACCGGCG 

1001 GCGTGAACGT GAAGTTTTAA 



This corresponds to the amino acid sequence <SEQ ID 600; ORF 149.a>: 

al4 9 .pep 

1 MLLDNKVQHY SFFGVEQANW DNFTLEGGVR VEKQKASIRY DKALIDRENY 

51 YNHPLPDLGA HRQTARSFAL SGNWYFTPQH KLSLTASHQE RLPSTQELYA 

101 HGKHVATNTF EVGNKHLNKE RSNNIELALG YEGDRWQYNL ALYRNRFGNY 

151 IYAQTLNDGR GPKSIEDDSE MKLVRYNQSG ADFYGAEGEI YFKPTPRYRI 

201 GVSGDYVRGR LKNLPSLPGR EDAYGNRPLI AQADQNAPRV PAARLGVHLK 

251 ASLTDRIDAN LDYYRVFAQN KLARYETRTP GHHMLNLGAN YRRNTRYGEW 

301 NWYVKADNLL NQSVYAHSSF LSDT PQMGRS FTGGVNVKF* 



ml49/al49 98.8% identity in 339 aa overlap 

10 20 30 40 50 60 

ml 4 9 . pep MLLDNKVQHYSFFGVEQANWDNFTLEGGVRVEKQKASIQYDKALIDRENYYNHPLPDLGA 
1 I I I I 1 I I I I 1 I I I I I I I I I I I I M I I I M I I I I M M : I I I I I I M I I I I I M I I I I M 
a 1 4 9 MLLDNKVQHY S FFGVEQANWDN FTLEGGVRVEKQKAS I RYDKALI DRENY YNHPLPDLGA 

10 20 30 40 50 60* 

70 80 90 100 110 120 

ml 4 9 . pep HRQTARS FALSGNW Y FT PQHKL S LTASHQERLP STQEL YAHGKH VATNT FE VGNKHLNKE 

I I I I M I M I I! II I I I I I M I I I I I M I I t I M I I I I I I I I M I M I I II I I I I I I I I I 
al4 9 HRQTARSFALSGNWY FT PQHKL SLTASHQERLPSTQELYAHGKHVATNTFEVGNKHLNKE 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 4 9 pep RSNNIELALGYEGDRWQYNLALYRNRFGNYI YAQTLN DGRGPKSIEDDSEMKLVRYNQSG 

I I I M I I I I I I I i I I I I I I I I I N I I I I I I I M I I I I I I M M I I M I I I I I I I I I I II I 
a l 4 9 RSNNIELALGYEGDRWQYNLALYRNRFGNYI YAQTLNDGRGPKSIEDDSEMKLVRYNQSG 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml 4 9. pep ADFYGAEGEI YFKPTPRYRIGVSGDYVRGRLKNLPSLPGREDAYGNRPFIAQDDQNAPRV 

I I I I I I I I II I I I I I I M I I I I I I I M M I I I I I I I I I I I I I I M I I I : I I I I I I 1 I I I 
a l4 9 ADFYGAEGEI YFKPTPRYRIGVSGDYVRGRLKNLPSLPGRE D A YGNRPL I AQADQNAPRV 

190 200 210 220 230 240 



ml 4 9 .pep 



250 260 270 280 290 300 

P AARLGFHLKASLTDR I DANLDYYRVFAQNKLARYETRTPGHHMLNLG AN YRRNTRYGEW 
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I | | | M I I I II M I I II I i M I I ! I I I I I I I I II I ! 1 I I I M t I I ! 1 I ! I I I M I I I I I 
a 14 9 PAARLGVHLKASLTDRIDANLDYYRVFAQNKLARYETRTPGHHMLNLGANYRRNTRYGEW 

250 260 270 280 290 300 

310 320 330 340 

ml4 9 .pep NWYVKADNLLNQSVYAHSSFLSDTPQMGRSFTGGVNVKFX 
M I M I I I I I M I M I I I I II I II I I M I I I I I I M I M I 
a 14 9 NWYVKADNLLNQSVYAHSSFLSDTPQMGRSFTGGVNVKFX 

310 320 330 340 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 601>: 

gl49-l . seq 

1 ATGGCACAAA TCACACTCAA ACCCATTGTT TTATCAATTC TTTTAATCAA 

51 CACACCCCTC CTCGCCCAAG CGCATGAAAC TGAGCAATCG GTGGGCTTGG 

101 AAACGGTCAG CGTCGTCGGC AAAAGCCGTC CGCGCGCGAC TTCGGGGCTG 

151 CTGCACACTT CGACCGCCTC CGACAAAATC ATCTCCGGCG ATACTTTGCG 

201 CCAAAAAGCC GTCAACTTGG GCGACGCTTT GGACGGCGTA CCGGGCATCC 

251 ACGCTTCGCA ATACGGCGGC GGCGCATCCG CTCCCGTTAT TCGCGGTCAA 

301 AC GGGC AG AC GGATTAAAGT ATTGAACCAT CACGGCGAAA CGGGCGATAT 

351 GGCGGACTTT TCTCCCGATC ACGCCATTAT GGTAGATACC GCCTTGTCGC 

401 AACAGGTTGA AATCCTGCGC GGGCCGGTTA CGCTCTTGTA CAGCTCGGGC 

4 51 AATGTGGCGG GGCTGGTCGA TGTTGCCGAT GGAAAAATCC CCGAAAAAAT 

501 GCCTGAAAAC GGCGTATCGG GCGaagccgG ATTGCGTTTG AGCAGCGGCA 

551 ATTTAGAAAA ACTGACATCC GCAGGCATCA ATATCGGACT GGGCAAAAAC 

601 TTCGTGCTGC ATACCGAAGG CTTGTACCGC AAATCGGGCG ATTACGCCGT 

651 ACCGCGTTAC CGCAATCTGA AACGCCTGCC CGACAGCCAT GCCGATTCGC 

701 AAACGGGCAG CATCGGGCTG TCTTGGGTGG GCGAAAAAGG CTTTATCGGC 

751 GCAGCATACA GCGACCGTCG CGACCGCTAC GGCCTGCCTG CCCACAGCCA 

801 CGAATACGAT GATTGCCACG CCGACATCAT CTGGCAAAAG AGTTTGATCA 

851 AC AAACGCT A TTTGCAGCTT TATCCGCACT TGTTGACCGA AGAAGACATC 

901 GATTACGACA ATCCGGGCTT GAGCTGCGGC TTCCACGACG GCGACGGTGC 

951 ACACGCACAC ACCCACAACG GCAAACCGTG GATAGACCTG CGCAACAAAC 

1001 GCTACGAACT CCGCGCCGAA TGGAAGCAGC CATTCCCCGG TTTTGAAGCC 

1051 CTGCGCGTAC ATCTGAACCG CAATGACTAC CACCACGACG AAAAAGCAGG 

1101 CGATGCAGTA GAAAACTTCT TCAACAACAA AACACACAAC GCCCGTATCG 

1151 AGTTGCGCCA CCAACCCATA GGCCGTCTGA AAGGCAGCTG GGGCGTGCAA 

1201 TATTTGGGAC AAAAATCCAG CGCGCTTTCC GCCATTCCCG AAACCGTCCA 

1251 ACAACCGATG TTGATTGACA ACAATGTCCG CCATTACAGC TTTTTCGGTG 

1301 TAGAACAGGC AAATTGGGAC AACTTCACGC TTGAAGGCGG CGTACGCGTG 

1351 GAAAAACAAA AAGCCTCCAT CCGGTACGAC AAAGC AT T G A TTGATCGAGA 

14 01 AAACTACTAC AACCAGCCCC TGCCCGACCT CGGCGCGCAC CGCCAAACCG 

1451 CCCGCTCGTT CGCACTTTCG GGCAACTGGT ATTTCACGCC ACACCACAAA 

1501 CTCAGCCTGA CCGCCTCCCA TCAGGAACGC CTGCCGTCAA CGCAAGAACT 

1551 GTACGCACAC GGCAAGCACG TCGCCACCAA CACCTTTGAA GTCGGCAACA 

1601 AACACCTCAA CAAAGAGCGT TCCAACAATA TCGAACTCGC GCTGGGCTAC 

1651 GAAGGCGACC GCTGGCAATA CAATCTGGCA GCCTACCGCA ACCGATTCGG 

1701 CAACTACATT TACGCCCAAA CCTTAAACGA CGGACGCGGC CCCAAATCCA 

1751 TCGAAGACGA CAGCGAAATG AAGCTCGTGC GCTACAACCA ATCCGGTGCC 
1801 GACTTCTACG GCGCGGAAGG CGAAATCTAC TTCAAACCGA CACCGCGCTA 
1851 CCGCATCGGT GTTTCCGGCG ACTATGTACG AGGCCGTCTG AAAAACCTGC 
1901 CGTCCCTACC CGGCAGGGAA GATCCCTACG GCAAACGTCC CTTCATCGCA 
1951 CAAGCCGACC AAAACGCCCC CCGCATTCCG GCTGCGCGCC TCGGCTTCCA 
2001 CCTGAAAACC TCGCTAACCG ACCGTATCGA TGCCAATTTG GACTACTACC 
2051 GCGTGTTCGC CCAAAACAAA CTCGCCCGCT ACGAAACGCG TACGCCCGGA 
2101 CACCATATGC TCAACCTCGG TGCAAACTAC CGCCGCAATA CGCGCTATGG 
2151 CGAGTGGAAT TGGTACGTCA AAGCCGACAA CCTGCTCAAC CAATCCGTTT 
2201 ACGCCCACAG CAGCTTCCTC TCTGATACGC CGCAAATGGG CCGCAGCTTT 
22 51 ACCGGCGGCG TAAACGTGAA GTTTTAA 

This corresponds to the amino acid sequence <SEQ ID 602; ORF 149-1. ng>: 

gl49-l .pep 

1 MAQITLKPIV LSILLINTPL LAQA HETEQS VGLETVSVVG KSRPRATSGL 
51 LHTSTASDKI ISGDTLRQKA VNLGDALDGV PGIHASQYGG GASAPVIRGQ 
101 TGRRIKVLNH HGETGDMADF SPDHAIMVDT ALSQQVEILR GPVTLLYSSG 
151 NVAGLVDVAD GKIPEKMPEN GVSGEAGLRL SSGNLEKLTS AGINIGLGKN 
201 FVLHTEGLYR KSGDYAVPRY RNLKRLPDSH ADSQTGSIGL SWVGEKGFIG 
251 AAYSDRRDRY GLPAHSHEYD DCHADIIWQK SLINKRYLQL YPHLLTEEDI 
301 DYDNPGLSCG FHDGDGAHAH THNGKPWIDL RNKRYELRAE WKQPFPGFEA 
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351 LRVHLNRNDY HHDEKAGDAV ENFFNNKTHN ARIELRHQPI GRLKGSWGVQ 

401 YLGOKSSALS AIPETVQQPM LIDNNVRHYS FFGVEQANWD NFTLEGGVRV 

4 51 EKQKASIRYD KALIDRENYY NQPLPDLGAH RQTARSFALS GNWYFTPHHK 

501 LSLTASHQER LPSTQELYAH GKHVATNTFE VGNKHLNKER SNNIELALGY 

551 EGDRWQYNLA AYRNRFGNYI YAQTLNDGRG PKSIEDDSEM KLVRYNQSGA 

601 DFYGAEGEIY FKPTPRYRIG VSGDYVRGRL KNLPSLPGRE DPYGKRPFI A 

651 QADQNAPRIP AARLGFHLKT SLTDRIDANL DYYRVFAQNK LARYETRTPG 

701 HHMLNLGANY RRNTRYGEWN WYVKADNLLN QSVYAHSSFL SDTPQMGRSF 

7 51 TGGVNVKF* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 603>: 

ml49-l .seq 

1 ATGGCACAAA CTACACTCAA ACCCATTGTT TTATCAATTC TTTTAATCAA 

51 CACACCCCTC CTCGCCCAAG CGCATGAAAC TGAGCAATCG GTGGATTTGG 

101 AAACGGTCAG CGTCGTCGGC AAAAGCCGTC CGCGCGCCAC GTCGGGGCTG 

151 TTGCACACTT CGACCGCCTC CGACAAAATC ATCTCCGGCG ATACCTTGCG 

201 CCAAAAAGCC GTCAACTTGG GCGACGCTTT AGACGGCGTA CCGGGCATCC 

251 ACGCTTCGCA ATACGGCGGC GGCGCGTCTG CTCCCGTCAT TCGCGGTCAA 

301 ACAGGCAGGC GGATTAAAGT GTTGAACCAT CACGGCGAAA CAGGCGATAT 

351 GGCGGATTTT TCGCCCGATC ACGCCATTAT GGTAGATACC GCCTTGTCGC 

401 AACAGGTCGA AATCCTGCGC GGGCCGGTTA CGCTCTTGTA CAGCTCGGGC 

4 51 AATGTGGCGG GGCTGGTCGA TGTTGCCGAT GGCAAAATCC CCGAAAAAAT 

501 GCCTGAAAAC GGCGTATCGG GCGAACTCGG ATTGCGTTTG AGCAGCGGCA 

551 ATCTGGAAAA ACTCACGTCC GGCGGCATCA ATATCGGTTT GGGCAAAAAC 

601 TTTGTATTGC ACACGGAAGG GCTGTACCGC AAATCGGGGG ATTACGCCGT 

651 ACCGCGTTAC CGCAATCTGA AACGCCTGCC CGACAGCCAC GCCGATTCGC 

701 AAACGGGCAG CATCGGGCTG TCTTGGGTTG GCGAAAAAGG TTTTATCGGC 

7 51 GTAGCGTACA GCGACCGTCG CGACCAATAT GGTCTGCCTG CCCACAGCCA 

801 CGAATACGAT GATTGCCACG CCGACATCAT CTGGCAAAAG AGCTTGATTA 

851 ACAAACGCTA TTTACAGCTT TATCCGCACC TGTTGACCGA AGAAGACATC 

901 GATTACGACA ATCCGGGCTT GAGCTGCGGC TTCCACGACG ACGATAATGC 

951 ACACGCACAC ACCCACAGCG GCAGACCGTG GATAGACCTG CGCAACAAAC 

1001 GCTACGAACT CCGTGCCGAA TGGAAGCAAC CGTTCCCCGG TTTTGAAGCC 

10 51 CTGCGCGTAC ACCTGAACCG CAACGACTAC CGCCACGACG AAAAAGCAGG 

1101 CGATGCAGTC GAAAACTTTT TTAACAACCA AACGCAAAAC GCCCGCATCG 

1151 AGTTGCGCCA CCAACCCATA GGTCGTCTGA AAGGCAGCTG GGGCGTGCAA 

1201 TATTTACAAC AAAAATCCAG TGCTTTATCT GCCATATCCG AAGCGGTTAA 

1251 ACAACCGATG CTGCTTGACA ACAAAGTGCA ACATTACAGC TTTTTCGGTG 

1301 TAGAACAGGC AAACTGGGAC AACTTCACGC TTGAAGGAGG CGTACGCGTG 

1351 GAAAAACAAA AAGCCTCCAT TCAGTACGAC AAAGCATTGA TTGATCGGGA 

14 01 AAACTACTAC AACCACCCCC TGCCCGACCT CGGCGCGCAC CGCCAAACCG 

1451 CCCGCTCATT CGCACTTTCG GGCAACTGGT ATTTCACGCC ACAACACAAA 

1501 CTCAGCCTGA CCGCCTCCCA TCAGGAACGC CTGCCGTCAA CGCAAGAGCT 

1551 GTACGCACAC GGCAAACACG TCGCCACCAA CACCTTTGAA GTCGGCAACA 

1601 AACACCTCAA CAAAGAGCGT TCCAACAATA TCGAACTCGC GCTGGGCTAC 

1651 GAAGGCGACC GCTGGCAATA CAATCTGGCA CTCTACCGCA ACCGCTTCGG 

1701 TAACTACATT TACGCCCAAA CCTTAAACGA CGGACGCGGC CCCAAATCCA 

1751 TCGAAGACGA CAGCGAAATG AAGCTCGTGC GCTACAACCA ATCCGGCGCC 

1801 GACTTCTACG GCGCGGAAGG CGAAATCTAC TTCAAACCGA CACCGCGCTA 

1851 CCGCATCGGC GTTTCCGGCG ACTATGTACG AGGCCGTCTG AAAAACCTGC 

1901 CTTCCCTACC CGGCAGAGAA GATGCCTACG GCAACCGTCC TTTCATCGCA 

1951 CAGGACGACC AAAATGCCCC CCGTGTTCCG GCTGCGCGCC TCGGCTTCCA 

2001 CCTGAAAGCC TCGCTGACCG ACCGTATCGA TGCCAATTTG GACTACTACC 

2051 GCGTGTTCGC CCAAAACAAA CTCGCCCGCT ACGAAACGCG CACGCCCGGA 

2101 CACCATATGC TCAACCTCGG CGCAAACTAC CGCCGCAATA CGCGCTATGG 

2151 CGAGTGGAAT TGGTACGTCA AAGCCGACAA CCTGCTCAAC CAATCCGTTT 

2201 ACGCCCACAG CAGCTTTCTC TCTGATACGC CGCAAATGGG CCGCAGCTTT 

2251 ACCGGCGGCG TGAACGTGAA GTTTTAA 

This corresponds to the amino acid sequence <SEQ ID 604; ORF 149-1>: 

ml 4 9-1 -pop 

1 MAQTTLKPIV LSILLINTPL LAQA HETEQS VDLETVSVVG KSRPRATSGL 

51 LHTSTASDKI ISGDTLRQKA VNLGDALDGV PGIHASQYGG GASAPVIRGQ 

101 TGRRIKVLNH HGETGDMADF SPDHAIMVDT ALSQQVEILR GPVTLLYSSG 

151 NVAGLVDVAD GKIPEKMPEN GVSGELGLRL SSGNLEKLTS GGINIGLGKN 

201 FVLHTEGLYR KSGDYAVPRY RNLKRLPDSH ADSQTGSIGL SWVGEKGFIG 

251 VAYSDRRDQY GLPAHSHEYD DCHADIIWQK SLINKRYLQL YPHLLTEEDI 

301 DYDNPGLSCG FHDDDNAHAH THSGRPWIDL RNKRYELRAE WKQPFPGFEA 

351 LRVHLNRNDY RHDEKAGDAV ENFFNNQTQN ARIELRHQPI GRLKGSWGVQ 

4 01 YLQQKSSALS AISEAVKQPM LLDNKVQHYS FFGVEQANWD NFTLEGGVRV 

451 EKQKASIQYD KALIDRENYY NHPLPDLGAH RQTARSFALS GNWYFTPQHK 

501 LSLTASHQER LPSTQELYAH GKHVATNTFE VGNKHLNKER SNNIELALGY 
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551 EGDRWQYNLA LYRNRFGNYI YAQTLNDGRG PKSIEDDSEM KLVRYNQSGA 

601 DFYGAEGEIY FKPTPRYRIG VSGDYVRGRL KNLPSLPGRE DAYGNRPFIA 

651 QDDQNAPRVP AARLGFHLKA SLTDRI DANL DYYRVFAQNK LARYETRTPG 

701 HHMLNLGANY RRNTRYGEWN WYVKADNLLN QSVYAHSSFL SDTPQMGRSF 

751 TGGVNVKF* 

ml49-l/gl49-l 96.2% identity in 758 aa overlap 

10 20 30 40 50 60 

ml 4 9-1 pep MAQTTLKFIVLSILLINTPLLAQAHETEQSVDLETVSVVGKSRPRATSGLLHTSTASDKI 
1 I I | 1 1 I I I I I I I I I I I I t I I I I I I I M I I I I I I I I I i I I I I I I I I I M I I I I I I I I ! 
gl49-l MAQITLKPI VLS ILLINTPLLAQAHETEQSVGLETVSVVGKSRPRATSGLLHTSTASDKI 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml4 9-l .pep ISGDTLRQKAVNLGDALDGVPGIHASQYGGGASAPVIRGQTGRRI KVLNHHGETGDMADF 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I 1 I I t I I I I Ill 

g 1 4 9 - 1 I SGDTLRQKAVNLGDALDG VPGI HASQYGGGAS APV I RGQTGRRI KVLNHHGETGDMADF 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 4 9-1 pep SPDHAIMVDTALSQQVEILRGPVTLLYSSGNVAGLVDVADGKIPEKMPENGVSGELGLRL 
I 1 I I I I 1 I I I I I I I M I I I I I I 1 I I I I i I I I I I I I 1 M I I I I 1 I 1 I I I I I I I I I I I I I I 
gl4 9-l SPDHAIMVDTALSQQVEILRGPVTLLYSSGNVAGLVDVADGKIPEKMPENGVSGEAGLRL 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml49-l pep SSGNLEKLTSGGINIGLGKNFVLHTEGLYRKSGDYAVPRYRNLKRLPDSHADSQTGSIGL 
I | | | | | I M I : I II I I I I I I I I I I I I I I I I I I 1 I M I I I I I I I I I I 1 I I I I ! I M I I I I I 
gl49-l SSGNLEKLTSAGINIGLGKNFVLHTEGLYRKSGDYAVPRYRNLKRLPDSHADSQTGSIGL 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml49-l .pep SWVGEKGFIGVAYSDRRDQYGLPAHSHEYDDCHADI IWQKSLINKRYLQLYPHLLTEEDI 
| 1 I I I I I I M : I I M I I I : I I M I I t I I I I I I j I I I M t I I I I I I I t I I I I I I i I I I I I 1 
ql4 9-l SWVGEKGFIGAAYSDRRDRYGLPAHSHEYDDCHADI IWQKSLINKRYLQLYPHLLTEEDI 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml 49-1 .pep DYDNPGLSCGFHDDDNAHAHTHSGRPWIDLRNKRYELRAEWKQPFPGFEALRVHLNRNDY 
| | | I I I I I I I 1 I I I : I I I I I I : I : I I I I I I I I I I I I I M I I I I I I I 1 I I M I t I I I I I i 
gl4 9-l DY DNPGLSCGFHDGDGAHAHTHNGKPWI DLRNKR YELRAEWKQP FPGFEALRVHLNRNDY 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml 4 9-1 . pep RHDEKAGDAVENFFNNQTQNARIELRHQPIGRLKGSWGVQYLQQKSSALSAISEAVKQPM 
: | | | | I I I I I I I I I I I : I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I : I I I 
gl4 9-l HHDEKAGDAVENFFNNKTHNARIELRHQPIGRLKGSWGVQYLGQKSSALSAI PETVQQPM 

370 380 390 400 410 420 

430 440 450 460 470 480 

ml49-l -pep LLDNKVQHYSFFGVEQANWDNFTLEGGVRVEKQKASIQYDKALIDRENYYNHPLPDLGAH 
I : I I : I : I I I I I I I I I I I I I I I I I I I I I E I I I I 1 I I I : t I I ! I I I I I I I I I : I I I I I I M 
gl49-l LIDNNVRHYSFFGVEQANWDNFTLEGGVRVEKQKASIRYDKALIDRENYYNQPLPDLGAH 

430 440 450 460 470 480 

490 500 510 520 530 540 

ml4 9-l - pep RQTARSFALSGNWYFTPQHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNKHLNKER 
I | I I I I I I I I I I I I 1 I I : I 1 I I I I I 1 I I I I I I I I I I I I I I M I I I I I I I 1 I I I I I I I I I I 
gl49-l RQTARSFALSGNWYFTPHHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNKHLNKER 
490 500 510 520 530 540 

550 560 570 580 590 600 

m!49-l - pep SNNIELALGYEGDRWQYNLALYRNRFGNYI YAQTLNDGRGPKSIEDDSEMKLVRYNQSGA 
| | | I I | | I I j I I I I I I I I I I M I I I I I I I I I M I 1 I I I I I I I II I I I I I I I I I I I I I I I 
gl4 9-l SNN I ELALGYEGDRWQYNLAAYRNRFGNY I YAQTLNDGRGPKSIEDDSEMKLVRYNQSGA 

550 560 570 580 590 600 

610 620 630 640 650 660 

ml4 9-l - pep D FY GAEGEIY FKPTPRYRIG VSGDYVRGRLKNL PS LPGREDAYGNRPFIAQDDQNAPRVP 
I I II 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 11:111111 I II t I I : I 
gl49-l DFYGAEGEIY FKPTPRYRIGVSGDYVRGRLKNLPSLPGREDPYGKRPFIAQADQNAPRI P 

610 620 630 640 650 660 
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670 680 690 700 "710 720 

ml 4 9-1 pep AARLGFHLKASLTDRIDANLDYYRVFAQNKLARYETRTPGHHMLNLGANYRRNTRYGEWN 

| | | | I I i 1 I : I I I I I I I i I I 1 I I I I 1 I! i i I I ! I 1 I I I 1 I I I I i ! I I I U I I I I I 

q 14 9-1 AARLGFHLKTSLTDRIDANLDYYRV FAQNKLARYETRTPGHHMLNLGANYRRNTRYGEWN 

670 680 690 700 710 720 

730 740 750 759 

ml 4 9-1 .pep WYVKADNLLNQSVYAHSSFLSDTPQMGRSFTGGVNVKFX 
I I I I I I I I I I I I I I I I I I t I ! I I I I I II I M I I I I I I i I 
g 1 4 9- 1 WYVKADNLLNQSVYAHSSFLSDTPQMGRSFTGGVNVKFX 

730 740 750 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 605>: 

al4 9-l . seq 

1 ATGGCACAAA CTACACTCAA ACCCATTGTT TTATCAATTC TTTTAATCAA 

51 CACACCCCTC CTCTCCCAAG CGCATGGAAC TGAGCAATCA GTGGGCTTGG 

101 AAACGGTCAG CGTCGTCGGC AAAAGCCGTC CGCGCGCCAC TTCGGGGCTG 

151 CTGCACACTT CTACCGCCTC CGACAAAATC ATCAGCGGCG ACACCTTGCG 

201 ACAAAAAGCC GTCAACTTGG GTGATGCTTT AGACGGCGTA CCGGGCATTC 

251 ATGCCTCGCA ATACGGCGGC GGCGCATCCG CTCCCGTTAT TCGCGGTCAA 

301 ACAGGCAGAC GGATTAAAGT GTTGAACCAT CACGGCGAAA CGGGCGACAT 

351 GGCGGACTTC TCTCCAGACC ATGCAATCAT GGTGGACAGC GCCTTGTCGC 

401 AACAGGTCGA AATCCTGCGC GGTCCGGTTA CGCTCTTGTA CAGCTCGGGC 

451 AATGTGGCGG GGCTGGTCGA TGTTGCCGAT GGCAAAATCC CCGAAAAAAT 

501 GCCTGAAAAC GGCGTATCGG GCGAACTCGG ATTGCGTTTG AGCAGCGGCA 

551 ATCTGGAAAA ACTCACGTCC GGCGGCATCA ATATCGGTTT GGGCAAAAAC 

601 TTTGTATTGC ACACGGAAGG GCTGTACCGC AAATCGGGGG ATTACGCCGT 

651 ACCGCGTTAC CGCAATCTGA AACGCCTGCC CGACAGCCAC GCCGATTCGC 

701 AAACGGGCAG CATCGGGCTG TCTTGGGTTG GCGAAAAAGG CTTTATCGGC 

751 GCAGCATACA GCGACCGTCG CGACCAATAT GGTCTGCCTG CCCACAGCCA 

801 CGAATACGAT GATTGCCACG CCGACATCAT CTGGCAAAAG AGTTTGATTA 

851 ACAAACGCTA TTTGCAGCTT TATCCGCACC TGTTGACCGA AGAAGACATC 

901 GATTACGACA ATCCGGGCTT GAGCTGCGGC TTTCACGACG ACGATGATGC 

951 ACACGCCCAT GCCCACAACG GCAAACCTTG GATAGACCTG CGCAACAAAC 

1001 GCTACGAACT CCGCGCCGAA TGGAAGCAAC CGTTCCCCGG TTTTGAAGCC 

1051 CTGCGCGTAC ACCTGAACCG CAACGACTAC CGCCACGACG AAAAAGCAGG 

1101 CGATGCAGTA GAAAACTTTT TTAACAACCA AACGCAAAAC GCCCGTATCG 

1151 AGTTGCGCCA CCAACCCATA GGCCGTCTGA AAGGCAGCTG GGGCGTGCAA 

1201 TATTTGGGAC AAAAATCCAG TGCTTTATCT GCCACATCCG AAGCGGTCAA 

12 51 ACAACCGATG CTGCTTGACA ATAAAGTGCA ACATTACAGC TTTTTCGGTG 

1301 TAGAACAGGC AAACTGGGAC AACTTCACGC TTGAAGGCGG CGTACGCGTG 

1351 GAAAAACAAA AAGCCTCCAT CCGCTACGAC AAAGCATTGA TTGATCGGGA 

14 01 AAACTACTAC AACCATCCCC TGCCCGACCT CGGCGCGCAC CGCCAAACCG 

1451 CCCGCTCATT CGCACTTTCG GGCAACTGGT ATTTCACGCC ACAACACAAA 

1501 CTCAGCCTGA CCGCCTCCCA TCAGGAACGC CTGCCGTCAA CGCAAGAGCT 

1551 GTACGCACAC GGCAAACACG TCGCCACCAA CACCTTTGAA GTCGGCAACA 

1601 AACACCTCAA CAAAGAGCGT TCCAACAATA TCGAACTCGC GCTGGGCTAC 

1651 GAAGGCGACC GCTGGCAATA CAATCTGGCA CTCTACCGCA ACCGCTTCGG 

17 01 CAACTACATT TACGCCCAAA CCTTAAACGA CGGACGCGGC CCCAAATCCA 
1751 TCGAAGACGA CAGCGAAATG AAGCTCGTGC GCTACAACCA ATCCGGTGCG 
1801 GACTTCTACG GCGCGGAAGG CGAAATCTAC TTCAAACCGA CACCGCGCTA 

18 51 CCGCATCGGC GTTTCCGGCG ACTATGTACG AGGCCGTCTG AAAAACCTGC 
1901 CTTCCCTACC CGGCAGGGAA GACGCCTACG GCAACCGCCC ACTCATTGCC 
1951 CAAGCCGACC AAAACGCCCC TCGCGTTCCG GCTGCGCGCC TCGGCGTCCA 
2001 CCTGAAAGCC TCGCTGACCG ACCGCATCGA TGCCAATTTG GACTACTACC 
2051 GCGTGTTCGC CCAAAACAAA CTCGCCCGCT ACGAAACGCG CACGCCCGGA 
2101 CACCATATGC TCAACCTCGG CGCAAACTAC CGCCGCAATA CGCGCTATGG 
2151 CGAGTGGAAT TGGTACGTCA AAGCCGACAA CCTGCTCAAC CAATCCGTTT 
2201 ACGCCCACAG CAGCTTCCTC TCTGATACGC CGCAAATGGG CCGCAGCTTT 
22 51 ACCGGCGGCG TGAACGTGAA GTTTTAA 

This corresponds to the amino acid sequence <SEQ ID 606; ORF 149-La>: 

al49-l .pep 

1 MAQTTLKPIV LSILLINTPL LSQA HGTEQS VGLETVSVVG KSRPRATSGL 

51 LHTSTASDKI ISGDTLRQKA VNLGDALDGV PGIHASQYGG GASAPVIRGQ 

101 TGRR1KVLNH HGETGDMADF SPDHAIMVDS ALSQQVEILR GPVTLLYSSG 

151 NVAGLVDVAD GKIPEKMPEN GVSGELGLRL SSGNLEKLTS GGINIGLGKN 

201 FVLHTEGLYR KSGDYAVPRY RNLKRLPDSH ADSQTGSIGL SWVGEKGFIG 

251 AAYSDRRDQY GLPAHSHEYD DCHADI IWQK SLINKRYLQL YPHLLTEEDI 

301 DYDNPGLSCG FHDDDDAHAH AHNGKPWIDL RNKRYELRAE WKQPFPGFEA 
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351 LRVHLNRNDY RHDEKAGDAV ENFFNNQTQN ARIELRHQPI GRLKGSWGVQ 

401 YLGQKSSALS ATSEAVKQPM LLDNKVQHYS FFGVEQANWD NFTLEGGVRV 

4 51 EKQKASIRYD KALIDRENYY NHPLPDLGAH RQTARSFALS GNWYFTPQHK 

501 LSLTASHQER LPSTQELYAH GKHVATNTFE. VGNKHLNKER SNNIELALGY 

551 EGDRWQYNLA LYRNRFGNYI YAQTLNDGRG PKSIEDDSEM KLVRYNQSGA 

601 DFYGAEGEIY FKPTPRYRIG VSGDYVRGRL KNLPSLPGRE DAYGNRPLIA 

651 QADQNAPRVP AARLGVHLKA SLTDRIDANL DYYRVFAQNK LARYETRTPG 

701 HHMLNLGANY RRNTRYGEWN WYVKADNLLN QSVYAHSSFL SDTPQMGRSF 

7 51 TGGVNVKF* 

al49-l/ml49-l 98.0% identity in "758 aa overlap 

10 20 30 40 50 60 

al4 9-l .pep MAQTTLKPI VLS ILL INT PLLSQAHGTEQSVGLETVSVVGKSRPRATSGLLHTST AS DKI 
I t I I I I I M I I I I I I I I I I I t : I I I Mill I I I I I I I I I I I I M I I I I I I t I I I I I I I 
ml 4 9-1 MAQTTLKPI VLS ILL I NTPLLAQAHETEQSVDLETVSVVGKSRPRATSGLLHTST AS DKI 

10 20 30 40 50 60 

70 80 90 100 110 120 

a 1 4 9 - 1 . pep I SGDTLRQKAVNLGDALDGVPGI HASQYGGGAS APVI RGQTGRRI KVLNHHGETGDMADF 
| | | M t I It I I I I I I I I I I II I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I t M I I I I I ! 
ml 4 9-1 I SGDTLRQKAVNLGDALDGVPG I HASQYGGGASAPV I RGQTGRRI KVLNHHGETGDMADF 

70 80 90 100 110 120 

130 140 150 160 170 180 

al4 9-l .pep SPDHAIMVDSALSQQVEILRGPVTLLYSSGNVAGLVDVADGKIPEKMPENGVSGELGLRL 
I I I I I I I I I : I I I I I I I I I I I I I I I I! I I I I I I I I I I I I I 1 I I I I I I I I I M I I I I I I M 
ml 4 9-1 SPDHAIMVDTALSQQVEILRGPVTLLYSSGNVAGLVDVADGKIPEKMPENGVSGELGLRL 
130 140 150 160 170 180 

190 200 210 220 230 240 

al 4 9-1. pep SSGNLEKLTSGGINIGLGKN FVLHTEGLYRKSGDYAVPRYRNLKRLPDSHADSQTGSIGL 
M I I I I I I I I I I I II I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
ml 4 9-1 SSGNLEKLTSGGINIGLGKNFVLHTEGLYRKSGDYAVPRYRNLKRLPDSHADSQTGSIGL 

190 200 210 220 230 240 

250 ' 260 270 280 290 300 

al49-l.pep SWVGEKGFIGAAYSDRRDQYGLPAHSHEYDDCHADI IWQKSLINKRYLQLYPHLLTEEDI 
M | I | I I E 1 I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
ml 4 9- 1 SWVGEKGFIGVAYSDRRDQYGLPAHSHEYDDCHADI IWQKSLINKRYLQLYPHLLTEEDI 

250 260 270 280 290 300 

310 320 330 340 350 360 

al 4 9-1 .pep DYDNPGLSCGFHDDDDAHAHAHNGKPWI DLRNKRYELRAEWKQPFPGFEALRVHLNRNDY 
| | | | | | | I I I 1 I I I I : I I I I : I : I : I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
ml 4 9-1 DYDNPGLSCGFHDDDNAHAHTHSGRPWIDLRNKRYELRAEWKQPFPGFEALRVHLNRNDY 

310 320 330 340 350 360 

370 380 390 400 410 420 

al4 9-l . pep RHDEKAGDAVENFFNNQTQNARIELRHQPIGRLKGSWGVQYLGQKSSALSATSEAVKQPM 

I 1 I I I I I I I I I I I I I I 1 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I 

ml 4 9-1 RHDEKAGDAVENFFNNQTQNARIELRHQPIGRLKGSWGVQYLQQKSSALSAISEAVKQPM 
370 380 390 400 410 420 

430 440 450 4 60 470 480 

al4 9-l .pep lldnkvqhysffgveqanwdnftleggvrvekqkasirydkalidrenyynhplpdlgah 

I I I 1 I I It 1 I I I I I I I I I I I I I I I M I I I I I I I I I I I : I I I I I I I I I I I I I 1 I I I I I I I I 
ml 4 9 - 1 LLDNKVQHYS FFGVEQANWDNFTLEGGVRVEKQKAS IQYDKALI DREN Y YNHPLPDLGAH 

430 440 450 4 60 470 480 

490 500 510 520 530 540 

al4 9-l .pep RQTARS FALSGNWYFTPQHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNKHLNKER 
I | | M I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
ml49-l RQTARS FALSGNWY FT PQHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNKHLNKER 

490 500 510 520 530 540 

550 560 570 580 590 600 

al4 9-l .pep SNNIELALGYEGDRWQYNLALYRNRFGNYI YAQTLNDGRGPKSIEDDSEMKLVRYNQSGA 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
ml 4 9-1 SNNIELALGYEGDRWQYNLALYRNRFGNYI YAQTLNDGRGPKSIEDDSEMKLVRYNQSGA 

550 560 570 580 590 600 



610 620 



630 640 650 660 
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al 4 9-1 .pep DFYGAEGEI YFKPTPRYRIGVSGDYVRGRLKNLPSLPGREDAYGNRPLI AQADQNAPRVP 

i I I I I I I I I I I I I I I I I I I I I I I E I I I I I I I I I I I I I I I I I I M I I E : I I i III 

ml 4 9-1 DFYGAEGEI YFKPTPRYRI GVSGDYVRGRLKNL PS LPGREDAYGNRPFIAQDDQNAPRVP 

610 620 630 640 650 660 

670 680 690 700 710 720 

al4 9-l -pep AARLGVHLKASLTDRIDANLDYYRVFAQNKLARYETRTPGHHMLNLGANYRRNTRYGEWN 
I I I I I | I I | I 1 1 I t M I I I I 1 I I I I I I I I I f I I I I I I I I I I M I I I I I I I i I i I I I I I I 
ml 4 9-1 AARLGFHLKASLTDRIDANLDYYRV FAQNKLARYETRTPGHHMLNLGANYRRNTRYGEWN 

670 680 690 700 710 720 

730 740 750 759 

a!4 9-l .pep WYVKADNLLNQSVYAHSSFLSDTPQMGRS FTGGVNVKFX 
I I I I I I I I I I I I I I I I I I I I I I I M I I I I I ! I I I I I I I I 
ml4 9-l WYVKADNLLNQSVYAHSSFLSDTPQMGRS FTGGVNVKFX 

730 740 750 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 607>: 

gl50.seq (partial) 

1 . . TACTGCAAGG CAGACCCCTT TCCCGCCGCC CTGCTGGCCA AT C AG AAAAT 

51 CACCGCCCGC CAATCCGATA AAGACGTGCG CCACATCGAA ATCGATTTGA 

101 GCGGTTCGGA TTTGCACTAC CTCCCGGGCG ACGCGCTCGG CGTTTGGTTT 

151 GACAACGATC CGGCACTGGT CGGGGAAAT C CTAGACCTGC TCGGCATCAA 

2 01 TCCGGCAACG GAAATACAGG CGGGCGGAAA AACCCTGCCG GTTGCCTCCG 

2 51 CACTGTTATC CCATTTCGAA CTCACGCAAA ACACCCCCGC CTTTGTCAAA 
301 GGCTATGCCA CGTTCGCCGA TAATGACGAA CTCGACCGTA TTGCTGCCGA 

3 51 CAACGCCGTT TTGCAAGGCT TTGTGCAAAG CACGCCGATT GCCGGTGTGC 
401 TGCACCGCTT CCCGGCAAAA CTGACGGCGG AACAATTCGC CGGCCTGCTG 

4 51 CGCCCGCTTG CGCCGCGCCT GTATTCGATT TCCTCGTCGC AGGCGGAAGC 
501 GGGGGACGAA GTGCACCTGA CCGTCGGCGC AGTGCGTTTC GAACACGAAG 

5 51 GGCGCGCCAG GGCGGGCGGC GCATCGGGTT TCTTTGCCGA CCGGCTGGAA 
601 G AGGACGG C A CGGTGCGCGT GTTTGCGGAA CGCAACGACG GCTTCAGGCT 

6 51 GCCCGAAGAC AGCCGCAAGC CGATTGTGAT GATCGGCTCC GGTACCGGCG 
701 TCGCACCGTT CCGCGCCTTC GTCCAACAAC GTGCCGCAGA AAATGCGGAA 
751 GGCAGAAACT GGCTGATTTT CGGCAATCCG CATTTTGCCG CCGACTTCCT 
801 CTATCAGACC GAATGGCAGC AGTTTGCCAA AGACGGCTTC C TG CAC AG AT 
851 ATGACTTCGC CTGGTCGCGC GATCAGGAAG AAAAAATCTA TGTGCAGGAC 
901 AAAAT CCGCG AACAGGCGGA AGGACTTTGG CAATGGCTGC AGGAAGGCGC 
951 G C AT AT CTAT GTGTGCGGCG ATGCGGCAAA AATGGCAAAA GAAGTGGAAG 

1001 CCGCCTTGCT GGATGTGATT ATCGGGGCAG GGCATTCGGA CGAAGACGGC 
1051 GCAGAAGGAT ATTTGGATAT GCTGCGCGAA GAAAAACGCT ATCAGCGTGA 
1101 TGTTTATTGA 

This corresponds to the amino acid sequence <SEQ ID 608; ORF 150.ng>: 

glSO.pep (partial) 

1 . . YCKADPFPAA LLANQKITAR QSDKDVRHIE IDLSGSDLHY LPGDALGVWF 

51 DNDPALVGE I LDLLGINPAT E I QAGGKTLP VASALLSHFE LTQNTPAFVK 

101 GYATFADNDE LDRIAADNAV LQGFVQSTPI AGVLHRFPAK LTAEQFAGLL 

151 RPLAPRLYSI SSSQAEAGDE VHLTVGAVRF EHEGRARAGG ASGFFADRLE 

201 EDGTVRVFAE RNDGFRLPED SRKPIVMIGS GTGVAPFRAF VQQRAAENAE 

251 GRNWLIFGNP HFAADFLYQT EWQQFAKDGF L.HRYDFAWSR DQEEKIYVQD 

301 KIREQAEGLW QWLQEGAHIY VCGDAAKMAK EVEAALLDVI IGAGHSDEDG 

3 51 AEGYLDMLRE EKRYQRDVY* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 609>: 

ml50 . seq 

1 ATGCAGAACA CAAATCCGCC ATTACCGCCT CTGCCGCCCG AAATCACGCA 

51 GCTCCTGTCG GGGCTGGACG CGGCACAATG GGCGTGGCTG TCCGGCTACG 

101 CTTGGGCAAA AGCAGGAAAC GGGGCATCTG CAGGACTGCC CGCGCTTCAG 

151 ACGGCATTGC CGGCGGCAGA ACCTTTTTCC GTAACCGTCC TTTCCGCCTC 

201 GCAAACCGGC AATGCGAAAT CCGTTGCCGA CAAAGCGGCG GACAGCCTGG 

251 AAGCCGCCGG CATCCAAGTC AGTCGCGCCG AACTGAAAGA CTATAAGGCG 

301 AAAAACATCG CCGGCGAACG CCGCCTGCTG CTGGTT AC CT CCACCCAAGG 

351 CGAAGGCGAA CCGCCGAAAG AAGCCGTCGT GCTGCACAAA CTGCTGAACG 

401 GCAAAAAAGC CCCGAAATTG GACAAACTCC AATTTGCCGT ACTGGGTTTG 
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451 GGCGACAGTT CCTATCCGAA TTTCTGTCAG GCAGGTAAAG ATTTCGACCG 

501 GCGTTTTGAA GAATTGGGCG CAAAACGGCT GCTCGAACGC GTTGATGCGG 

551 ATTTGGACTT TACCGCCTCC GCAAACGCCT GGACAGATAA TATCGCCGCA 

601 CTCTTAAAAG AAGAAGCCGC AAAAAACCGG GCAACGCCCG CGCCGCAGAC 

651 AACGCCCCCC GCCGGCCTTC AGACGGCACC GGATGGCAGG TACTGCAAGG 

701 CAGCCCCCTT TCCCGCCGCC CTGCTGGCCA AT CAG AAAAT CACCGCCCGC 

751 CAATCCGATA AAGACGTGCG CCACATCGAA ATCGATTTGA GCGGTTCGGA 

801 TTTGCACTAC CTCCCGGGCG ACGCGCTCGG CGTTTGGTTT GACAACGATC 

851 CGGCACTGGT CAGGGAAATC CTAGACCTGC TCGGCATCGA TCCGGCAACG 

901 GAAATACAGG CGGGCGGAAA GATGATGCCG GTTGCGCGCG CACTTTCATC 

951 TCATTTCGAA CTCACGCAAA ACACTCCGGC TTTCGTCAAA GGCTATGCCG 

1001 CGTTCGCCCA TTATGAAGAA CTCGATAAAA TCATTGCCGA TAACGCCGTT 

1051 TTGCAGGATT TCGTGCAAAA CACGCCTATT GTCGATGTGC TGCACCGCTT 

1101 CCCGGCAAGC CTGACGGCAG AACAATTCAT CCGTTTACTG CGTCCGCTTG 

1151 CACCCCGTTT GTATTCGATT TCTTCAGCAC AGGCGGAAGT GGGCGATGAA 

1201 GTGCATTTAA CTGTCGGCGT GGTTCGTTTT GAACACGAAG GCCGCGCCAG 

1251 AACGGGCGGC GCATCGGGTT TCCTTGCCGA CCGGCTGGAA GAGGACGGCA 

13 01 CGGTGCGCGT GTTTGTGGAA CGCAACGACG GCTTCAGGCT GCCCGAAGAC 

13 51 AGCCGCAAGC CGATTGTGAT GATCGGCTCG GGCACCGGCG TCGCACCGTT 

14 01 CCGCGCTTTC GTCCAACAAC GTGCCGCAGA AAATGCGGAA GGCAAAAACT 

14 51 GGCTGATTTT CGGCAATCCG CATTTTGCCC GTGATTTTCT CTATCAAACC 

15 01 G AATGGC AG C AGTTTGCCAA AGACGGCTTC CTGCACAGGT ACGATTTCGC 
1551 CTGGTCCCGC GATCAGGAAG AAAAAATCTA TGTGCAGGAC AAAAT C CG CG 

16 01 AACAGGCGGA AGGACTTTGG CAATGGCTGC AGGAAGGCGC GC AT AT CT AT 
16 51 GTGTGCGGCG ATGCGGCAAA AATGG C AAAA GACGTGGAAG CCGCCTTGCT 
1701 GGATGTGATT ATCGGGGCAG GACATTTGGA CGAAGAGGGC GCAGAAGAAT 
1751 ATTTGGATAT GCTGCGCGAA GAAAAACGCT ATCAGCGTGA TGTTTATTGA 

This corresponds to the amino acid sequence <SEQ ID 610; ORF 150>: 

ml50 -pep 

1 MQNTNPPLPP LPPEITQLLS GLDAAQWAWL SGYAWAKAGN GASAGLPALQ 

51 TALPAAEPFS VTVLSASQTG NAKSVADKAA DSLEAAGIQV SRAELKDYKA 

101 KNIAGERRLL LVTSTQGEGE PPKEAWLHK LLNGKKAPKL DKLQFAVLGL 

151 GDSSYPNFCQ AGKDFDRRFE ELGAKRLLER VDADLDFTAS ANAWTDNIAA 

201 LLKEEAAKNR ATPAPQTTPP AGLQTAPDGR YCKAAPFPAA LLANQKITAR 

251 QSDKDVRHIE IDLSGSDLHY LPGDALGVWF DNDPALVREI LDLLGIDPAT 

3 01 EIQAGGKMMP VARALSSHFE LTQNTPAFVK GYAAFAHYEE LDKIIADNAV 

3 51 LQDFVQNTPI VDVLHRF PAS LTAEQFIRLL RPLAPRLYSI SSAQAEVGDE 

4 01 VHLTVGWRF EHEGRARTGG ASGFLADRLE EDGTVRVFVE RNDGFRLPED 
4 51 SRKPIVMIGS GTGVAPFRAF VQQRAAENAE GKNWLIFGNP HFARDFLYQT 
501 EWQQFAKDGF LHRYDFAWSR DQEEKI YVQD KIREQAEGLW QWLQEGAHIY 
551 VCGDAAKMAK DVEAALLDVI IGAGHLDEEG AEEYLDMLRE EKRYQRDVY* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 1 50 shows 91.3% identity over a 369 aa overlap with a predicted ORF (ORF 1 SO.ng) 
from N. gonorrhoeae: 

ml50/gl50 



ml 50 .pep 
gl50 



ml50 . pep 
g!50 



ml50 .pep 



210 220 230 240 250 260 

LLKEEAAKNRATPAPQTTPPAGLQTAPDGRYCKAAPFPAALLANQKITARQSDKDVRHIE 

Mil I 1 I II I I I 11 I I I I I I It I I I I I M 
YCKADPFPAALLANQKITARQSDKDVRHIE 
10 20 30 

270 280 290 300 310 320 

IDLSGSDLHYLPGDALGVWFDNDPALVREILDLLGIDPATEIQAGGKMMPVARALSSHFE 

I I I I II I M I I I I I I I I I t I I I I I M I I I II I I I I : I I I M I I I I I : I I I II 1 I I I 
IDLSGSDLHYLPGDALGVWFDNDPALVGEILDLLGINPATEIQAGGKTLPVASALLSHFE 
40 50 60 70 80 90 

330 340 350 360 370 380 

LTQNTPAFVKGYAAFAHYEELDKI I ADNAVLQDFVQNTPIVDVLHRFPAS LTAEQFIRLL 



WO 99/057280 



428 



PCT/US99/09346 



It | M | | | I II I I : I I : I I I : I I I i I I I 1 I I I : I I I : I I t I I I I : I I M I I II 
gl50 LTQNTPAFVKGYAT FADNDELDRIAADNAVLQG FVQSTPIAGVLHRFPAKLTAEQFAGLL 

100 110 120 130 140 150 

390 400 410 420 430 440 

ml50 .pep RPLAPRLYSI SSAQAEVGDEVHLTVGWRFEHEGRARTGGASGFLADRLEEDGTVRV FVE 

I I | I I I I I I II I : i M : I I M I I I I I : I I I I M I I II : t I 1 I I I : M I M I I I I II I I : I 
g 150 RPLAPRLYSI SSSQAEAGDEVHLTVGAVRFEHEGRARAGGASGFFADRLEEDGTVRVFAE 

160 170 180 190 200 210 

450 460 470 480 490 500 

ml 50. pep RNDGFRLPEDSRKPIVMIGSGTGVAPFRAFVQQRAAENAEGKNWLIFGNPHFARDFLYQT 
M I I I I M I I I It I I I I M I I 1 I I I I I It I I I I I II I It I I : I t I I I I M I I I I I I I I I 
g 150 RNDGFRLPEDSRKPIVMIGSGTGVAPFRAFVQQRAAENAEGRNWLIFGNPHFAADFLYQT 

220 230 240 250 260 270 

510 520 530 540 550 560 

ml50 . pep EWQQFAKDGFLHRYDFAWSRDQEEKIYVQDKIREQAEGLWQWLQEGAHIYVCGDAAKMAK 
I I I I I I I It I I I M I I I I It I I I I M M M I I I I I I I I I I I I I I I I I I I I I I I M I I I I I 
gl50 EWQQFAKDGFLHRYDFAWSRDQEEKIYVQDKIREQAEGLWQWLQEGAHIYVCGDAAKMAK 

280 290 300 310 320 330 

570 580 590 600 

ml 50 . pep DVEAALLDVIIGAGHLDEEGAEEYLDMLREEKRYQRDVYX 
: I I I I M I I I I I II I I I : I II I I I I I I I I I I i I I M I I 
gl50 EVEAALLDVI IGAGHS DEDGAEGYLDMLREEKRYQRDVYX 

340 350 360 370 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 61 1>: 

al50 . seq 

1 ATGCAGAACA CAAATCCGCC ATTACCGCCT ATGCCGCCCG AAATCACGCA 

51 GCTCCTGTCG GGGCTGGACG CGGCACAATG GGCGTGGCTG TCCGGCTACG 

101 CTTGGGCAAA AGCAGGAAAC GGGGCATCTG CAGGACTGCC CGCGCTTCAG 

151 ACGGCATTGC CGACGGCAGA ACCTTTTTCC GTAACCGTCC TTTCCGCCTC 

201 GCAAACCGGC AATGCGAAAT CCGTTGCCGA CAAAGCGGCG GACAGCCTGG 

251 AAGCCGCCGG CATCCAAGTC AGTCGCGCCG AACTGAAAGA CTATAAGGCG 

301 AAAAAC AT CG CCGGCGAACG CCGCCTGCTG CTGGTTACCT CCACCCAAGG 

351 CGAAGGCGAA CCGCCGGAAG AAGCCGTCGT GCTGCACAAA CTGCTGAACG 

4 01 GCAAAAAAGC CCCGAAATTG GACAAACTCC AATTTGCCGT ACTGGGTTTG 

4 51 GGCGACAGCT CCTATCCGAA TTTCTGCCGG GCGGGCAAAG ATTTCGACAA 

501 ACGTTTTGAA GAATTGGGCG CAAAACGCCT GCTCGAACGC GTTGATGCGG 

551 ATTTGGACTT TGCCGCCGCC GCAGACGGAT GGACAGATAA TATCGCCGCA 

601 CTCTTAAAAG AAGAAGCCGC AAAAAACCGG GCAACGCCCG CGCCGCAGAC 

651 AACGCCCCCC GCCGGCCTTC AGACGGCACC GGATGGCAGG TACTGCAAGG 

7 01 CAGACCCCTT TCCCGCCGCC CTGCTGGCCA AT C AG AAAAT CACCGCCCGC 

7 51 CAATCCGATA AAGACGTGCG CCACATCGAA ATCGATTTGA GCGGTTCGGA 

801 TTTGCACTAC CTCCCGGGCG ACGCGCTCGG CGTTTGGTTT GACAACGATC 

851 CGGCACTGGT CAGGGAAATC CTAGACCTGC TCGGCATCGA TCAGGCAACG 

901 GAAATACAGG CGGGCGGAAA AACCCTGCCG GTTGCCTCCG CACTGTTATC 

951 CCATTTTGAA CTCACGCAAA ACACCCCCGC CTTTGTCAAA GGCTATGCCC 

1001 CGTTCGCCGA TGATGACGAA CTCGACCGTA TTGCTGCCGA CAACGCCGTT 

1051 TTGCAAGGCT TTGTGCAAAG CACGCCGATT GCCGATGTGC TGCACCGCTT 

1101 CCCGGCAAAA CTGACAGCGG AACAATTCGC CGGCCTACTG CGCCCGCTTG 

1151 CGCCGCGCCT GTATTCGATT TCCTCGTCGC AGGCGGAAGT GGGGGACGAA 

1201 GTGCACCTGA CCGTCGGCGC GGTGCGTTTC GAACACGAAG GGCGCGCCAG 

1251 GGCGGGCGGC GCATCGGGTT TCCTTGCCGA CCGGCTGGAA GAGGACGGCA 

1301 CGGTGCGCGT GTTTGTGGAA CGCAACGACG GCTTCAGGCT GCCCGAAGAC 

1351 AGCCGCAAGC CGATTGTGAT GATCGGCTCG GGCACCGGCG TCGCACCGTT 

14 01 CCGCGCTTTC GTCCAACAAC GTGCCGCAGA AAATGCGGAA GGCAAAAACT 

14 51 GGCTGTTTTT CGGCAATCCG CATTTTGCCC GTGATTTTCT CTATCAAACC 

1501 GAATGGCAGC AGTTTGCCAA AGACGGCTTC CTGCACAGAT ACGATTTCGC 

1551 CTGGTCGCGC GATCAGGAAG AAAAAATCTA TGTGCAGGAC AAAATCCGCG 

1601 AACAGGCGGA AGGACTTTGG CAATGGCTGC AGGAAGGCGC GCATATCTAT 

1651 GTGTGCGGCG ATGCGGCAAA AATGGCAAAA GACGTGGAAG CCGCCTTGCT 
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17 01 GGATGTGATT ATCGGGGCAG GACATTTGGA CGAAGAGGGC GCAGAAGAAT 
17 51 ATTTGGATAT GCTGCGCGAA GAAAAACGCT ATCAGCGTGA TGTTTATTGA 



This corresponds to the amino acid sequence <SEQ ID 612; ORF 150.a>: 

al50 . pep 



1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 



MQNTNPPLPP 
TALPTAEPFS 
KNIAGERRLL 
GDSSYPN FCR 
LLKEEAAKNR 
QSDKDVRHIE 



MPPEITQLLS GLDAAQWAWL SGYAWAKAGN 
VTVLSASQTG NAKSVADKAA DSLEAAGIQV 
PPEEAWLHK LLNGKKAPKL 



LVTSTQGEGE 
AGKDFDKRFE 
ATPAPQTTPP 
IDLSGSDLHY 



EIQAGGKTLP VASALLSHFE 
LQGFVQSTPI ADVLHRFPAK 



ELGAKRLLER VDADLDFAAA 
AGLQTAPDGR YCKADPFPAA 
LPGDALGVWF DNDPALVREI 
LTQNTPAFVK GYAPFADDDE 



LTAEQFAGLL 



VHLTVGAVRF EHEGRARAGG ASGFLADRLE 
SRKPIVMIGS GTGVAPFRAF VQQRAAENAE 
EWQQFAKDGF LHRYDFAWSR DQEEKIYVQD 
VCG DAAKMAK DVEAALLDVI IGAGHLDEEG 



RPLAPRLYSI 
EDGTVRVFVE 
GKNWLFFGNP 
KIREQAEGLW 
AEEYLDMLRE 



GASAGLPALQ 
SRAELKDYKA 
DKLQFAVLGL 
ADGWTDNIAA 
LLANQKITAR 
LDLLGIDQAT 
LDRIAADNAV 
SSSQAEVGDE 
RNDGFRLPED 
HFARDFLYQT 
QWLQEGAHIY 
EKRYQRDVY* 



ml 50/al 50 94.8% identity in 599 aa overlap 

10 20 30 40 50 60 

ml 50 .pep MQNTNPPLPPLPPEITQLLSGLDAAQWAWLSGYAWAKAGNGASAGLPALQTALPAAEPFS 
| | | | | | | | M : I I I I I I i I I I I I I I I I i I I I I I I I I If I I I I I I I I I I M I 1 II : i I t I t 
a 150 MQNTNPPLPPMPPEITQLLSGLDAAQWAWLSGYAWAKAGNGASAGLPALQTALPTAEPFS 
10 20 30 40 50 60 



70 80 90 100 110 120 

ml50 . pep VTVLSASQTGNAKSVADKAADSLEAAGIQVSRAELKDYKAKNIAGERRLLLVTSTQGEGE 
| | | | | | | | | | | I I I ! I I I I I I I I I I I I I I I II I I II I II I I I I I I I I I I I I I I I I I I I I I 
a 150 VTVLSASQTGNAKSVADKAADSLEAAGIQVSRAELKDYKAKNI AGERRLLLVTSTQGEGE 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml50 pep PPKEAVVLHKLLNGKKAPKLDKLQFAVLGLGDSSYPNFCQAGKDFDRRFEELGAKRLLER 
I I : I 1 I I I I I I I I I I i I i I I i I t I I I I I I I I I I I I I I I I : I I I I I I : I I I I 1 I 1 I I I I 1 I 
a 150 PPEEAVVLHKLLNGKKAPKLDKLQFAVLGLGDSSYPNFCRAGKDFDKRFEELGAKRLLER 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml50 . pep VDADLDFTASANAWTDNIAALLKEEAAKNRATPAPQTTPPAGLQTAPDGRYCKAAPFPAA 
I I I I I I I : I : I :: I I I I I t II I I I I I I I I I I I I I M I I I I II I I I I I I I I I I I I Mill 
al50 VDADLDFAAAADGWTDNIAALLKEEAAKNRATPAPQTTPPAGLQTAPDGRYCKADPFPAA 

190 200 210 220 230 240 



250 260 270 280 290 300 

ml50 . pep LLANQKITARQSDKDVRHIEIDLSGSDLHYLPGDALGVWFDNDPALVREILDLLGIDPAT 
I I I 1 I 1 I I I I I I I I I I I I I I I I I I I I I I ) I I I II I I I I I I I I I II I I I I I i I I I M I II 
a 150 LLANQKITARQSDKDVRHIE I DLSGSDLHYLPGDALGVWFDNDPALVREI LDLLGIDQAT 

250 260 270 280 290 300 



310 320 330 340 350 360 

ml 50 .pep EIQAGGKMMPVARALSSHFELTQNTPAFVKGYAAFAHYEELDKIIADNAVLQDFVQNTPI 
I I I I I I I : I I I M I I I I I I I I I I IE I I I 1 I II : I I I : I M I I I I I I I I : I I I 
a lS0 EIQAGGKTLPVASALLSHFELTQNTPAFVKGYAPFADDDELDRIAADNAVLQGFVQSTPI 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml50 pep VDVLHRFPASLTAEQFIRLLRPLAPRLYSISSAQAEVGDEVHLTVGVVRFEHEGRARTGG 
: | I | I I I I I : I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I t : I I I I I I I I I I : I I 
a l50 ADVLHRFPAKLTAEQFAGLLRPLAPRLYSISSSQAEVGDEVHLTVGAVRFEHEGRARAGG 

370 380 390 400 410 420 

430 440 450 460 470 480 

ml 50 .pep ASG FLADRLEE DGT VRV FVERN DGFRLPEDSRKP I VMIGSGTGVAPF RAF VQQRAAENAE 

I I | I I I I I I I I I M II I I I I I I II I I I I I I I I II I I I I I I I I I I t I I I I I I I I I I I I I I I 
al50 ASG FLADRLEEDGTVRVFVERN DGFRLPEDSRKP I VMIGSGTGVAPFRAFVQQRAAENAE 

430 440 450 460 470 480 

490 500 510 520 530 540 

ml 50 . pep GKNWLIFGNPHFARDFLYQTEWQQFAKDGFLHRYDFAWSRDQEEKIYVQDKIREQAEGLW 
I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I 
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a 150 GKNWLFFGNPHFARDFLYQTEWQQFAKDGFLHRYDFAWSRDQEEKIYVQDKIREQAEGLW 
490 500 510 520 530 540 

550 560 570 580 590 600 

ml50 . pep QWLQEGAHI YVCGDAAKMAKDVEAALLDVI IGAGHLDEEGAEEYLDMLREEKRYQRDVYX 

I t I I I I t I I I I I I I I I I t I 1 I t I I I I I 1 I M I I I f It I I I f I I I I I I I 1 I I I I I I I t f I I 
a 150 QWLQEGAHI YVCGDAAKMAKDVEAALLDVI IGAGHLDEEGAEEYLDMLREEKRYQRDVYX 

550 560 570 580 590 600 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ED 61 3>: 

gl51 . seq 

1 ATGAAACAAA TCCGCAACAT CGCCATCATC GCACACGTCG ACCACGGCAA 

51 AACCACATTG GTCGACCAAC TGCTGCGCCA AT C CGGC AC A TTCCGCGCCA 

101 ACCAGCAGGT TGACGAGCGC GTGATGGACA GCAACGACCT TGAAAAAGAA 

151 CG CGGC AT C A CCATCCTCGC CAAAAACACC GCCATCGATT ACGAAGGCTG 

2 01 CCACATCAAT ATCGTCGACA CGCCGGGACA CGCCGACTTC GGCGGCGAAG 

2 51 TGGAGCGCGT TTTGGGGATG GTGGATTGCG TCGTCTTGTT GGTGGACGCA 

3 01 CAGGAAGGTC CGATGCCGCA AACCCGTTTC GTGACCAAAA AAGCCTTGGC 

3 51 TTTGGGGCTG AAACCGATTG TCGTCATCAA CAAAATCGAC AAACCGTCCG 

4 01 CCCGTCCGAG CTGGGTTATC GAC CAGACTT TCGAGTTGTT CGACAACTTG 
451 GGTGCGACCG ACGAGCAGTT GGATTTCCCG ATTGTTTACG CTTCAGGTTT 
501 GAGCGGCTTT GCCAAGCTGG AAGAAAccga CGAGAGCAG C GATATGCGCC 
551 CGCtgttcgA CACCATCCTA AAATACAcgc CtgCACCGAG CGGCAGCGCG 
601 GACGAGCCGC TGCAACTGCA AATTTCCCAA CTCGACTACG ACAACTACAC 
651 CGGCCGCCTC GGTATCGGTC GTATCTTGAA CGGACGCATC AAACCCGGCC 
701 AAAC CGTTGC CGTGATGAAC CACGAGCAGC AAATCGCCCA AGGCCGCATC 
751 AACCAGCTTT TGGGTTTCAA AGG CTTGGAA CGCGTGCCGC TTGAAGAAGC 
8 01 CGAAGCCGGC GACATTGTGA TTATTTCCGG TAT C GAAG AC AT CGG C AT CG 
8 51 GCGTAACCAT CAC CGAC AAA GACAACCCCA AAGGCCTGCC GATGTTGAGC 
901 GTGGACGAAC CGACGCTGAC GATGGACTTT ATGGTAAACA CCAGCCCGCT 
951 CGCAGGTACA GAAGGCAAAT T CGTGAC C AG CCGCCAAATC CGCGACCGCC 

1001 TGCAAAAAGA ATTGCTGACC AACGTTGCCC TGCG CGTGG A AGACACCGCC 

10 51 GatgCCGACG TGTTCCGCGT ATCcgGGCGC GGCGAACTGC ACCTGACGAT 

1101 TTTGCTGGAA AATATGCGCC GCGAAGGCTA CGAACTCGCC GTCGGCAAGC 

1151 CGCGCGTCGT GT AC CG AG AC ATCGACGGTC AAAAATGCGA ACCTTATGAA 

1201 AACCTGACTG TGGACGTACc cgacgacAAC CAAGGCGCGG TAATGGAAGA 

12 51 ACTCGGCCGC CGCCGTGGCG AACTGACCAA TATGGAAAGC GACGGCAACG 

13 01 GacgCACCCG CCTCGAATAC CATATTCCAG CGCGCGGCTT GATCGGTTTC 

13 51 CAAGGCGAAT TCATGACCCT GACGCGCGGC GTCGGGCTGA TGAgccacGT 

14 01 GTTcgacgac tacgcgcccg tcaAACCCGA TATGCCCGGC CGCCACAACG 
14 51 GCGTactggt GtCCCaAGAG CAGGGCGAGG CGGTTGCTTA CGCCTTGTGG 
1501 AAT CTTGAAG ACCGCGGCCG TATGTTCGTA TCGCCCAACG ACAAAATCTA 
1551 CGAAGGTATG ATTATCGGCA TCCACAGCCG CGACAACGAT TTGGTGGTCA 
16 01 ACCCGCTCAA AGGCAAAAAA CT CAC C AAT A TCCGTGCCAG CGGTACCGAC 

16 51 GAAG CGGTGC GCCTGACCAC GC CGAT CAAA CTGAcgcTGG AAGGCGCGGT 

17 01 CGAGTTTATC GACGATGACG AGCTGGTGGA AATCACGCCG CAAtccatcc 
1751 gcctgcgcat gcgttacctG AGCGaattgg aacgccgccg tcaTTTTAAA 
1801 AagctgGATT AA 

This corresponds to the amino acid sequence <SEQ ID 614; ORF 151.ng>: 

gl51 -pep 

1 MKQIRNIAII AHVDHGKTTL VDQLLRQSGT FRANQQVDER VMDSNDLEKE 

51 RGITILAKNT AIDYEGCHIN IVDTPGHADF GGE VERVLGM VDCWLLVDA 

101 QEGPMPQTRF VTKKALALGL KPIWINKID KPSARPSWVI DQTFELFDNL 

151 GATDEQLDFP IVYASGLSGF AKLEETDESS DMRPLFDTIL KYTPAPSGSA 

201 DEPLQLQISQ LDYDNYTGRL GIGRILNGRI KPGQTVAVMN HEQQIAQGRI 

251 NQLLGFKGLE RVPLEEAEAG D IVIISGIED IGIGVTIT DK DNPKGLPMLS 

3 01 VDEPTLTMDF MVNTSPLAGT EGKFVTSRQI RDRLQKELLT NVALRVED T A 

3 51 DADVFRVSGR GELHLTILLE NMRREGYELA VGKPRWYRD IDGQKCEPYE 

4 01 NLTVDVPDDN QGAVMEELGR RRGELTOMES DGNGRTRLEY HIPARGLIGF 
4 51 QGEFMTLTRG VGLMS HV FDD YAPVKPDMPG RHNGVLVSQE QGEAVAYALW 
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501 NLEDRGRMFV SPNDKIYEGM I IGIHSRDND LWNPLKGKK LTNIRASGTD 

551 EAVRLTTPIK LTLEGAVEFI DDDE LVE I TP QSIRLRMRYL SELERRRHFK 

601 KLD* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 61 5>: 

m!51 . seq 

1 ATGAAACAAA TCCGCAACAT CGCCATCATC GCCCACGTCG ACCACGGCAA 

51 AACCACATTG GTCGACCAAC TGCTGCGCCA ATCCGGCACA TTCCGCGCCA 

101 ACCAGCAGGT TGACGAGCGC GTGATGGACA GCAACGACCT TGAAAAAGAA 

151 CGCGGCATCA CCATCCTCGC CAAAAACACC GCCATCGATT ACGAAGGCTA 

201 CCACATCAAT ATCGTCGACA CGCCGGGACA CGCCGACTTC GGCGGCGAAG 

251 TAGAGCGCGT TTTGGGGATG GTGGACTGCG TCGTCTTGTT GGTGGACGCG 

3 01 CAGGAAGGCC CGATGCCGCA AACCCGTTTC GTGAC CAAAA AAGCCTTGGC 
351 TTTGGGGCTG AAACCGATTG TCGTCATCAA CAAAATCGAC AAGCCGTCCG 

4 01 CTCGTCCGAG CTGGGTTATC GACCAAACTT TCGAGCTGTT CGACAACTTG 
4 51 GGCGCGACCG ACGAGCAGTT GGATTTCCCG ATTGTTTACG CTTCAGGGTT 
501 GAGCGGTTTC GCCAAATTGG AAGAAACCGA CGAGAGCAAC GACATGCGTC 
551 CGCTGTTCGA TACTATCTTA AAAT AT AC GC CTGCACCGAG CGGCAGCGCG 
6 01 GACGAAACGC TGCAACTGCA AATTTCCCAA CTCGACTACG ACAACTACAC 
651 CGGCCGCCTC GGTATCGGTC GTATCTTGAA CGGACGCATC AAACCCGGCC 
701 AAACCGTTGC CGTCATGAAC CACGATCAGC AAATCGCCCA AGGCCGCATC 
751 AACCAGCTTT TGGGTTTCAA AGGTTTGGAA CGCGTGCCGC TTGAAGAAGC 
801 CGAAGCCGGC GACATCGTGA TTATTTCCGG TATCGAAGAC ATCGGTATCG 
851 GCGTAACCAT CACCGACAAA GACAATCCCA AAGGCCTACC GATGTTGAGC 
901 GTGGACGAAC CGACGCTGAC GATGGACTTT ATGGTCAACA CCAGCCCGCT 
951 GGCGGGTACG GAAGGCAAAT TCGTAACCAG CCGCCAAATC CGCGACCGCC 

1001 TGCAAAAAGA ATTGCTGACC AACGTCGCCC TGCGCGTGGA AGATACCGCC 

1051 GATGCCGACG TGTTCCGCGT ATCCGGGCGC GGCGAGCTGC ACCTGACCAT 

1101 TTTGCTGGAA AACATGCGCC GCGAAGGCTA CGAACTCGCC GTCGGCAAAC 

1151 CGCGCGTCGT GTACCGCGAC ATCGACGGTC AAAAATGCGA ACCGTATGAA 

1201 AACCTGACCG TGGATGTACC CGACGACAAC CAAGGCGCGG TAATGGAAGA 

1251 ACTCGGCCGC CGCCGTGGCG AACTGACTAA TATGGAAAGC GACGGCAACG 

13 01 GACGCACCCG CCTCGAATAC CAT ATT C C AG CGCGCGGCTT GATCGGTTTC 

13 51 CAAGGCGAAT TTATGACCCT GACGCGCGGG GTCGGGCTGA TGAG CCACGT 

14 01 GTTCGACGAT TACGCGCCCG TCAAACCCGA TATGCCCGGC CGCCACAACG 

14 51 GCGTGCTGGT GT CCCAAG AG CAGGGCGAGG CAGTCGCTTA CGCCTTGTGG 

15 01 AATCTGGAAG ACCGCGGCCG TATGTTCGTA TCGCCCAACG AC AAAAT CT A 
1551 CGAAGGCATG ATTATCGGCA TCCACAGTCG CGACAACGAT TTGGTGGTCA 

16 01 ACCCGCTCAA AGGCAAAAAA CTTACCAACA TCCGTGCCAG CGGTACCGAC 
1651 GAAGCCGTTC GCCTGACCAC GCCAATCAAG CTGACGCTGG AAGGTGCGGT 
1701 TGAGTTTATC GACGATGACG AACTCGTTGA AATCACGCCG CAATCCATCC 
1751 GTCTGCG CAA GCGTTACTTG AG CGAATTGG AACGCCGCCG CCACTTTAAA 
1801 AAGCTGGATT GA 

This corresponds to the amino acid sequence <SEQ ID 616; ORF 15 1>: 

ml 51 . pep 

1 MKQIRNIAII AHVDHGKTTL VDQLLRQSGT FRANQQVDER VMDSNDLEKE 

51 RGITILAKNT AIDYEGYHIN IVDTPGHADF GGE VERVLGM VDCWLLVDA 

101 QEGPMPQTRF VTKKALALGL KPIWINKID KPSARPSWVI DQTFELFDNL 

151 GATDEQLDFP I VYASGLSGF AKLEETDESN DMRPLFDTIL KYTPAPSGSA 

201 DETLQLQISQ LDYDNYTGRL GIGRILNGRI KPGQTVAVMN HDQQIAQGRI 

251 NQLLGFKGLE RVPLEEAEAG D IVIISGIED IGIGVTIT DK DNPKGLPMLS 

3 01 VDEPTLTMDF MVNTS PLAGT EGKFVTSRQI RDRLQKELLT NVALRVEDTA 

3 51 DADVFRVSGR GELHLTILLE NMRREGYELA VGKPRWYRD IDGQKCEPYE 

4 01 NLTVDVPDDN QGAVMEELGR RRGELTNMES DGNGRTRLEY HIPARGLIGF 
4 51 QGEFMTLTRG VGLMSHVFDD YAPVKPDMPG RHNGVLVSQE QGEAVAYALW 
501 NLEDRGRMFV SPNDKIYEGM I IGIHSRDND LWNPLKGKK LTNIRASGTD 
551 EAVRLTTPIK LTLEGAVEFI DDDELVEITP QSIRLRKRYL SELERRRHFK 
601 KLD* 

Computer analysis of this amino acid sequence gave the following results: 



